Data Mining System Applied to Population Databases for Studies on Lung Cancer
J. Pérez (),
F. Henriques (),
R. Santaolaya (),
O. Fragoso () and
A. Mexicano ()
Additional contact information
J. Pérez: Centro Nacional de Investigación y Desarrollo Tecnológico
F. Henriques: Fundação Nacional de Saúde
R. Santaolaya: Centro Nacional de Investigación y Desarrollo Tecnológico
O. Fragoso: Centro Nacional de Investigación y Desarrollo Tecnológico
A. Mexicano: Centro Nacional de Investigación y Desarrollo Tecnológico
Chapter Chapter 13 in Data Mining for Biomarker Discovery, 2012, pp 227-246 from Springer
Abstract:
Abstract This work addresses the problem of finding the mortality distribution for lung cancer in Mexican districts, through clustering patterns discovery. A data mining system was developed which consists of a pattern generator and a visualization subsystem. Such an approach may contribute to biomarker discovery by means of identifying risk regions for a given cancer type and further reduce the cost and time spend in conducting cancer studies. The k-means algorithm was used for the generation of patterns, which permits expressing patterns as groups of districts with affinity in their location and mortality rate attributes. The source data were obtained from Mexican official institutions. As a result, a set of grouping patterns reflecting the mortality distribution of lung cancer in Mexico was generated. Two interesting patterns in northeastern and northwestern Mexico with high mortality rate were detected. We consider that patterns generated by the data mining system, can be useful for identifying high risk cancer areas and biomarkers discovery.
Keywords: Lung Cancer; Data Mining; Data Warehouse; Data Mining Technique; Lung Cancer Mortality (search for similar items in EconPapers)
Date: 2012
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:spochp:978-1-4614-2107-8_13
Ordering information: This item can be ordered from
http://www.springer.com/9781461421078
DOI: 10.1007/978-1-4614-2107-8_13
Access Statistics for this chapter
More chapters in Springer Optimization and Its Applications from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().