Classifying Noisy and Incomplete Medical Data by a Differential Latent Semantic Indexing Approach
Liang Chen (),
Jia Zeng () and
Jian Pei ()
Additional contact information
Liang Chen: University of Northern British Columbia
Jia Zeng: University of Northern British Columbia
Jian Pei: Simon Fraser University
A chapter in Data Mining in Biomedicine, 2007, pp 169-176 from Springer
Abstract:
Abstract It is well-recognized that medical datasets are often noisy and incomplete due to the difficulties in data collection and integration. Noise and incompleteness in medical data post substantial challenges for accurate classification. A differential latent semantic indexing (DLSI) approach which is an improvement of the standard LSI method has been proposed for information retrieval and demonstrated improved performance over standard LSI approach. The key idea is that DLSI adapts to the unique characteristics of individual record/document. By experimental results on real datasets, we show that DLSI outperforms the standard LSI method on noisy and incomplete medical datasets. The results strongly indicate that the DLSI approach is also capable of medical numerical data analysis.
Keywords: Medical data classification; Latent semantic indexing; Differential latent semantic indexing (search for similar items in EconPapers)
Date: 2007
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:spochp:978-0-387-69319-4_10
Ordering information: This item can be ordered from
http://www.springer.com/9780387693194
DOI: 10.1007/978-0-387-69319-4_10
Access Statistics for this chapter
More chapters in Springer Optimization and Its Applications from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().