EconPapers    
Economics at your fingertips  
 

First Theory of Cancer Gene Data Analysis by 169 Microarrays—Four Universal Data Structures of Discriminant Data

Shuichi Shinmura ()

Chapter Chapter 5 in The First Discriminant Theory of Linearly Separable Data, 2024, pp 219-248 from Springer

Abstract: Abstract The new discriminant theory (Theory1) analyzed six microarrays having two classes. The Revised IP Optimal LDF (RIP) finds the minimum number of misclassifications (MNMs) and shows that all are zero, indicating that the six microarrays are linearly separable data (LSD). Program3 and Program4 coded in LINGO, can split microarrays into many exclusive Small Matryoshkas (SMs) and Basic Gene Sets (BGSs). SMs and BGSs allow us to discriminate between normal and cancer patients (or two cancer classes) with just a few genes. We confirmed these facts by analyzing 163 2nd-generation microarrays registered on the GSE database after 2007. The tenfold CV (Program2) finds SMs and BGSs and shows that many average error rates (ERs) for 10 validation samples (M2) are zero. Therefore, we consider these gene sets valuable for cancer gene diagnosis. Moreover, we find four universal data structures of microarrays (Fact3). Thus, we can establish the first cancer gene data analysis theory in 2020 (Theory2). This chapter explains the details of the SM decomposition of Golub data in Program3. Surprisingly, Program3 can find 723 constant-value genes in Golub’s data and omit them from the analysis. This result shows a convenient ability when other high-dimensional measurement values include meaningless constant values.

Keywords: The Minimum Number of Misclassifications (Minimum NMNM); Linearly Separable Data (LSD); Small Matryoshka (SM); Basic Gene Set (BGS); DF Gene Set by Logistic Regression; Four Universal Data Structures of LSD (Fact3); Six 1st-generation microarrays; 163 2nd-generation microarrays; Four LINGO Programs; Signal Data (search for similar items in EconPapers)
Date: 2024
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-981-99-9420-5_5

Ordering information: This item can be ordered from
http://www.springer.com/9789819994205

DOI: 10.1007/978-981-99-9420-5_5

Access Statistics for this chapter

More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2026-06-01
Handle: RePEc:spr:sprchp:978-981-99-9420-5_5