A Group Feature Screening Procedure Based on Pearson Chi-Square Statistic for Biology Data with Categorical Response
Hanji He,
Jianfeng He,
Guangming Deng and
Nian-Sheng Tang
Journal of Mathematics, 2024, vol. 2024, 1-21
Abstract:
The analysis of biogenetic data makes an important contribution to the understanding of disease mechanisms and the diagnosis of rare diseases. In this analysis, the selection of significant features affecting the disease provides an effective basis for subsequent disease judgment and treatment direction. However, this is not a simple task as biogenetic data have challenges such as ultra-high dimensionality of potential features, imbalance of response variables, and genetic associations. This study focuses on the group structure in feature screening with biogenetic data. Specifically, group structure exists for biogenetic data, so we need to analyze the entire genome rather than individual strongly correlated genes. This study proposes a group feature screening method that considers group correlations using adjusted Pearson’s cardinality statistic to address this issue. The method can be applied to both continuous and discrete covariates. The performance of the proposed method is illustrated by simulation studies, where the proposed method performs well with imbalanced data and multicategorical responses. In the application of lung cancer diagnosis, the proposed method for imbalanced data categorization is impressive, and the dimension reduction using linear discriminant is still good.
Date: 2024
References: Add references at CitEc
Citations:
Downloads: (external link)
http://downloads.hindawi.com/journals/jmath/2024/9014764.pdf (application/pdf)
http://downloads.hindawi.com/journals/jmath/2024/9014764.xml (application/xml)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hin:jjmath:9014764
DOI: 10.1155/2024/9014764
Access Statistics for this article
More articles in Journal of Mathematics from Hindawi
Bibliographic data for series maintained by Mohamed Abdelhakeem ().