Cephalo-Pelvic Disproportion Data with Collinearities
Shuichi Shinmura ()
Additional contact information
Shuichi Shinmura: Seikei University, Faculty of Economics
Chapter Chapter 3 in New Theory of Discriminant Analysis After R. Fisher, 2016, pp 57-80 from Springer
Abstract:
Abstract I discriminate the cephalo-pelvic disproportion (CPD) data. These data have a significant relationship with the Theory (1) We evaluated a heuristic OLDF by these data. However, we could only evaluate a six-variable model because our CPU power was poor and because of the limitations of a heuristic OLDF. Therefore, we could not extend our research. (2) These data consist of 240 patients with 19 independent variables. We specified three collinearities in these data and established how to remove such collinearities. (3) We found a strange trend of NMs by QDF and found that QDF is fragile for collinearities. Moreover, NM of Fisher’s LDF did not decrease in the 19 models from the one-variable model to the 19-variable model selected by the forward and backward stepwise procedure. On the other hand, NMs of our three MP-based optimal LDFs (OLDFs) decreased. (4) In the CPD data, we determined that a four-variable model is useful by the regression model selection procedure (plug-in rule1). However, the new model selection procedure that uses Method 1 recommends a nine-variable model as the best model. We believe that many variables and/or collinearities cause this difference. Because the Iris data have four variables and can satisfy Fisher’s assumption, the model selection procedure by regression analysis and the best model select the full model of seven LDFs, which are Revised IP-OLDF, Revised LP-OLDF, Revised IPLP-OLDF, SVM4 and SVM1, Fisher’s LDF, and logistic regression. This fact is the reason we should no longer use the Iris data as the evaluation data. (5) CPD data have many OCPs. This fact implies that Revised IP-OLDF can search for the several OCPs with the same MNMs and different coefficients groups that belong to different OCPs. This result means that it is difficult for us to evaluate the 95 % CI of discriminant coefficients. In this chapter, we solve these problems by the 100-fold cross-validation for small sample method (Method 1) and the best models of seven LDFs.
Keywords: Collinearities; Model selection procedure; Fisher’s linear discriminant function (Fisher’s LDF); Logistic regression; Support vector machine (SVM); A minimum number of misclassification (minimum NM MNM); Mean of error rate; Revised IP-OLDF; Revised IPLP-OLDF; Revised LP-OLDF (search for similar items in EconPapers)
Date: 2016
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:sprchp:978-981-10-2164-0_3
Ordering information: This item can be ordered from
http://www.springer.com/9789811021640
DOI: 10.1007/978-981-10-2164-0_3
Access Statistics for this chapter
More chapters in Springer Books from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().