Impact of Data Normalization on Classification Model Accuracy
Borkin Dmitrii (),
Némethová Andrea (),
Michaľčonok German () and
Maiorov Konstantin ()
Additional contact information
Borkin Dmitrii: Slovak University of Technology in Bratislava, Faculty of Materials Science and Technology in Trnava, Institute of Applied Informatics, Automation and Mechatronics, Ulica Jána Bottu Č. 2781/25, 917 24Trnava, Slovak Republic
Némethová Andrea: Slovak University of Technology in Bratislava, Faculty of Materials Science and Technology in Trnava, Institute of Applied Informatics, Automation and Mechatronics, Ulica Jána Bottu Č. 2781/25, 917 24Trnava, Slovak Republic
Michaľčonok German: Slovak University of Technology in Bratislava, Faculty of Materials Science and Technology in Trnava, Institute of Applied Informatics, Automation and Mechatronics, Ulica Jána Bottu Č. 2781/25, 917 24Trnava, Slovak Republic
Maiorov Konstantin: Kalashnikov Izhevsk State Technical University, Department of Computer Software, 4260069 Izhevsk, Ul. Studenčeskaja 7, Russian Federation
Research Papers Faculty of Materials Science and Technology Slovak University of Technology, 2019, vol. 27, issue 45, 79-84
Abstract:
In this paper, we present the impact of the data normalization on the classification model performance. In first part of this paper, we present the structure of our dataset, where we discuss the features of the data set and basic statistical analysis of the data. In this research, we worked with the medical data about the patients with the Parkinson disease. In second part of this paper, we present the process of data normalization and the impact of scaling data on the classification model performance. In this research, we used the XGBoost model as our classification model. The main classification task was to classify whether the patient is ill with Parkinson disease or not. Since the data set contains more numerical parameters of different scaling, the main aim of this paper was to investigate the impact of the data normalization (scaling) on the performance of the classification model.
Keywords: Data normalization; model accuracy; classification (search for similar items in EconPapers)
Date: 2019
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://doi.org/10.2478/rput-2019-0029 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:vrs:repfms:v:27:y:2019:i:45:p:79-84:n:11
DOI: 10.2478/rput-2019-0029
Access Statistics for this article
Research Papers Faculty of Materials Science and Technology Slovak University of Technology is currently edited by Kvetoslava Rešetová
More articles in Research Papers Faculty of Materials Science and Technology Slovak University of Technology from Sciendo
Bibliographic data for series maintained by Peter Golla ().