Use of Machine Learning Techniques in Soil Classification
Yaren Aydın,
Ümit Işıkdağ,
Gebrail Bekdaş (bekdas@iuc.edu.tr),
Sinan Melih Nigdeli and
Zong Woo Geem (geem@gachon.ac.kr)
Additional contact information
Yaren Aydın: Department of Civil Engineering, Istanbul University-Cerrahpaşa, 34320 Istanbul, Turkey
Ümit Işıkdağ: Department of Informatics, Mimar Sinan Fine Arts University, 34427 Istanbul, Turkey
Gebrail Bekdaş: Department of Civil Engineering, Istanbul University-Cerrahpaşa, 34320 Istanbul, Turkey
Sinan Melih Nigdeli: Department of Civil Engineering, Istanbul University-Cerrahpaşa, 34320 Istanbul, Turkey
Zong Woo Geem: Department of Smart City & Energy, Gachon University, Seongnam 13120, Republic of Korea
Sustainability, 2023, vol. 15, issue 3, 1-18
Abstract:
In the design of reliable structures, the soil classification process is the first step, which involves costly and time-consuming work including laboratory tests. Machine learning (ML), which has wide use in many scientific fields, can be utilized for facilitating soil classification. This study aims to provide a concrete example of the use of ML for soil classification. The dataset of the study comprises 805 soil samples based on the soil drillings of the new Gayrettepe–Istanbul Airport metro line construction. The dataset has both missing data and class imbalance. In the data preprocessing stage, first, data imputation techniques were applied to deal with the missing data. Two different imputation techniques were tested, and finally, the data were imputed with the KNN imputer. Later, a balance was achieved with the synthetic minority oversampling technique (SMOTE). After the preprocessing, a series of ML algorithms were tested with 10-fold cross-validation. Unlike the studies conducted in previous research, new gradient-boosting methods such as XGBoost, LightGBM, and CatBoost were tested, high classification accuracy rates of up to +90% were observed, and a significant improvement in the accuracy of prediction (when compared with previous research) was achieved.
Keywords: soil; machine learning; classification; ensemble learning (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2023
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2071-1050/15/3/2374/pdf (application/pdf)
https://www.mdpi.com/2071-1050/15/3/2374/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:15:y:2023:i:3:p:2374-:d:1049347
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager (indexing@mdpi.com).