Predicting Sheet and Rill Erosion of Shihmen Reservoir Watershed in Taiwan Using Machine Learning
Kieu Anh Nguyen,
Walter Chen,
Bor-Shiun Lin,
Uma Seeboonruang and
Kent Thomas
Additional contact information
Kieu Anh Nguyen: Department of Civil Engineering, National Taipei University of Technology, Taipei 10608, Taiwan
Walter Chen: Department of Civil Engineering, National Taipei University of Technology, Taipei 10608, Taiwan
Bor-Shiun Lin: Disaster Prevention Technology Research Center, Sinotech Engineering Consultants, Taipei 11494, Taiwan
Uma Seeboonruang: Faculty of Engineering, King Mongkut’s Institute of Technology Ladkrabang, Bangkok 10520, Thailand
Kent Thomas: Department of Civil Engineering, National Taipei University of Technology, Taipei 10608, Taiwan
Sustainability, 2019, vol. 11, issue 13, 1-18
Abstract:
Shihmen Reservoir watershed is vital to the water supply in Northern Taiwan but the reservoir has been heavily impacted by sedimentation and soil erosion since 1964. The purpose of this study was to explore the capability of machine learning algorithms, such as decision tree and random forest, to predict soil erosion (sheet and rill erosion) depths in the Shihmen reservoir watershed. The accuracy of the models was evaluated using the RMSE (Root Mean Squared Error), MAE (Mean Absolute Error), and R 2 . Moreover, the models were verified against the multiple regression analysis, which is commonly used in statistical analysis. The predictors of these models were 14 environmental factors which influence soil erosion, whereas the target was 550 erosion pins installed at 55 locations (on 55 slopes) and monitored over a period of approximately three years. The data sets for the models were separated into 70% for the training data and 30% for the testing data, using the simple random sampling and stratified random sampling methods. The results show that the random forest algorithm performed the best of the three methods. Moreover, the stratified random sampling method had better results among the two sampling methods, as anticipated. The average error ( RMSE relative to 1:1 line) of the stratified random sampling method of the random forest algorithm is 0.93 mm/yr in the training data and 1.75 mm/yr in the testing data, respectively. Finally, the random forest algorithm predicted that type of slope, slope direction, and sub-watershed are the three most important factors of the 14 environmental factors collected and used in this study for splits in the trees and thus they are the three most important factors affecting the depth of sheet and rill erosion in the Shihmen Reservoir watershed. The results of this study can be employed by decision-makers to improve soil conservation planning and watershed remediation.
Keywords: soil erosion; sheet and rill erosion; erosion pins; random forest; decision tree; multiple regression; Shihmen reservoir; machine learning (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2019
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (4)
Downloads: (external link)
https://www.mdpi.com/2071-1050/11/13/3615/pdf (application/pdf)
https://www.mdpi.com/2071-1050/11/13/3615/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:11:y:2019:i:13:p:3615-:d:244684
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().