Evaluation of Nitrate Load Estimations Using Neural Networks and Canonical Correlation Analysis with K-Fold Cross-Validation
Kichul Jung,
Deg-Hyo Bae,
Myoung-Jin Um,
Siyeon Kim,
Seol Jeon and
Daeryong Park
Additional contact information
Kichul Jung: Department of Civil and Environmental Engineering, Konkuk University, Seoul 05029, Korea
Deg-Hyo Bae: Department of Civil and Environmental Engineering, Sejong University, Seoul 05006, Korea
Myoung-Jin Um: Department of Civil Engineering, Kyonggi University, Suwon 16227, Korea
Siyeon Kim: Department of Civil and Environmental Engineering, Konkuk University, Seoul 05029, Korea
Seol Jeon: Department of Civil and Environmental Engineering, Konkuk University, Seoul 05029, Korea
Daeryong Park: Department of Civil and Environmental Engineering, Konkuk University, Seoul 05029, Korea
Sustainability, 2020, vol. 12, issue 1, 1-17
Abstract:
The present work aimed to examine the feasibility of using artificial neural network (ANN) based models to obtain accurate estimates of nitrate loads in river basins, which is an important parameter for water quality management. Both Single ANN (SANN) and Ensemble ANN (EANN) models were used to obtain the load estimations for five river basins in the Midwest United States. These basins included the Cuyahoga, Raisin, Sandusky, Muskingum, and Vermilion basins in Michigan and Ohio. Further, canonical correlation analysis (CCA) was applied to the ANN models to improve the performance. The k-fold cross-validation method was then utilized to evaluate the proposed models based on two statistical indices, namely, the rRMSE and rBAIS , and the estimates were compared for four different k values (k = 3, 5, 7, and 10). According to the results, the EANN model seemed to produce better load estimations than the SANN model, and the CCA based EANN model tended to produce the best estimates among all of the proposed models in this study. The box plot data for the rRMSE index were also investigated, and the plot results indicated that increasing values of k tended to generate better estimates. Thus, the use of k = 10 is recommended for load estimations since this value was associated with better performances and less biased estimates.
Keywords: single artificial neural network; canonical correlation analysis; ensemble artificial neural network; k-fold cross-validation; load estimations; Midwest; nitrate (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (4)
Downloads: (external link)
https://www.mdpi.com/2071-1050/12/1/400/pdf (application/pdf)
https://www.mdpi.com/2071-1050/12/1/400/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:12:y:2020:i:1:p:400-:d:305064
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().