EconPapers    
Economics at your fingertips  
 

Evaluation of Nitrate Load Estimations Using Neural Networks and Canonical Correlation Analysis with K-Fold Cross-Validation

Kichul Jung, Deg-Hyo Bae, Myoung-Jin Um, Siyeon Kim, Seol Jeon and Daeryong Park
Additional contact information
Kichul Jung: Department of Civil and Environmental Engineering, Konkuk University, Seoul 05029, Korea
Deg-Hyo Bae: Department of Civil and Environmental Engineering, Sejong University, Seoul 05006, Korea
Myoung-Jin Um: Department of Civil Engineering, Kyonggi University, Suwon 16227, Korea
Siyeon Kim: Department of Civil and Environmental Engineering, Konkuk University, Seoul 05029, Korea
Seol Jeon: Department of Civil and Environmental Engineering, Konkuk University, Seoul 05029, Korea
Daeryong Park: Department of Civil and Environmental Engineering, Konkuk University, Seoul 05029, Korea

Sustainability, 2020, vol. 12, issue 1, 1-17

Abstract: The present work aimed to examine the feasibility of using artificial neural network (ANN) based models to obtain accurate estimates of nitrate loads in river basins, which is an important parameter for water quality management. Both Single ANN (SANN) and Ensemble ANN (EANN) models were used to obtain the load estimations for five river basins in the Midwest United States. These basins included the Cuyahoga, Raisin, Sandusky, Muskingum, and Vermilion basins in Michigan and Ohio. Further, canonical correlation analysis (CCA) was applied to the ANN models to improve the performance. The k-fold cross-validation method was then utilized to evaluate the proposed models based on two statistical indices, namely, the rRMSE and rBAIS , and the estimates were compared for four different k values (k = 3, 5, 7, and 10). According to the results, the EANN model seemed to produce better load estimations than the SANN model, and the CCA based EANN model tended to produce the best estimates among all of the proposed models in this study. The box plot data for the rRMSE index were also investigated, and the plot results indicated that increasing values of k tended to generate better estimates. Thus, the use of k = 10 is recommended for load estimations since this value was associated with better performances and less biased estimates.

Keywords: single artificial neural network; canonical correlation analysis; ensemble artificial neural network; k-fold cross-validation; load estimations; Midwest; nitrate (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (4)

Downloads: (external link)
https://www.mdpi.com/2071-1050/12/1/400/pdf (application/pdf)
https://www.mdpi.com/2071-1050/12/1/400/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:12:y:2020:i:1:p:400-:d:305064

Access Statistics for this article

Sustainability is currently edited by Ms. Alexandra Wu

More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jsusta:v:12:y:2020:i:1:p:400-:d:305064