Prediction of Geosmin at Different Depths of Lake Using Machine Learning Techniques
Yong-Su Kwon,
In-Hwan Cho,
Ha-Kyung Kim,
Jeong-Hwan Byun,
Mi-Jung Bae and
Baik-Ho Kim
Additional contact information
Yong-Su Kwon: EcoBank Team, Division of Ecological Information, National Institute of Ecology, Seocheon, Busan 33657, Chungcheongnam-do, Korea
In-Hwan Cho: Department of Environmental Science, Hanyang University, Seoul 04763, Korea
Ha-Kyung Kim: Department of Environmental Science, Hanyang University, Seoul 04763, Korea
Jeong-Hwan Byun: Han-River Environment Research Center, National Institute of Environmental Research, Yangpyeong-gun, Incheon 12585, Gyeonggi-do, Korea
Mi-Jung Bae: Biodiversity Research Team, Freshwater Biodiversity Research Bureau, Nakdonggang National Institute of Biological Resources, Sangju 37242, Gyeongsangbuk-do, Korea
Baik-Ho Kim: Department of Life Science, Hanyang University, Seoul 04763, Korea
IJERPH, 2021, vol. 18, issue 19, 1-13
Abstract:
Geosmin is a major concern in the management of water sources worldwide. Thus, we predicted concentration categories of geosmin at three different depths of lakes (i.e., surface, middle, and bottom), and analyzed relationships between geosmin concentration and factors such as phytoplankton abundance and environmental variables. Data were collected monthly from three major lakes (Uiam, Cheongpyeong, and Paldang lakes) in Korea from May 2014 to December 2015. Before predicting geosmin concentration, we categorized it into four groups based on the boxplot method, and multivariate adaptive regression splines, classification and regression trees, and random forest (RF) were applied to identify the most appropriate modelling to predict geosmin concentration. Overall, using environmental variables was more accurate than using phytoplankton abundance to predict the four categories of geosmin concentration based on AUC and accuracy in all three models as well as each layer. The RF model had the highest predictive power among the three SDMs. When predicting geosmin in the three water layers, the relative importance of environmental variables and phytoplankton abundance in the sensitivity analysis was different for each layer. Water temperature and abundance of Cyanophyceae were the most important factors for predicting geosmin concentration categories in the surface layer, whereas total abundance of phytoplankton exhibited relatively higher importance in the bottom layer.
Keywords: taste-and-odor compound; off-flavor material; species distribution models; random forest; vertical difference (search for similar items in EconPapers)
JEL-codes: I I1 I3 Q Q5 (search for similar items in EconPapers)
Date: 2021
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/1660-4601/18/19/10303/pdf (application/pdf)
https://www.mdpi.com/1660-4601/18/19/10303/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jijerp:v:18:y:2021:i:19:p:10303-:d:646956
Access Statistics for this article
IJERPH is currently edited by Ms. Jenna Liu
More articles in IJERPH from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().