EconPapers    
Economics at your fingertips  
 

Data-Driven Leak Localization in Urban Water Distribution Networks Using Big Data for Random Forest Classifier

Ivana Lučin, Bože Lučin, Zoran Čarija and Ante Sikirica
Additional contact information
Ivana Lučin: Faculty of Engineering, University of Rijeka, Vukovarska 58, 51000 Rijeka, Croatia
Bože Lučin: Faculty of Engineering, University of Rijeka, Vukovarska 58, 51000 Rijeka, Croatia
Zoran Čarija: Faculty of Engineering, University of Rijeka, Vukovarska 58, 51000 Rijeka, Croatia
Ante Sikirica: Faculty of Engineering, University of Rijeka, Vukovarska 58, 51000 Rijeka, Croatia

Mathematics, 2021, vol. 9, issue 6, 1-14

Abstract: In the present paper, a Random Forest classifier is used to detect leak locations on two different sized water distribution networks with sparse sensor placement. A great number of leak scenarios were simulated with Monte Carlo determined leak parameters (leak location and emitter coefficient). In order to account for demand variations that occur on a daily basis and to obtain a larger dataset, scenarios were simulated with random base demand increments or reductions for each network node. Classifier accuracy was assessed for different sensor layouts and numbers of sensors. Multiple prediction models were constructed for differently sized leakage and demand range variations in order to investigate model accuracy under various conditions. Results indicate that the prediction model provides the greatest accuracy for the largest leaks, with the smallest variation in base demand (62% accuracy for greater- and 82% for smaller-sized networks, for the largest considered leak size and a base demand variation of ± 2.5 % ). However, even for small leaks and the greatest base demand variations, the prediction model provided considerable accuracy, especially when localizing the sources of leaks when the true leak node and neighbor nodes were considered (for a smaller-sized network and a base demand of variation ± 20 % the model accuracy increased from 44% to 89% when top five nodes with greatest probability were considered, and for a greater-sized network with a base demand variation of ± 10 % the accuracy increased from 36% to 77%).

Keywords: leak localization; water distribution network; random forest; prediction modeling; big data (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2021
References: View complete reference list from CitEc
Citations: View citations in EconPapers (2)

Downloads: (external link)
https://www.mdpi.com/2227-7390/9/6/672/pdf (application/pdf)
https://www.mdpi.com/2227-7390/9/6/672/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:9:y:2021:i:6:p:672-:d:521541

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:9:y:2021:i:6:p:672-:d:521541