EconPapers    
Economics at your fingertips  
 

Unveiling the impact of machine learning algorithms on the quality of online geocoding services: a case study using COVID-19 data

Batuhan Kilic (), Onur Can Bayrak (), Fatih Gülgen (), Mert Gurturk () and Perihan Abay ()
Additional contact information
Batuhan Kilic: Yildiz Technical University
Onur Can Bayrak: Yildiz Technical University
Fatih Gülgen: Yildiz Technical University
Mert Gurturk: Yildiz Technical University
Perihan Abay: Kanuni Sultan Süleyman Research and Training Hospital

Journal of Geographical Systems, 2024, vol. 26, issue 4, No 8, 622 pages

Abstract: Abstract In today's era, the address plays a crucial role as one of the key components that enable mobility in daily life. Address data are used by global map platforms and location-based services to pinpoint a geographically referenced location. Geocoding provided by online platforms is useful in the spatial tracking of reported cases and controls in the spatial analysis of infectious illnesses such as COVID-19. The first and most critical phase in the geocoding process is address matching. However, due to typographical errors, variations in abbreviations used, and incomplete or malformed addresses, the matching can seldom be performed with 100% accuracy. The purpose of this research is to examine the capabilities of machine learning classifiers that can be used to measure the consistency of address matching results produced by online geocoding services and to identify the best performing classifier. The performance of the seven machine learning classifiers was compared using several text similarity measures, which assess the match scores between the input address data and the services' output. The data utilized in the testing came from four distinct online geocoding services applied to 925 addresses in Türkiye. The findings from this study revealed that the Random Forest machine learning classifier was the most accurate in the address matching procedure. While the results of this study hold true for similar datasets in Türkiye, additional research is required to determine whether they apply to data in other countries.

Keywords: Address matching; COVID-19; Geocoding; Machine learning; Random forest (search for similar items in EconPapers)
JEL-codes: C45 C52 C53 I18 (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s10109-023-00435-8 Abstract (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:kap:jgeosy:v:26:y:2024:i:4:d:10.1007_s10109-023-00435-8

Ordering information: This journal article can be ordered from
http://www.springer. ... ce/journal/10109/PS2

DOI: 10.1007/s10109-023-00435-8

Access Statistics for this article

Journal of Geographical Systems is currently edited by Manfred M. Fischer and Antonio Páez

More articles in Journal of Geographical Systems from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-19
Handle: RePEc:kap:jgeosy:v:26:y:2024:i:4:d:10.1007_s10109-023-00435-8