A supervised machine learning model for imputing missing boarding stops in smart card data
Nadav Shalit (),
Michael Fire () and
Eran Ben-Elia ()
Additional contact information
Nadav Shalit: Ben-Gurion University of the Negev
Michael Fire: Ben-Gurion University of the Negev
Eran Ben-Elia: Ben-Gurion University of the Negev
Public Transport, 2023, vol. 15, issue 2, No 1, 287-319
Abstract:
Abstract Public transport has become an essential part of urban existence with increased population densities and environmental awareness. Large quantities of data are currently generated, allowing for more robust methods to understand travel behavior by harvesting smart card usage. However, public transport datasets suffer from data integrity problems; boarding stop information may be missing due to imperfect acquirement processes or inadequate reporting. This study introduces a supervised machine learning method to impute missing boarding stops based on ordinal classification using GTFS timetable, smart card, and geospatial datasets. A new metric, Pareto Accuracy, is suggested to evaluate algorithms where classes have an ordinal nature. The results are based on a case study in the city of Beer Sheva, Israel, consisting of one month of smart card data. We show that our proposed method is robust to irregular travelers and significantly outperforms well-known imputation methods without the need to mine any additional datasets. The data validation from another Israeli city using transfer learning shows the presented model is general and context-free. The implications for transportation planning and travel behavior research are further discussed.
Keywords: Machine learning; Smart card; Boarding stop imputation; Public transport; Missing data; Pareto accuracy (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s12469-022-00309-0 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:pubtra:v:15:y:2023:i:2:d:10.1007_s12469-022-00309-0
Ordering information: This journal article can be ordered from
https://www.springer ... search/journal/12469
DOI: 10.1007/s12469-022-00309-0
Access Statistics for this article
Public Transport is currently edited by Stefan Voß
More articles in Public Transport from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().