Examining trip-level errors in passively collected mobile device data for data quality assurance
Peiqi Zhang,
Kathleen Stewart and
Aref Darzi
PLOS ONE, 2025, vol. 20, issue 4, 1-17
Abstract:
Location-based service (LBS) data passively collected by mobile devices has been widely adopted in multiple fields for its advantages in revealing travel behaviors. Data quality assessments have always been important steps for analyses using the data, but the impact of trip-level errors has not been a focus of these assessments. We examine a newly emerged type of error present at trip-level in LBS datasets that violates the spatio-temporal consistency of such data by including trips on road segments where and when there should be no trips. We designed a distributed-computing workflow to quantify the errors by comparing the number of trips on closed road segments during road closures with time periods before and after. Using two real-world cases from 2023, we examined multiple datasets acquired from major vendors in the US, and several of the datasets contained a significant number of trip-level errors. These findings point to the errors being present in recent datasets that have not otherwise been processed for data quality and can significantly impact analyses by data users. Data users should consider conducting trip-level error data quality checks as part of their preprocessing steps.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0321970 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 21970&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0321970
DOI: 10.1371/journal.pone.0321970
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().