Imputing trip purposes for long-distance travel
Yijing Lu and Lei Zhang
Transportation, 2015, vol. 42, issue 4, pages 581-595
Abstract:
Planning and policy analysis at the national, state, and inter-regional corridor levels depends on reliable information and forecasts about long-distance travel. Emerging passive data collection technologies such as GPS, smartphones, and social media give researchers and practitioners the opportunity to supplement or replace traditional long-distance travel surveys. However, certain important trip information, such as trip purpose, travel mode, and travelers’ socio-demographic characteristics, is missing from passively collected travel data. One promising solution to this data issue is to impute the missing information using supplementary data (e.g., land use) and advanced statistical or data mining algorithms. This paper develops machine learning methods, including decision tree and meta-learning, to estimate trip purposes for long-distance passenger travel. A passively collected long-distance trip dataset is simulated from the 1995 American Travel Survey for the development and validation of the machine learning methods. The predictive accuracy of the proposed methods is evaluated for several scenarios that vary in the number of trip purposes and in the extent of input data available. This research design not only provides a practically useful approach for long-distance trip purpose imputation, but also generates valuable insights for future long-distance travel surveys. Results show that the accuracy of the trip purpose imputation methods based on all available data decreases from 95% with two purposes (business and non-business) to 77% with four purposes (business, personal business, social visit, and leisure). Under the two-purpose scheme, the predictive accuracy of the imputation algorithms decreases from 95% when all input data is used (a full-information model) to 72% with a minimum-information model that uses only the passively collected data. If travelers’ socio-demographic characteristics are available (possibly through other imputation models), the predictive accuracy decreases only from 95% to 91%.
Copyright Springer Science+Business Media New York 2015
Keywords: Trip purpose imputation model; Long distance travel; Passively-collected spatial–temporal data; Travel survey methods; Machine learning
Date: 2015
Downloads: (external link)
http://hdl.handle.net/10.1007/s11116-015-9595-0 (text/html)
Access to full text is restricted to subscribers.
Persistent link: https://EconPapers.repec.org/RePEc:kap:transp:v:42:y:2015:i:4:p:581-595
Ordering information: This journal article can be ordered from
http://www.springer. ... ce/journal/11116/PS2
DOI: 10.1007/s11116-015-9595-0
Transportation is currently edited by Kay W. Axhausen
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.