ChatGPT for GTFS: benchmarking LLMs on GTFS semantics... and retrieval
Saipraneeth Devunuri,
Shirin Qiam and
Lewis J. Lehe
Additional contact information
Saipraneeth Devunuri: University of Illinois at Urbana-Champaign
Shirin Qiam: University of Illinois at Urbana-Champaign
Lewis J. Lehe: University of Illinois at Urbana-Champaign
Public Transport, 2024, vol. 16, issue 2, No 1, 333-357
Abstract:
The General Transit Feed Specification (GTFS) standard for publishing transit data is ubiquitous. With LLMs now in wide use, this research explores the possibility of extracting transit information from GTFS through natural language instructions. To evaluate the capabilities and limitations of LLMs, we introduce two benchmarks, namely “GTFS Semantics” and “GTFS Retrieval”, that test how well LLMs can “understand” GTFS standards and retrieve relevant transit information. We benchmark OpenAI’s GPT-3.5 Turbo and GPT-4 LLMs, which are backends for the ChatGPT interface. In particular, we use zero-shot, one-shot, chain of thought, and program synthesis techniques with prompt engineering. For our multiple-choice questions, GPT-3.5 Turbo answers 59.7% correctly and GPT-4 answers 73.3% correctly, but they do worse when one of the multiple-choice options is replaced by “None of these”. Furthermore, we evaluate how well the LLMs can extract information from a filtered GTFS feed containing four bus routes from the Chicago Transit Authority. Program synthesis techniques outperformed zero-shot approaches, achieving up to 93% (90%) accuracy for simple queries and 61% (41%) for complex ones using GPT-4 (GPT-3.5 Turbo).
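As a rough illustration of the program synthesis idea mentioned in the abstract, the sketch below shows the kind of short program a model might be asked to generate for a simple retrieval question such as "how many trips run on a route, and which stops do they serve?". It is not the authors' code or prompts. The tables are made-up toy data rather than the Chicago Transit Authority feed, and the helper names (`read_table`, `trips_for_route`, `stops_for_route`) are hypothetical. Only the column names follow the GTFS `trips.txt` and `stop_times.txt` files.

```python
import csv
import io

# Toy, made-up GTFS-style tables (illustrative only; not the CTA feed from the paper).
TRIPS_TXT = """route_id,service_id,trip_id
R1,WKD,T1
R1,WKD,T2
R2,WKD,T3
"""

STOP_TIMES_TXT = """trip_id,arrival_time,departure_time,stop_id,stop_sequence
T1,08:00:00,08:00:00,S1,1
T1,08:10:00,08:10:00,S2,2
T2,09:00:00,09:00:00,S1,1
T2,09:12:00,09:12:00,S3,2
T3,08:30:00,08:30:00,S4,1
"""


def read_table(text):
    """Parse a GTFS-style CSV string into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def trips_for_route(trips, route_id):
    """Return the trip_ids that belong to the given route_id."""
    return [row["trip_id"] for row in trips if row["route_id"] == route_id]


def stops_for_route(trips, stop_times, route_id):
    """Return the sorted, de-duplicated stop_ids visited by any trip on the route."""
    trip_ids = set(trips_for_route(trips, route_id))
    return sorted({row["stop_id"] for row in stop_times if row["trip_id"] in trip_ids})


trips = read_table(TRIPS_TXT)
stop_times = read_table(STOP_TIMES_TXT)

print(len(trips_for_route(trips, "R1")))         # 2
print(stops_for_route(trips, stop_times, "R1"))  # ['S1', 'S2', 'S3']
```

The point of the sketch is that the answer comes from running code over the feed's tables rather than from the model reading the raw files directly; a real feed would be loaded from the `.txt` files on disk instead of inline strings.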
Keywords: GTFS; ChatGPT; Large language models; Generative AI; GPT-3.5 Turbo; GPT-4
Date: 2024
Downloads: (external link)
http://link.springer.com/10.1007/s12469-024-00354-x Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Persistent link: https://EconPapers.repec.org/RePEc:spr:pubtra:v:16:y:2024:i:2:d:10.1007_s12469-024-00354-x
Ordering information: This journal article can be ordered from
https://www.springer ... search/journal/12469
DOI: 10.1007/s12469-024-00354-x
Public Transport is currently edited by Stefan Voß
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.