Urban rail transit passenger flow prediction using large language model under multi-source spatiotemporal data fusion

Ma, Changxi; Zhao, Mingxi

Urban rail transit passenger flow prediction using large language model under multi-source spatiotemporal data fusion

Changxi Ma and Mingxi Zhao

Physica A: Statistical Mechanics and its Applications, 2025, vol. 675, issue C

Abstract: With the continuous development of urban rail transit systems, passenger flow prediction has become a crucial component for optimizing operational scheduling. This paper proposes a metro passenger flow prediction method based on Large Language Model (LLM), incorporating hyperparameter tuning, pre-training, and multi-source spatiotemporal data fusion to enhance prediction accuracy. Firstly, the generative pre-trained transformer 2 (GPT-2) model is employed for passenger flow prediction, with fine-tuning strategies applied to adjust the pre-trained model to meet the specific requirements of the prediction task. During the fine-tuning process, some pre-trained parameters are frozen, and the Cosine Annealing Learning Rate strategy is used to gradually adjust the learning rate, effectively preventing overfitting and achieving efficient optimization. Secondly, the self-attention mechanism is utilized to fuse multi-source data, including air quality, weather conditions, and spatiotemporal passenger flow data, enhancing the model’s ability to capture complex fluctuations in passenger flow. The experimental results indicate that the proposed prediction model outperforms other models by achieving the lowest prediction error and demonstrating better capability in capturing complex passenger flow fluctuations. Ablation studies further validate the importance of multi-source data fusion in metro passenger flow prediction, as well as the effectiveness of fine-tuning in enhancing model performance. In addition, short-term prediction experiments investigate the model’s ability to handle immediate, near-future passenger flow forecasts. Furthermore, few-shot learning experiments analyze the impact of the training data sample ratio on prediction error and training efficiency. This paper offers a new perspective on applying LLM to metro passenger flow prediction.

Keywords: Urban Rail Transit; Large Language Model; Passenger Flow Prediction; Multi-Source Data Fusion (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0378437125004753
Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:eee:phsmap:v:675:y:2025:i:c:s0378437125004753

DOI: 10.1016/j.physa.2025.130823

Access Statistics for this article

Physica A: Statistical Mechanics and its Applications is currently edited by K. A. Dawson, J. O. Indekeu, H.E. Stanley and C. Tsallis

More articles in Physica A: Statistical Mechanics and its Applications from Elsevier
Bibliographic data for series maintained by Catherine Liu ().