Text Mining arXiv: A Look Through Quantitative Finance Papers
Michele Leonardo Bianchi ()
Additional contact information
Michele Leonardo Bianchi: Financial Stability Directorate, Banca d’Italia, 00184 Rome, Italy
Mathematics, 2025, vol. 13, issue 9, 1-20
Abstract:
This paper explores articles hosted on the arXiv preprint server with the aim of uncovering valuable insights hidden in this vast collection of research. Employing text mining techniques and through the application of natural language processing methods, I xamine the contents of quantitative finance papers posted in arXiv from 1997 to 2022. I extract and analyze, without relying on ad hoc software or proprietary databases, crucial information from the entire documents, including the references, to understand the topic trends over time and to find out the most cited researchers and journals in this domain. Additionally, I compare numerous algorithms for performing topic modeling, including state-of-the-art approaches.
Keywords: quantitative finance; text mining; natural language processing; unsupervised clustering; topic modeling; research trends; named entity recognition (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2025
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/13/9/1375/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/9/1375/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:9:p:1375-:d:1640514
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().