Merging textual and numerical databases: a steppingstone for statistical analyses of illegal events
Maria Francesca Romano (),
Pasquale Pavone (),
Antonella Baldassarini (),
Giuseppe Di Vetta () and
Gaetana Morgante ()
Additional contact information
Maria Francesca Romano: Scuola Superiore Sant’Anna, Institute of Economics & L’EmbeDS
Pasquale Pavone: Università Pegaso
Antonella Baldassarini: ISTAT & L’EmbeDS
Giuseppe Di Vetta: Scuola Superiore Sant’Anna, DirPolis Institute & L’EmbeDS
Gaetana Morgante: Scuola Superiore Sant’Anna, DirPolis Institute & L’EmbeDS
Quality & Quantity: International Journal of Methodology, 2025, vol. 59, issue 2, No 11, 1018 pages
Abstract:
Abstract This paper aims to define a methodological path—merging judgments and official statistical data—to organize complete, objective, and reliable data in a database, thus simplifying the analysis of illegal social phenomena. Judiciary judgments are a new data source: they deal with illegal events that describe social phenomena—even if they are only the "illegal" ones—and contain objective and reliable data and information. Judiciary judgments are also texts, so the first step is a statistical textual analysis and text mining techniques to discover information and organize it in a statistical database. The final database is obtained by integrating numerical data from other information sources. It therefore has statistical properties such as reliability, completeness and updating. Subsequent statistical analyses or modelling are then possible based on the entire set or subsets of data adequately extracted from the implemented statistical database. We present some results obtained from judgments about corruption in order to demonstrate the advantages of linking textual databases (textual analyses on judgments) and numerical databases (from ISTAT). The proposed methodology can benefit different stakeholders, such as researchers, policymakers, and other enforcement actors. It is independent of the specific software used and remains valid when applied to other illegal activities (e.g., organized crime, tax crime, and money laundering). Furthemore, the results may be even more effective if the institutional actors involved have access to judgments at all levels, thus overcoming potential privacy concerns. The methodology could also be used to support evidence-based policy in the fight against crime and illegal activities.
Keywords: Textual database; Merging textual and numerical database; Statistical database; Illegal events; Data science and criminal law (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11135-024-02039-w Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:qualqt:v:59:y:2025:i:2:d:10.1007_s11135-024-02039-w
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11135
DOI: 10.1007/s11135-024-02039-w
Access Statistics for this article
Quality & Quantity: International Journal of Methodology is currently edited by Vittorio Capecchi
More articles in Quality & Quantity: International Journal of Methodology from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().