Towards Demand-Driven On-The-Fly Statistics
Gelsema Tjalling () and
Heuvel Guido van Den ()
Additional contact information
Gelsema Tjalling: Statistics Netherlands, Research and Development, Henri Faasdreef 312 The Hague 2492 JP, the Netherlands.
Heuvel Guido van Den: Statistics Netherlands, Research and Development, Henri Faasdreef 312 The Hague 2492 JP, the Netherlands.
Journal of Official Statistics, 2023, vol. 39, issue 3, 351-379
Abstract:
A prototype of a question answering (QA) system, called Farseer, for the real-time calculation and dissemination of aggregate statistics is introduced. Using techniques from natural language processing (NLP), machine learning (ML), artificial intelligence (AI) and formal semantics, this framework is capable of correctly interpreting a written request for (aggregate) statistics and subsequently generating appropriate results. It is shown that the framework operates in a way that is independent of a specific statistical domain under consideration, by capturing domain specific information in a knowledge graph that is input to the framework. However, it is also shown that the prototype still has its limitations, lacking statistical disclosure control. Also, searching the knowledge graph is still time-consuming.
Keywords: Dissemination; artificial intelligence; question answering; text-to-SQL; information modeling (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://doi.org/10.2478/jos-2023-0016 (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:vrs:offsta:v:39:y:2023:i:3:p:351-379:n:5
DOI: 10.2478/jos-2023-0016
Access Statistics for this article
Journal of Official Statistics is currently edited by Annica Isaksson and Ingegerd Jansson
More articles in Journal of Official Statistics from Sciendo
Bibliographic data for series maintained by Peter Golla ().