Extraction of Numerical Facts from German Texts to Enrich Internal Audit Data
Gerrit Schumann () and
Jorge Marx Gómez
Additional contact information
Gerrit Schumann: University of Oldenburg
Jorge Marx Gómez: University of Oldenburg
A chapter in Artificial Intelligence Tools and Applications in Embedded and Mobile Systems, 2024, pp 183-193 from Springer
Abstract:
Abstract Large-scale automated data processing is usually only possible for internal auditors in the case of structured data. Unstructured data, such as facts contained in texts, on the other hand, are often processed manually and using sampling. This, in turn, can increase the risk of disregarding relevant information during an audit. To address this risk, we present an approach that can be used to extract numerical facts along with their associated entities and relations from German texts and convert them into a format that can be processed by audit tools. The algorithm developed for this purpose follows a rule-based logic and was evaluated using 4637 sentences from 50 German annual reports. The results show that in more than 75% of all cases, the entity and relation of a numeric value within the sentence could be determined correctly.
Keywords: Information extraction; Natural language processing; Auditing (search for similar items in EconPapers)
Date: 2024
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:prochp:978-3-031-56576-2_16
Ordering information: This item can be ordered from
http://www.springer.com/9783031565762
DOI: 10.1007/978-3-031-56576-2_16
Access Statistics for this chapter
More chapters in Progress in IS from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().