Document Selection for Knowledge Discovery in Texts: Framework Development and Demonstration
Benjamin Matthies () and
André Coners
Additional contact information
Benjamin Matthies: South Westphalia University of Applied Sciences, Hagen, Germany
André Coners: South Westphalia University of Applied Sciences, Hagen, Germany
Journal of Information & Knowledge Management (JIKM), 2017, vol. 16, issue 04, 1-24
Abstract:
The large and constantly growing amounts of available text documents hold great potential for the exploration of knowledge. However, in the light of the vast quantity and variety of available documents, one fact should not be forgotten: the results of a knowledge discovery in texts are only as good as the underlying document collection. That is why analysts have to ensure that document collections adequately represent the specific area under examination and thereby to minimise the bias and to maximise the generalisable nature of the knowledge brought to light. Surprisingly, knowledge management research has barely paid any attention to the problems of such a document quality assessment and rigorous document selection. This paper addresses that research gap and makes two contributions: In the first step, building on a cross-disciplinary exchange with social research, development of a framework for the quality assessment and collection of documents. This artefact provides concrete guidance for compiling suitable, high-quality document collections and makes a contribution to ensuring “document collection quality” within the context of knowledge discovery in texts. In the second step, the framework is evaluated in a practical demonstration. In this context, the demonstration also exemplifies how different document collections influence the results of knowledge discoveries.
Keywords: Document analysis; knowledge discovery in texts; text mining; document collection quality (search for similar items in EconPapers)
Date: 2017
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219649217500381
Access to full text is restricted to subscribers
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wsi:jikmxx:v:16:y:2017:i:04:n:s0219649217500381
Ordering information: This journal article can be ordered from
DOI: 10.1142/S0219649217500381
Access Statistics for this article
Journal of Information & Knowledge Management (JIKM) is currently edited by Professor Suliman Hawamdeh
More articles in Journal of Information & Knowledge Management (JIKM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().