EconPapers    
Economics at your fingertips  
 

Automatic topic labelling for text document using ontology of graph-based concepts and dependency graph

Phu Pham, Phuc Do and Chien D.C. Ta

International Journal of Business Information Systems, 2021, vol. 36, issue 2, 221-253

Abstract: Topic labelling is an important task of text mining. It supports assigning proper topic labels to the text documents. In this paper, we present a novel approach of using graph-based concept matching approach in solving automatic topic labelling task. Our proposed model demonstrates that the quality of automatic topic labelling task for text documents can be improved, in comparison with traditional keyword-based concept matching approach. In this paper, we propose a novel approach of automatic ontology-driven topic labelling. Our proposed model is considered as a semi-supervised approach. It uses existed ontologies as the pre-knowledge base for topic identification in text documents. We performed the experiments on the real-world ACM's documents to show the effectiveness of our proposed model in solving topic labelling task. The experimental results on real-world and standard datasets demonstrate that our proposed model can leverage the output accuracy of topic labelling task in text documents.

Keywords: automatic topic labelling; ontology-driven topic labelling; dependency graph parsing; graph-based concept; frequent subgraph mining; FSM. (search for similar items in EconPapers)
Date: 2021
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.inderscience.com/link.php?id=112826 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:ids:ijbisy:v:36:y:2021:i:2:p:221-253

Access Statistics for this article

More articles in International Journal of Business Information Systems from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().

 
Page updated 2025-03-19
Handle: RePEc:ids:ijbisy:v:36:y:2021:i:2:p:221-253