Semantic Relatedness Estimation using the Layout Information of Wikipedia Articles
Patrick Chan,
Yoshinori Hijikata,
Toshiya Kuramochi and
Shogo Nishida
Additional contact information
Patrick Chan: Osaka University, Suita, Japan
Yoshinori Hijikata: Osaka University, Suita, Japan
Toshiya Kuramochi: Osaka University, Suita, Japan
Shogo Nishida: Osaka University, Suita, Japan
International Journal of Cognitive Informatics and Natural Intelligence (IJCINI), 2013, vol. 7, issue 2, 30-48
Abstract:
Computing the semantic relatedness between two words or phrases is an important problem in fields such as information retrieval and natural language processing. Explicit Semantic Analysis (ESA), a state-of-the-art approach to solve the problem uses word frequency to estimate relevance. Therefore, the relevance of words with low frequency cannot always be well estimated. To improve the relevance estimate of low-frequency words and concepts, the authors apply regression to word frequency, its location in an article, and its text style to calculate the relevance. The relevance value is subsequently used to compute semantic relatedness. Empirical evaluation shows that, for low-frequency words, the authors’ method achieves better estimate of semantic relatedness over ESA. Furthermore, when all words of the dataset are considered, the combination of the authors’ proposed method and the conventional approach outperforms the conventional approach alone.
Date: 2013
References: Add references at CitEc
Citations:
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 18/ijcini.2013040103 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:igg:jcini0:v:7:y:2013:i:2:p:30-48
Access Statistics for this article
International Journal of Cognitive Informatics and Natural Intelligence (IJCINI) is currently edited by Kangshun Li
More articles in International Journal of Cognitive Informatics and Natural Intelligence (IJCINI) from IGI Global
Bibliographic data for series maintained by Journal Editor ().