Towards Large-Scale Unsupervised Relation Extraction from the Web
Bonan Min,
Shuming Shi,
Ralph Grishman and
Chin-Yew Lin
Additional contact information
Bonan Min: New York University, USA
Shuming Shi: Microsoft Research Asia, China
Ralph Grishman: New York University, USA
Chin-Yew Lin: Microsoft Research Asia, China
International Journal on Semantic Web and Information Systems (IJSWIS), 2012, vol. 8, issue 3, 1-23
Abstract:
The Web brings an open-ended set of semantic relations. Discovering the significant types is very challenging. Unsupervised algorithms have been developed to extract relations from a corpus without knowing the relation types in advance, but most rely on tagging arguments of predefined types. One recently reported system is able to jointly extract relations and their argument semantic classes, taking a set of relation instances extracted by an open IE (Information Extraction) algorithm as input. However, it cannot handle polysemy of relation phrases and fails to group many similar (“synonymous”) relation instances because of the sparseness of features. In this paper, the authors present a novel unsupervised algorithm that provides a more general treatment of the polysemy and synonymy problems. The algorithm incorporates various knowledge sources which they will show to be very effective for unsupervised relation extraction. Moreover, it explicitly disambiguates polysemous relation phrases and groups synonymous ones. While maintaining approximately the same precision, the algorithm achieves significant improvement on recall compared to the previous method. It is also very efficient. Experiments on a real-world dataset show that it can handle 14.7 million relation instances and extract a very large set of relations from the Web.
Date: 2012
References: Add references at CitEc
Citations:
Downloads: (external link)
https://services.igi-global.com/resolvedoi/resolve ... 018/jswis.2012070101 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:igg:jswis0:v:8:y:2012:i:3:p:1-23
Access Statistics for this article
International Journal on Semantic Web and Information Systems (IJSWIS) is currently edited by Brij Gupta
More articles in International Journal on Semantic Web and Information Systems (IJSWIS) from IGI Global
Bibliographic data for series maintained by Journal Editor ().