Codifying collaborative knowledge: using Wikipedia as a basis for automated ontology learning
Tao Guo,
David G Schwartz,
Frada Burstein and
Henry Linger
Knowledge Management Research & Practice, 2009, vol. 7, issue 3, 206-217
Abstract:
In the context of knowledge management, ontology construction can be considered as a part of capturing of the body of knowledge of a particular problem domain. Traditionally, ontology construction assumes a tedious codification of the domain experts knowledge. In this paper, we describe a new approach to ontology engineering that has the potential of bridging the dichotomy between codification and collaboration turning to Web 2.0 technology. We propose to shift the primary source of ontology knowledge from the expert to socially emergent bodies of knowledge such as Wikipedia. Using Wikipedia as an example, we demonstrate how core terms and relationships of a domain ontology can be distilled from this socially constructed source. As an illustration, we describe how our approach achieved over 90% conceptual coverage compared with Gold standard hand-crafted ontologies, such as Cyc. What emerges is not a folksonomy, but rather a formal ontology that has nonetheless found its roots in social knowledge.
Date: 2009
References: Add references at CitEc
Citations:
Downloads: (external link)
http://hdl.handle.net/10.1057/kmrp.2009.14 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:taf:tkmrxx:v:7:y:2009:i:3:p:206-217
Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/tkmr20
DOI: 10.1057/kmrp.2009.14
Access Statistics for this article
Knowledge Management Research & Practice is currently edited by Giovanni Schiuma
More articles in Knowledge Management Research & Practice from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().