Conditional random fields for entity extraction and ontological text coding
Jana Diesner and Kathleen M. Carley
Additional contact information
Jana Diesner: Carnegie Mellon University
Kathleen M. Carley: Carnegie Mellon University
Computational and Mathematical Organization Theory, 2008, vol. 14, issue 3, No 4, 248-262
Abstract:
Previous research suggests that one field with a strong yet unsatisfied need for automatically extracting instances of various entity classes from texts is the analysis of socio-technical systems (Feldstein in Media in Transition MiT5, 2007; Hampe et al. in Netzwerkanalyse und Netzwerktheorie, 2007; Weil et al. in Proceedings of the 2006 Command and Control Research and Technology Symposium, 2006; Diesner and Carley in XXV Sunbelt Social Network Conference, 2005). Traditional as well as non-traditional and customized sets of entity classes and the relationships between them are often specified in ontologies or taxonomies. We present a Conditional Random Fields (CRF)-based approach to distilling a set of entities that are defined in an ontology originating from organization science. CRF, a supervised sequential machine learning technique, facilitates the derivation of relational data from corpora by locating and classifying instances of various entity classes. The classified entities can be used as nodes for the construction of socio-technical networks. We find the outcome sufficiently accurate (82.7 percent accuracy of locating and classifying entities) for future application in the described problem domain. We propose using the presented methodology as a crucial step in the process of advanced modeling and analysis of complex and dynamic networks.
Keywords: Ontological Text Coding; Semantic networks; Entity Extraction; Supervised machine learning; Conditional models; Conditional Random Fields
Date: 2008
Citations: 4 (EconPapers)
Downloads:
http://link.springer.com/10.1007/s10588-008-9029-z (abstract, text/html)
Access to the full text of the articles in this series is restricted.
Persistent link: https://EconPapers.repec.org/RePEc:spr:comaot:v:14:y:2008:i:3:d:10.1007_s10588-008-9029-z
Ordering information: This journal article can be ordered from
http://www.springer.com/journal/10588
DOI: 10.1007/s10588-008-9029-z
Computational and Mathematical Organization Theory is currently edited by Terrill Frantz and Kathleen Carley
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.