EconPapers    
Economics at your fingertips  
 

Mining Text with the Prototype-Matching Method

A. Durfee, A. Visa, H. Vanharanta, S. Schneberger and B. Back
Additional contact information
A. Durfee: Appalachian State University, USA
A. Visa: Tampere University of Technology, Finland
H. Vanharanta: Tampere University of Technology, Finland
S. Schneberger: Appalachian State University, USA
B. Back: Åbo Akademi University, Finland

Information Resources Management Journal (IRMJ), 2007, vol. 20, issue 3, 19-31

Abstract: Text documents are the most common means for exchanging formal knowledge among people. Text is a rich medium that can contain a vast range of information, but text can be difficult to decipher automatically. Many organizations have vast repositories of textual data but with few means of automatically mining that text. Text mining methods seek to use an understanding of natural language text to extract information relevant to user needs. This article evaluates a new text mining methodology: prototype-matching for text clustering, developed by the authors’ research group. The methodology was applied to four applications: clustering documents based on their abstracts, analyzing financial data, distinguishing authorship, and evaluating multiple translation similarity. The results are discussed in terms of common business applications and possible future research.

Date: 2007
References: Add references at CitEc
Citations:

Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 4018/irmj.2007070102 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:igg:rmj000:v:20:y:2007:i:3:p:19-31

Access Statistics for this article

Information Resources Management Journal (IRMJ) is currently edited by George Kelley

More articles in Information Resources Management Journal (IRMJ) from IGI Global
Bibliographic data for series maintained by Journal Editor ().

 
Page updated 2025-03-19
Handle: RePEc:igg:rmj000:v:20:y:2007:i:3:p:19-31