EconPapers    
Economics at your fingertips  
 

Hypatia Digital Library: A Text Classification Approach Based on Abstracts

Frosso Vorgia (), Ioannis Triantafyllou () and Alexandros Koulouris ()
Additional contact information
Frosso Vorgia: Technological Educational Institute of Athens
Ioannis Triantafyllou: Technological Educational Institute of Athens
Alexandros Koulouris: Technological Educational Institute of Athens

A chapter in Strategic Innovative Marketing, 2017, pp 727-733 from Springer

Abstract: Abstract The purpose of this paper is to investigate the application of text classification in Hypatia, the digital library of Technological Educational Institute of Athens, in order to provide an automated classification tool as an alternative to manual assignments. The crucial point in text classification is the selection of the most important term-words for document representation. Classic weighting method TF.IDF was investigated. Our document collection consists of 718 abstracts in Medicine, Tourism and Food Technology. Classification was conducted utilizing 14 classifiers available on WEKA. Classification process yielded an excellent ~97 % precision score.

Keywords: Digital libraries; Text classification; WEKA; Word stemming (search for similar items in EconPapers)
Date: 2017
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:prbchp:978-3-319-33865-1_89

Ordering information: This item can be ordered from
http://www.springer.com/9783319338651

DOI: 10.1007/978-3-319-33865-1_89

Access Statistics for this chapter

More chapters in Springer Proceedings in Business and Economics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-04-01
Handle: RePEc:spr:prbchp:978-3-319-33865-1_89