Hypatia Digital Library: A Text Classification Approach Based on Abstracts
Frosso Vorgia,
Ioannis Triantafyllou and
Alexandros Koulouris
Additional contact information
Frosso Vorgia: Technological Educational Institute of Athens
Ioannis Triantafyllou: Technological Educational Institute of Athens
Alexandros Koulouris: Technological Educational Institute of Athens
A chapter in Strategic Innovative Marketing, 2017, pp 727-733 from Springer
Abstract:
The purpose of this paper is to investigate the application of text classification in Hypatia, the digital library of the Technological Educational Institute of Athens, in order to provide an automated classification tool as an alternative to manual assignment. The crucial point in text classification is the selection of the most important terms for document representation. The classic TF.IDF weighting method was investigated. Our document collection consists of 718 abstracts in Medicine, Tourism and Food Technology. Classification was conducted using 14 classifiers available in WEKA. The classification process yielded an excellent precision score of approximately 97 %.
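For readers unfamiliar with TF.IDF, the weighting idea mentioned in the abstract can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say which TF.IDF variant was used, so the sketch takes one common formulation (term frequency as a relative count, inverse document frequency as the natural log of N/df). The function name `tf_idf` and the toy documents are hypothetical and are not taken from the chapter or from WEKA.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant: tf = count / document length, idf = ln(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights

# Hypothetical toy "abstracts", one per subject area named in the abstract.
docs = [
    "patient treatment clinical study".split(),
    "hotel tourism destination study".split(),
    "food safety fermentation study".split(),
]
weights = tf_idf(docs)
```

A term that occurs in every document, such as "study" here, receives a weight of zero, while a term confined to one document receives a positive weight. This is why the scheme helps select discriminative terms for document representation.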
Keywords: Digital libraries; Text classification; WEKA; Word stemming
Date: 2017
Persistent link: https://EconPapers.repec.org/RePEc:spr:prbchp:978-3-319-33865-1_89
Ordering information: This item can be ordered from
http://www.springer.com/9783319338651
DOI: 10.1007/978-3-319-33865-1_89
More chapters in Springer Proceedings in Business and Economics from Springer
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.