SAW Classification Algorithm for Chinese Text Classification
Xiaoli Guo,
Huiyu Sun,
Tiehua Zhou,
Ling Wang,
Zhaoyang Qu and
Jiannan Zang
Additional contact information
Xiaoli Guo: School of Information Engineering, Northeast Dianli University, Jilin 132012, China
Huiyu Sun: School of Information Engineering, Northeast Dianli University, Jilin 132012, China
Tiehua Zhou: Database/Bioinformatics Laboratory, Chungbuk National University, Chungbuk 362-763, Korea
Ling Wang: School of Information Engineering, Northeast Dianli University, Jilin 132012, China
Zhaoyang Qu: School of Information Engineering, Northeast Dianli University, Jilin 132012, China
Jiannan Zang: School of Information Engineering, Northeast Dianli University, Jilin 132012, China
Sustainability, 2015, vol. 7, issue 3, 1-15
Abstract:
Considering the explosive growth of data, the increased amount of text data’s effect on the performance of text categorization forward the need for higher requirements, such that the existing classification method cannot be satisfied. Based on the study of existing text classification technology and semantics, this paper puts forward a kind of Chinese text classification oriented SAW (Structural Auxiliary Word) algorithm. The algorithm uses the special space effect of Chinese text where words have an implied correlation between text information mining and text categorization for high-correlation matching. Experiments show that SAW classification algorithm on the premise of ensuring precision in classification, significantly improve the classification precision and recall, obviously improving the performance of information retrieval, and providing an effective means of data use in the era of big data information extraction.
Keywords: big data; SAW classification algorithm; relevance (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2015
References: View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
https://www.mdpi.com/2071-1050/7/3/2338/pdf (application/pdf)
https://www.mdpi.com/2071-1050/7/3/2338/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:7:y:2015:i:3:p:2338-2352:d:46200
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().