Web Text Categorization Based on Statistical Merging Algorithm in Big Data Environment
Rujuan Wang and Gang Wang
Additional contact information
Rujuan Wang: College of Humanities & Sciences of Northeast Normal University, Changchun, China
Gang Wang: Northeast Normal University, Changchun, China
International Journal of Ambient Computing and Intelligence (IJACI), 2019, vol. 10, issue 3, 17-32
Abstract:
In modern information technology, finding the information that users really need quickly, accurately, and comprehensively has become a focus of research. In this article, a feature selection method based on a complex network is proposed for the structure and content characteristics of large-scale web text information. First, the preprocessed web text is converted into a complex network: the nodes of the network correspond to the entries (terms) in the text, the edges correspond to the links between those entries, and the degree of the nodes and their aggregation (clustering) coefficient are used to select features. Second, text classification is studied from the point of view of data sampling, and a text classification method based on density statistics is proposed. This method uses not only the density information of the text feature set in the classification process but also statistical merging criteria to obtain the difference information of each text feature, and it achieves a better classification effect on large text collections.
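The network construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how links between entries are defined, so the sketch assumes that two terms are linked when they co-occur within a sliding window, and it assumes the aggregation measure is the standard local clustering coefficient. The function names and the `window` parameter are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_term_network(tokens, window=1):
    """Build an undirected term network from a token list.

    Assumption: two distinct terms are linked if one appears within
    `window` positions after the other in the text.
    """
    adjacency = defaultdict(set)
    for i, term in enumerate(tokens):
        adjacency[term]  # touch the key so isolated terms still become nodes
        for other in tokens[i + 1 : i + 1 + window]:
            if other != term:
                adjacency[term].add(other)
                adjacency[other].add(term)
    return dict(adjacency)


def clustering_coefficient(adjacency, node):
    """Fraction of a node's neighbour pairs that are themselves linked."""
    neighbours = adjacency[node]
    k = len(neighbours)
    if k < 2:
        return 0.0
    links = sum(1 for a, b in combinations(neighbours, 2) if b in adjacency[a])
    return 2.0 * links / (k * (k - 1))


def node_features(tokens, window=1):
    """Map each term to (degree, clustering coefficient) in the term network."""
    adjacency = build_term_network(tokens, window)
    return {
        term: (len(adjacency[term]), clustering_coefficient(adjacency, term))
        for term in adjacency
    }


if __name__ == "__main__":
    tokens = ["web", "text", "network", "web", "network"]
    for term, (degree, coefficient) in node_features(tokens).items():
        print(term, degree, coefficient)
```

How the two measures are combined into a single feature score is not stated in the abstract, so the sketch returns them separately and leaves the ranking rule open.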
Date: 2019
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 018/IJACI.2019070102 (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:igg:jaci00:v:10:y:2019:i:3:p:17-32
International Journal of Ambient Computing and Intelligence (IJACI) is currently edited by Nilanjan Dey