Learning Chinese Word Segmentation Based on Bidirectional GRU-CRF and CNN Network Model
Chenghai Yu,
Shupei Wang and
Jiajun Guo
Additional contact information
Chenghai Yu: Zhejiang Sci-Tech University, Zhejiang, China
Shupei Wang: Zhejiang Sci-Tech University, Zhejiang, China
Jiajun Guo: Zhejiang Sci-Tech University, Zhejiang, China
International Journal of Technology and Human Interaction (IJTHI), 2019, vol. 15, issue 3, 47-62
Abstract:
Chinese word segmentation is the basis of the Chinese natural language processing (NLP). With the development of the deep learning, various neural network models are applied to the Chinese word segmentation. However, current neural network models have the characteristics of artificial feature extraction, nonstandard word-weight, inability to effectively use long-distance information and long training time of models in Chinese word segmentation. To solve a series of problems, this article presents a CNN-Bidirectional GRU-CRF neural network model (CNN Bidirectional GRU CRF Network, CBiGCN), which breaks through the limit of conventional method window, truly realizes end-to-end processing and applies to the neural network model by the five-Tag set method, bias-variable-weight greedy strategy and supplements by Goldstein-Armijo guidelines. Besides, this model, with simple structure, is easy to be operated. And it can automatically learn features, reduces large amounts of tasks on specific knowledge in the form of handcrafted features and data pre-processing, makes use of context information effectively. The authors set an experiment with two data corpuses for Chinese word segmentation to evaluate their system. The experiment verified their new model can obtain better Chinese word segmentation results and greatly reduce training time.
Date: 2019
References: Add references at CitEc
Citations:
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 018/IJTHI.2019070104 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:igg:jthi00:v:15:y:2019:i:3:p:47-62
Access Statistics for this article
International Journal of Technology and Human Interaction (IJTHI) is currently edited by Anabela Mesquita
More articles in International Journal of Technology and Human Interaction (IJTHI) from IGI Global
Bibliographic data for series maintained by Journal Editor ().