Distributional Semantic Model Based on Convolutional Neural Network for Arabic Textual Similarity
Adnen Mahmoud and
Mounir Zrigui
Additional contact information
Adnen Mahmoud: Higher Institute of Computer Science and Communication Techniques, Monastir, Tunisia
Mounir Zrigui: Faculty of Science Monastir, Monastir, Tunisia
International Journal of Cognitive Informatics and Natural Intelligence (IJCINI), 2020, vol. 14, issue 1, 35-50
Abstract:
The problem addressed is to develop a model that can reliably identify whether a previously unseen document pair is paraphrased or not. Its detection in Arabic documents is a challenge because of its variability in features and the lack of publicly available corpora. Faced with these problems, the authors propose a semantic approach. At the feature extraction level, the authors use global vectors representation combining global co-occurrence counting and a contextual skip gram model. At the paraphrase identification level, the authors apply a convolutional neural network model to learn more contextual and semantic information between documents. For experiments, the authors use Open Source Arabic Corpora as a source corpus. Then the authors collect different datasets to create a vocabulary model. For the paraphrased corpus construction, the authors replace each word from the source corpus by its most similar one which has the same grammatical class applying the word2vec algorithm and the part-of-speech annotation. Experiments show that the model achieves promising results in terms of precision and recall compared to existing approaches in the literature.
Date: 2020
References: Add references at CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 18/IJCINI.2020010103 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:igg:jcini0:v:14:y:2020:i:1:p:35-50
Access Statistics for this article
International Journal of Cognitive Informatics and Natural Intelligence (IJCINI) is currently edited by Kangshun Li
More articles in International Journal of Cognitive Informatics and Natural Intelligence (IJCINI) from IGI Global
Bibliographic data for series maintained by Journal Editor ().