
Semantic Similarity for English and Arabic Texts: A Review

Marwah Alian and Arafat Awajan
Additional contact information
Marwah Alian: Princess Sumaya University for Technology, Amman, Jordan; Hashemite University, Zarqa, Jordan
Arafat Awajan: Princess Sumaya University for Technology, Amman, Jordan

Journal of Information & Knowledge Management (JIKM), 2020, vol. 19, issue 04, 1-29

Abstract: Semantic similarity is the task of measuring relations between sentences or words to determine the degree of similarity or resemblance. Several applications of natural language processing require semantic similarity measurement to achieve good results; these applications include plagiarism detection, text entailment, text summarisation, paraphrasing identification, and information extraction. Many researchers have proposed new methods to measure the semantic similarity of Arabic and English texts. In this research, these methods are reviewed and compared. Results show that the precision of the corpus-based approach exceeds 0.70. The precision of the descriptive feature-based technique is between 0.670 and 0.86, with a Pearson correlation coefficient of over 0.70. Meanwhile, the word embedding technique has a correlation of 0.67, and its accuracy is in the range 0.76–0.80. The best results are achieved by the feature-based approach.

Keywords: Semantic similarity; feature-based; word embeddings; statistical corpus-based; sentence similarity; word similarity; document similarity
Date: 2020

Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219649220500331
Access to full text is restricted to subscribers

Persistent link: https://EconPapers.repec.org/RePEc:wsi:jikmxx:v:19:y:2020:i:04:n:s0219649220500331

DOI: 10.1142/S0219649220500331

Journal of Information & Knowledge Management (JIKM) is currently edited by Professor Suliman Hawamdeh

More articles in Journal of Information & Knowledge Management (JIKM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim.

 
Page updated 2025-03-20
Handle: RePEc:wsi:jikmxx:v:19:y:2020:i:04:n:s0219649220500331