Discovering communities based on mention distance

Li Zhang, Ming Liu, Bo Wang, Bo Lang and Peng Yang
Additional contact information
Li Zhang: Beijing Information Science and Technology University
Ming Liu: National Computer Network Emergency Response Technical Team/Coordination Center of China
Bo Wang: National Computer Network Emergency Response Technical Team/Coordination Center of China
Bo Lang: Beihang University
Peng Yang: National Computer Network Emergency Response Technical Team/Coordination Center of China

Scientometrics, 2021, vol. 126, issue 3, No 4, 1945-1967

Abstract: Scholarly community detection has important applications in various fields. Current studies rely heavily on structured scholar networks, which have high computational complexity and are challenging to construct in practice. We propose a novel approach that can detect disjoint and overlapping scholarly communities directly from large textual corpora. To the best of our knowledge, this is the first study intended to detect communities directly from unstructured texts. In general, academic articles tend to mention related work and researchers. Researchers who are more closely related to each other tend to be mentioned closer together in the lines of academic text. Based on this correlation, we propose an intuitive method that measures the mutual relatedness of researchers through their textual distance. First, we extract and disambiguate the researcher names from academic articles. Then, we embed each researcher as an implicit vector and measure the relatedness of researchers by their vector distance. Finally, the communities are identified by vector clusters. We develop and evaluate our method on several real-world datasets. The experimental results demonstrate that our method achieves performance comparable to that of several state-of-the-art methods.
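
The three steps named in the abstract (name extraction, researcher vectors, vector clustering) can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the embedding model or the clustering algorithm, so the sketch swaps in distance-weighted co-mention counts for the learned embedding and threshold-linked connected components for the clustering. The function names, the `window` and `threshold` parameters, and the researcher tokens are all hypothetical, and name extraction and disambiguation are assumed to have already reduced each researcher to a single token. Unlike the method described, this sketch yields only disjoint communities.

```python
from itertools import combinations
import math


def mention_vectors(docs, names, window=10):
    """Build one vector per researcher from distance-weighted co-mentions.

    docs  : list of token lists; researcher names are assumed to be
            single, already-disambiguated tokens (an assumption).
    names : set of researcher tokens to track.
    """
    index = {name: i for i, name in enumerate(sorted(names))}
    vectors = {name: [0.0] * len(index) for name in index}
    for name, i in index.items():
        vectors[name][i] = 1.0  # self-weight so co-mentioned pairs align
    for tokens in docs:
        mentions = [(pos, tok) for pos, tok in enumerate(tokens) if tok in index]
        for (pos_a, a), (pos_b, b) in combinations(mentions, 2):
            gap = pos_b - pos_a  # textual distance in tokens, always >= 1
            if a != b and gap <= window:
                weight = 1.0 / gap  # closer mentions count more
                vectors[a][index[b]] += weight
                vectors[b][index[a]] += weight
    return vectors


def cosine(u, v):
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def communities(vectors, threshold=0.5):
    """Group researchers whose vectors are linked by cosine >= threshold."""
    parent = {name: name for name in vectors}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in combinations(vectors, 2):
        if cosine(vectors[a], vectors[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in vectors:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical toy corpus: two groups that never share a sentence.
docs = [
    "A_Smith and B_Jones build on C_Lee".split(),
    "B_Jones , C_Lee and A_Smith report similar results".split(),
    "D_Kim follows E_Park".split(),
    "E_Park and D_Kim disagree with earlier work".split(),
]
names = {"A_Smith", "B_Jones", "C_Lee", "D_Kim", "E_Park"}
result = communities(mention_vectors(docs, names))
print(result)
```

On this toy corpus the sketch separates the researchers into two groups, one containing A_Smith, B_Jones and C_Lee and the other containing D_Kim and E_Park, because members of different groups are never mentioned within the same window.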

Keywords: Community detection; Scholarly big data; Representation learning; Scientific information extraction
Date: 2021
Downloads: (external link)
http://link.springer.com/10.1007/s11192-021-03863-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Persistent link: https://EconPapers.repec.org/RePEc:spr:scient:v:126:y:2021:i:3:d:10.1007_s11192-021-03863-9

Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11192

DOI: 10.1007/s11192-021-03863-9

Scientometrics is currently edited by Wolfgang Glänzel

More articles in Scientometrics from Springer, Akadémiai Kiadó
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.

Handle: RePEc:spr:scient:v:126:y:2021:i:3:d:10.1007_s11192-021-03863-9