A knowledge graph embeddings based approach for author name disambiguation using literals
Cristian Santini (),
Genet Asefa Gesese (),
Silvio Peroni (),
Aldo Gangemi (),
Harald Sack () and
Mehwish Alam ()
Additional contact information
Cristian Santini: FIZ Karlsruhe – Leibniz Institute for Information Infrastructure
Genet Asefa Gesese: FIZ Karlsruhe – Leibniz Institute for Information Infrastructure
Silvio Peroni: University of Bologna
Aldo Gangemi: University of Bologna
Harald Sack: FIZ Karlsruhe – Leibniz Institute for Information Infrastructure
Mehwish Alam: FIZ Karlsruhe – Leibniz Institute for Information Infrastructure
Scientometrics, 2022, vol. 127, issue 8, No 27, 4887-4912
Abstract:
Abstract Scholarly data is growing continuously containing information about the articles from a plethora of venues including conferences, journals, etc. Many initiatives have been taken to make scholarly data available in the form of Knowledge Graphs (KGs). These efforts to standardize these data and make them accessible have also led to many challenges such as exploration of scholarly articles, ambiguous authors, etc. This study more specifically targets the problem of Author Name Disambiguation (AND) on Scholarly KGs and presents a novel framework, Literally Author Name Disambiguation (LAND), which utilizes Knowledge Graph Embeddings (KGEs) using multimodal literal information generated from these KGs. This framework is based on three components: (1) multimodal KGEs, (2) a blocking procedure, and finally, (3) hierarchical Agglomerative Clustering. Extensive experiments have been conducted on two newly created KGs: (i) KG containing information from Scientometrics Journal from 1978 onwards (OC-782K), and (ii) a KG extracted from a well-known benchmark for AND provided by AMiner (AMiner-534K). The results show that our proposed architecture outperforms our baselines of 8–14% in terms of F1 score and shows competitive performances on a challenging benchmark such as AMiner. The code and the datasets are publicly available through Github ( https://github.com/sntcristian/and-kge ) and Zenodo ( https://doi.org/10.5281/zenodo.6309855 ) respectively.
Keywords: Author Name Disambiguation; Bibliographic data; Citation data; Clustering; Knowledge graph embeddings; Open citations (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://link.springer.com/10.1007/s11192-022-04426-2 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:scient:v:127:y:2022:i:8:d:10.1007_s11192-022-04426-2
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11192
DOI: 10.1007/s11192-022-04426-2
Access Statistics for this article
Scientometrics is currently edited by Wolfgang Glänzel
More articles in Scientometrics from Springer, Akadémiai Kiadó
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().