biblioverlap: an R package for document matching across bibliographic datasets
Gabriel Alves Vieira () and
Jacqueline Leta
Additional contact information
Gabriel Alves Vieira: Federal University of Rio de Janeiro
Jacqueline Leta: Federal University of Rio de Janeiro
Scientometrics, 2024, vol. 129, issue 7, No 34, 4513-4527
Abstract:
Abstract Bibliographic databases have long been a cornerstone of scientometrics research, and new information sources have prompted several comparative studies between them. Such studies often employ document-level matching procedures to identify overlaps in the corpus of each database and assess their coverage. However, despite being increasingly relevant in comparative studies, such a type of analysis still lacks an open-source tool to automate it. To fill this gap, we have developed an R package called biblioverlap, which implements a hybrid matching approach using a unique identifier and a selection of ubiquitous bibliographic fields to establish document co-occurrence. It supports data analysis from a broad range of secondary sources and can be used for comparing databases and assessing document overlap in virtually any bibliographic dataset, which can be insightful for various research questions. This paper presents the biblioverlap tool, details the matching procedure’s implementation, and uses an example dataset containing records from the Federal University of Rio de Janeiro to illustrate the package’s built-in functionality.
Keywords: Bibliographic software; Document coverage; R package; Bibliographic databases; Document matching (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11192-024-05065-5 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:scient:v:129:y:2024:i:7:d:10.1007_s11192-024-05065-5
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11192
DOI: 10.1007/s11192-024-05065-5
Access Statistics for this article
Scientometrics is currently edited by Wolfgang Glänzel
More articles in Scientometrics from Springer, Akadémiai Kiadó
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().