‘Seed + expand’: a general methodology for detecting publication oeuvres of individual researchers
Linda Reijnhoudt (),
Rodrigo Costas (),
Ed Noyons (),
Katy Börner () and
Andrea Scharnhorst ()
Additional contact information
Linda Reijnhoudt: Royal Netherlands Academy of Arts and Sciences (KNAW)
Rodrigo Costas: Center for Science and Technology Studies (CWTS)-Leiden University
Ed Noyons: Center for Science and Technology Studies (CWTS)-Leiden University
Katy Börner: Royal Netherlands Academy of Arts and Sciences (KNAW)
Andrea Scharnhorst: Royal Netherlands Academy of Arts and Sciences (KNAW)
Scientometrics, 2014, vol. 101, issue 2, No 30, 1403-1417
Abstract:
Abstract The study of science at the individual scholar level requires the disambiguation of author names. The creation of author’s publication oeuvres involves matching the list of unique author names to names used in publication databases. Despite recent progress in the development of unique author identifiers, e.g., ORCID, VIVO, or DAI, author disambiguation remains a key problem when it comes to large-scale bibliometric analysis using data from multiple databases. This study introduces and tests a new methodology called seed + expand for semi-automatic bibliographic data collection for a given set of individual authors. Specifically, we identify the oeuvre of a set of Dutch full professors during the period 1980–2011. In particular, we combine author records from a Dutch National Research Information System (NARCIS) with publication records from the Web of Science. Starting with an initial list of 8,378 names, we identify ‘seed publications’ for each author using five different approaches. Subsequently, we ‘expand’ the set of publications in three different approaches. The different approaches are compared and resulting oeuvres are evaluated on precision and recall using a ‘gold standard’ dataset of authors for which verified publications in the period 2001–2010 are available.
Keywords: Author disambiguation; Publication oeuvre; Scalable methods (search for similar items in EconPapers)
Date: 2014
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (4)
Downloads: (external link)
http://link.springer.com/10.1007/s11192-014-1256-0 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:scient:v:101:y:2014:i:2:d:10.1007_s11192-014-1256-0
Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11192
DOI: 10.1007/s11192-014-1256-0
Access Statistics for this article
Scientometrics is currently edited by Wolfgang Glänzel
More articles in Scientometrics from Springer, Akadémiai Kiadó
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().