EconPapers    
Economics at your fingertips  
 

Errors in document-type classification: a focus on engineering publications and their publishers

Domenico A. Maisano, Lucrezia Ferrara and Fiorenzo Franceschini ()
Additional contact information
Domenico A. Maisano: Politecnico di Torino, Department of Management and Production Engineering (DIGEP)
Lucrezia Ferrara: Politecnico di Torino, Department of Management and Production Engineering (DIGEP)
Fiorenzo Franceschini: Politecnico di Torino, Department of Management and Production Engineering (DIGEP)

Scientometrics, 2025, vol. 130, issue 11, No 28, 6613-6641

Abstract: Abstract This study investigates document-type (DT) classification errors—such as misclassifications of research articles, reviews, conference proceedings, editorials, etc.—in the bibliometric databases Scopus and Web of Science, focusing on the field of engineering. Such misclassifications can adversely affect academic research and research quality assessments, with potential repercussions on researchers' careers and the allocation of funding in academic institutions. By analysing a corpus of approximately 10,000 publications, a semi-automated approach is employed to identify misclassified documents and attribute errors to the responsible database. Additionally, the role of publishers (e.g., “Elsevier”, “Springer Nature”, “Taylor & Francis”, etc.) is investigated, based on the hypothesis that certain publishers may contribute more significantly than others to DT-classification errors, due to specific editorial practices or inconsistencies in metadata. The results reveal that classification errors are non-negligible (i.e., they occur at rates of a few percentage points) and that the extent of publishers' contributions varies significantly in both Scopus and WoS. However, the most problematic publishers for each database appear to be uncorrelated. Integrating various statistical tests, this study provides insights that may be valuable to researchers, research evaluators, database operators, and publishers, raising awareness of the issue and offering preliminary indications for identifying possible remedies to mitigate such errors and enhance DT-classification accuracy. Future research will further investigate the reasons behind the concentration of errors among certain publishers.

Keywords: Document-type classification; Classification error; Bibliometric databases; Engineering publications; Publisher influence (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s11192-025-05444-6 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:scient:v:130:y:2025:i:11:d:10.1007_s11192-025-05444-6

Ordering information: This journal article can be ordered from
http://www.springer.com/economics/journal/11192

DOI: 10.1007/s11192-025-05444-6

Access Statistics for this article

Scientometrics is currently edited by Wolfgang Glänzel

More articles in Scientometrics from Springer, Akadémiai Kiadó
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-12-05
Handle: RePEc:spr:scient:v:130:y:2025:i:11:d:10.1007_s11192-025-05444-6