Active learning for screening prioritization in systematic reviews - A simulation study
Gerbrich Ferdinands,
Raoul Schram,
Jonathan de Bruin,
Ayoub Bagheri,
Daniel Leonard Oberski,
Lars Tummers and
Rens van de Schoot
Additional contact information
Daniel Leonard Oberski: Tilburg University
Lars Tummers: Utrecht University
No w6qbg, OSF Preprints from Center for Open Science
Abstract:
Background: Conducting a systematic review requires great screening effort. Various tools have been proposed to speed up the process of screening thousands of titles and abstracts by engaging in active learning. In such tools, the reviewer interacts with machine learning software to identify relevant publications as early as possible. To gain a comprehensive understanding of active learning models for reducing workload in systematic reviews, the current study provides a methodical overview of such models. Active learning models were evaluated across four different classification techniques (naive Bayes, logistic regression, support vector machines, and random forest) and two different feature extraction strategies (TF-IDF and doc2vec). Moreover, models were evaluated across six systematic review datasets from various research areas to assess the generalizability of active learning models across different research contexts.
Methods: Performance of the models was assessed by conducting simulations on six systematic review datasets. We defined desirable model performance as maximizing recall while minimizing the number of publications needed to screen. Model performance was evaluated by recall curves, WSS@95, RRF@10, and ATD.
Results: Within all datasets, model performance far exceeded screening in random order. The models reduced the number of publications needed to screen by 63.9% to 91.7%.
Conclusions: Active learning models for screening prioritization show great potential in reducing the workload in systematic reviews. Overall, the naive Bayes + TF-IDF model performed the best.
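The abstract names WSS@95, RRF@10, and ATD without defining them. The sketch below shows one way these metrics could be computed from a screening order. It assumes the commonly used readings: WSS@95 as work saved over sampling at 95% recall, RRF@10 as the share of relevant records found after screening 10% of all records, and ATD as the average time to discovery expressed as a fraction of the dataset. The function names and the toy label sequence are hypothetical and are not taken from the study.

```python
# Minimal sketch of screening-prioritization metrics.
# ASSUMPTION: metric definitions follow common usage; the abstract does not
# spell them out. Labels are listed in screening order: 1 = relevant, 0 = not.

def records_needed(labels, recall_percent):
    """Number of records screened when recall first reaches recall_percent."""
    total_relevant = sum(labels)
    if total_relevant == 0:
        raise ValueError("need at least one relevant record")
    found = 0
    for position, label in enumerate(labels, start=1):
        found += label
        # Integer comparison avoids floating-point edge cases.
        if found * 100 >= recall_percent * total_relevant:
            return position
    return len(labels)

def wss(labels, recall_percent=95):
    """Share of records left unscreened at the recall level, minus the
    share that random-order screening would be expected to leave."""
    n = len(labels)
    unscreened = (n - records_needed(labels, recall_percent)) / n
    return unscreened - (100 - recall_percent) / 100

def rrf(labels, screened_percent=10):
    """Share of relevant records found after screening screened_percent
    of all records (number of screened records rounded up)."""
    n_screened = (screened_percent * len(labels) + 99) // 100
    return sum(labels[:n_screened]) / sum(labels)

def atd(labels):
    """Mean position at which relevant records are found, as a fraction
    of the dataset size."""
    positions = [i for i, label in enumerate(labels, start=1) if label == 1]
    return sum(positions) / len(positions) / len(labels)

# Toy example (invented data): 20 records, 4 relevant, ranked by a
# hypothetical model so that relevant records appear early.
toy_order = [1, 1, 0, 1, 0, 0, 1] + [0] * 13

print(f"WSS@95 = {wss(toy_order, 95):.3f}")   # 0.600
print(f"RRF@10 = {rrf(toy_order, 10):.3f}")   # 0.500
print(f"ATD    = {atd(toy_order):.3f}")       # 0.175
```

In this toy ordering all four relevant records are found after 7 of 20 records, so 65% of the records remain unscreened at full recall; subtracting the 5% expected under random order gives a WSS@95 of 0.60. Higher WSS and RRF and lower ATD indicate better prioritization under these assumed definitions.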
Date: 2020-09-16
New Economics Papers: this item is included in nep-cmp
Downloads: (external link)
https://osf.io/download/5f622ab52c735502f6e9e979/
Persistent link: https://EconPapers.repec.org/RePEc:osf:osfxxx:w6qbg
DOI: 10.31219/osf.io/w6qbg