Predicting the evolution of scientific communities by interpretable machine learning approaches
Yunpei Tian,
Gang Li and
Jin Mao
Journal of Informetrics, 2023, vol. 17, issue 2
Abstract:
Scientific communities serve as a fundamental structure of academic activity, and its evolutionary behavior also reveals the development of science. To track the evolution of scientific communities and dig into the mechanism behind it, we formulate the task of event-based Group Evolution Prediction and apply interpretable machine learning approaches to the task. Seven evolution events for prediction are defined based on the evolution chains of scientific communities detected from the collaboration network. By using a detailed feature set, including topological, external, core node, and temporal attributes, Extreme Gradient Boosting, and Random Forest are adopted for the prediction models. Experiments on the dataset of Library and Information Science shows that Random Forest performs the best, with the F1 scores of five events greater than 0.60. Shapley Additive exPlanations measure is applied to interpret the best model, i.e., quantify the contributions of features. It is observed that connectivity within a community has the most crucial influence, and community size, research topic consistency, research topic diversity, average node age, and the ratio of intermediary nodes play vital roles. The proposed methodology offers a solution to unearth the underlying mechanisms of the evolution of scientific communities, and the findings could be useful for scholars and policymakers to monitor scientific communities and take proactive actions.
Keywords: Scientific community; Collaboration network; SHAP; Network evolution; Machine learning (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S175115772300024X
Full text for ScienceDirect subscribers only
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eee:infome:v:17:y:2023:i:2:s175115772300024x
DOI: 10.1016/j.joi.2023.101399
Access Statistics for this article
Journal of Informetrics is currently edited by Leo Egghe
More articles in Journal of Informetrics from Elsevier
Bibliographic data for series maintained by Catherine Liu ().