Predicting the evolution of scientific communities by interpretable machine learning approaches

Yunpei Tian, Gang Li and Jin Mao

Journal of Informetrics, 2023, vol. 17, issue 2

Abstract: Scientific communities serve as a fundamental structure of academic activity, and their evolutionary behavior reveals the development of science. To track the evolution of scientific communities and examine the mechanisms behind it, we formulate the task of event-based Group Evolution Prediction and apply interpretable machine learning approaches to it. Seven evolution events are defined for prediction, based on the evolution chains of scientific communities detected from the collaboration network. Using a detailed feature set that includes topological, external, core node, and temporal attributes, Extreme Gradient Boosting and Random Forest are adopted as the prediction models. Experiments on a Library and Information Science dataset show that Random Forest performs best, with F1 scores above 0.60 for five of the seven events. The SHapley Additive exPlanations (SHAP) measure is applied to interpret the best model, i.e., to quantify the contributions of features. Connectivity within a community is observed to have the most crucial influence, while community size, research topic consistency, research topic diversity, average node age, and the ratio of intermediary nodes also play vital roles. The proposed methodology offers a way to uncover the underlying mechanisms of the evolution of scientific communities, and the findings could help scholars and policymakers monitor scientific communities and take proactive action.
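
The abstract's interpretation step rests on Shapley values, which split a model's output among its input features. As a rough illustration only, the sketch below computes exact Shapley values for a tiny hand-written scoring function by averaging each feature's marginal contribution over all feature orderings. The function toy_score, its coefficients, the feature names, and the all-zero baseline are hypothetical assumptions made for this example; they are not the paper's model, features, or data.

```python
from itertools import permutations
from math import factorial


def shapley_values(predict, instance, baseline):
    """Exact Shapley values for one instance, measured against a baseline input."""
    features = list(instance)
    totals = {name: 0.0 for name in features}
    for order in permutations(features):
        current = dict(baseline)        # start with every feature at its baseline value
        previous = predict(current)
        for name in order:
            current[name] = instance[name]   # switch this feature to its actual value
            score = predict(current)
            totals[name] += score - previous  # marginal contribution in this ordering
            previous = score
    n_orders = factorial(len(features))
    return {name: total / n_orders for name, total in totals.items()}


def toy_score(x):
    # Hypothetical score with one interaction term; NOT the model from the paper.
    return (0.5 * x["density"]
            + 0.2 * x["size"]
            + 0.3 * x["density"] * x["topic_consistency"])


# Hypothetical community description and an all-zero reference point (assumptions).
community = {"density": 1.0, "size": 1.0, "topic_consistency": 1.0}
baseline = {"density": 0.0, "size": 0.0, "topic_consistency": 0.0}

phi = shapley_values(toy_score, community, baseline)
for name, value in sorted(phi.items(), key=lambda item: -item[1]):
    print(f"{name}: {value:.2f}")
```

In this toy case the interaction term is shared equally between the two features involved, and the contributions sum to the difference between the score of the community and the score of the baseline. Exact enumeration grows factorially with the number of features, so it is practical only for very small feature sets.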

Keywords: Scientific community; Collaboration network; SHAP; Network evolution; Machine learning
Date: 2023

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S175115772300024X
Full text for ScienceDirect subscribers only

Persistent link: https://EconPapers.repec.org/RePEc:eee:infome:v:17:y:2023:i:2:s175115772300024x

DOI: 10.1016/j.joi.2023.101399

Journal of Informetrics is currently edited by Leo Egghe

More articles in Journal of Informetrics from Elsevier
Bibliographic data for series maintained by Catherine Liu.

 
Page updated 2025-03-19
Handle: RePEc:eee:infome:v:17:y:2023:i:2:s175115772300024x