Link prediction in citation networks

Shibata, Naoki; Kajikawa, Yuya; Sakata, Ichiro

Journal of the American Society for Information Science and Technology, 2012, vol. 63, issue 1, 78-85

Abstract: In this article, we build models to predict the existence of citations among papers by formulating link prediction for 5 large-scale datasets of citation networks. A supervised machine-learning model is applied with 11 features. As a result, our learner performs very well, with F1 values between 0.74 and 0.82. Three features in particular, the link-based Jaccard coefficient, the difference in betweenness centrality, and the cosine similarity of term frequency–inverse document frequency vectors, largely affect the predictions of citations. The results also indicate that different models are required for different types of research areas: research fields with a single issue and research fields with multiple issues. In research fields with multiple issues, there are barriers among research fields, because our results indicate that papers tend to be cited locally within each research field. Therefore, one must consider the typology of targeted research areas when building models for link prediction in citation networks.
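
To make two of the features named in the abstract concrete, the following is a minimal sketch of how a link-based Jaccard coefficient and a TF-IDF cosine similarity could be computed for a pair of papers, along with the F1 measure used to report results. It is not the authors' implementation: the function names, the smoothed IDF weighting, and the toy data are all assumptions made for illustration, and the betweenness centrality feature is omitted because it requires a full graph computation.

```python
import math
from collections import Counter


def jaccard(neighbors_a, neighbors_b):
    """Link-based Jaccard coefficient: shared neighbors / all neighbors."""
    a, b = set(neighbors_a), set(neighbors_b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def tfidf_vectors(docs):
    """Build one sparse TF-IDF vector (a dict) per tokenized document."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Smoothed IDF (an assumption, not taken from the paper).
        vectors.append({
            term: (count / len(doc)) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def f1_score(y_true, y_pred):
    """F1 = 2TP / (2TP + FP + FN) for binary labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Hypothetical toy data: citation neighbors and tokenized abstracts.
neighbors = {"p1": {"p3", "p4", "p5"}, "p2": {"p4", "p5", "p6"}}
abstracts = [
    "link prediction in citation networks".split(),
    "citation networks and link analysis".split(),
    "solar cell efficiency measurement".split(),
]
vecs = tfidf_vectors(abstracts)

print("Jaccard(p1, p2):", jaccard(neighbors["p1"], neighbors["p2"]))
print("cosine(doc0, doc1):", round(cosine(vecs[0], vecs[1]), 3))
print("cosine(doc0, doc2):", round(cosine(vecs[0], vecs[2]), 3))
```

In a supervised setting like the one the abstract describes, values such as these would be computed for each candidate pair of papers and passed, together with the remaining features, to a classifier that predicts whether a citation exists.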

Date: 2012

Downloads: (external link)
https://doi.org/10.1002/asi.21664

Persistent link: https://EconPapers.repec.org/RePEc:bla:jamist:v:63:y:2012:i:1:p:78-85
