Positive-Unlabeled Learning for Network Link Prediction

Gan, Shengfeng; Alshahrani, Mohammed; Liu, Shichao

Positive-Unlabeled Learning for Network Link Prediction

Shengfeng Gan, Mohammed Alshahrani and Shichao Liu ()
Additional contact information
Shengfeng Gan: College of Computer, Hubei University of Education, Wuhan 430205, China
Mohammed Alshahrani: College of Computer Science and IT, Albaha University, Albaha 65515, Saudi Arabia
Shichao Liu: College of Informatics, Huazhong Agricultural University, Wuhan 430070, China

Mathematics, 2022, vol. 10, issue 18, 1-13

Abstract: Link prediction is an important problem in network data mining, which is dedicated to predicting the potential relationship between nodes in the network. Normally, network link prediction based on supervised classification will be trained on a dataset consisting of a set of positive samples and a set of negative samples. However, well-labeled training datasets with positive and negative annotations are always inadequate in real-world scenarios, and the datasets contain a large number of unlabeled samples that may hinder the performance of the model. To address this problem, we propose a positive-unlabeled learning framework with network representation for network link prediction only using positive samples and unlabeled samples. We first learn representation vectors of nodes using a network representation method. Next, we concatenate representation vectors of node pairs and then feed them into different classifiers to predict whether the link exists or not. To alleviate data imbalance and enhance the prediction precision, we adopt three types of positive-unlabeled (PU) learning strategies to improve the prediction performance using traditional classifier estimation, bagging strategy and reliable negative sampling. We conduct experiments on three datasets to compare different PU learning methods and discuss their influence on the prediction results. The experimental results demonstrate that PU learning has a positive impact on predictive performances and the promotion effects vary with different network structures.

Keywords: network link prediction; positive-unlabeled learning; network representation learning; supervised classification (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/10/18/3345/pdf (application/pdf)
https://www.mdpi.com/2227-7390/10/18/3345/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:10:y:2022:i:18:p:3345-:d:915642

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().