Improving Data Sparsity in Recommender Systems Using Matrix Regeneration with Item Features
Sang-Min Choi,
Dongwoo Lee,
Kiyoung Jang,
Chihyun Park () and
Suwon Lee ()
Additional contact information
Sang-Min Choi: Department of Computer Science, Gyeongsang National University, Jinju-si 52828, Republic of Korea
Dongwoo Lee: Manager S/W Development Wellxecon Corp., Seoul 06168, Republic of Korea
Kiyoung Jang: Department of Computer Science, Yonsei University, Seoul 03722, Republic of Korea
Chihyun Park: Department of Computer Science and Engineering, Kangwon National University, Chuncheon 24341, Republic of Korea
Suwon Lee: Department of Computer Science, Gyeongsang National University, Jinju-si 52828, Republic of Korea
Mathematics, 2023, vol. 11, issue 2, 1-26
Abstract:
With the development of the Web, users spend more time accessing information that they seek. As a result, recommendation systems have emerged to provide users with preferred contents by filtering abundant information, along with providing means of exposing search results to users more effectively. These recommendation systems operate based on the user reactions to items or on the various user or item features. It is known that recommendation results based on sparse datasets are less reliable because recommender systems operate according to user responses. Thus, we propose a method to improve the dataset sparsity and increase the accuracy of the prediction results by using item features with user responses. A method based on the content-based filtering concept is proposed to extract category rates from the user–item matrix according to the user preferences and to organize these into vectors. Thereafter, we present a method to filter the user–item matrix using the extracted vectors and to regenerate the input matrix for collaborative filtering (CF). We compare the prediction results of our approach and conventional CF using the mean absolute error and root mean square error. Moreover, we calculate the sparsity of the regenerated matrix and the existing input matrix, and demonstrate that the regenerated matrix is more dense than the existing one. By computing the Jaccard similarity between the item sets in the regenerated and existing matrices, we verify the matrix distinctions. The results of the proposed methods confirm that if the regenerated matrix is used as the CF input, a denser matrix with higher predictive accuracy can be constructed than when using conventional methods. The validity of the proposed method was verified by analyzing the effect of the input matrix composed of high average ratings on the CF prediction performance. The low sparsity and high prediction accuracy of the proposed method are verified by comparisons with the results by conventional methods. Improvements of approximately 16% based on K-nearest neighbor and 15% based on singular value decomposition, and a three times improvement in the sparsity based on regenerated and original matrices are obtained. We propose a matrix reconstruction method that can improve the performance of recommendations.
Keywords: recommendation system; collaborative filtering; content-based filtering; data sparsity; matrix regeneration (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
https://www.mdpi.com/2227-7390/11/2/292/pdf (application/pdf)
https://www.mdpi.com/2227-7390/11/2/292/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:11:y:2023:i:2:p:292-:d:1026779
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().