Cross-domain recommender system using generalized canonical correlation analysis

Recommender systems provide personalized recommendations to the users from a large number of possible options in online stores. Matrix factorization is a well-known and accurate collaborative filtering approach for recommender system, which suffers from cold-start problem for new users and items. Wh...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Knowledge and information systems 2020-12, Vol.62 (12), p.4625-4651
Hauptverfasser: Hashemi, Seyed Mohammad, Rahmati, Mohammad
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Recommender systems provide personalized recommendations to the users from a large number of possible options in online stores. Matrix factorization is a well-known and accurate collaborative filtering approach for recommender system, which suffers from cold-start problem for new users and items. When new users join the system, it will take some time before they enter some ratings in the system, until that time, there are not enough ratings to learn the matrix factorization model. Using auxiliary data such as user’s demographic, ratings and reviews in relevant domains, is an effective solution to reduce the new user problem. In this paper, we used the data of users activity from auxiliary domains to build domain-independent users representation that could be used to predict users ratings in the target domains. We proposed an iterative method which applied MAX-VAR generalized canonical correlation analysis (GCCA) on user’s latent factors learned from matrix factorization on each domain. Also, to improve the capability of GCCA to learn latent factors for new users, we propose a generalized canonical correlation analysis by inverse sum of selection matrices (GCCA-ISSM) approach, which provides better recommendations in cold-start scenarios. The proposed approach is extended using content-based features like topic models extracted from user’s reviews. We demonstrate the accuracy and effectiveness of the proposed approaches on cross-domain rating predictions using comprehensive experiments on Amazon and MovieLens datasets.
ISSN:0219-1377
0219-3116
DOI:10.1007/s10115-020-01499-4