Multimodal Recommender Systems: A Survey

The recommender system (RS) has been an integral toolkit of online services. They are equipped with various deep learning techniques to model user preference based on identifier and attribute information. With the emergence of multimedia services, such as short videos, news and etc., understanding t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Liu, Qidong, Hu, Jiaxi, Xiao, Yutian, Zhao, Xiangyu, Gao, Jingtong, Wang, Wanyu, Li, Qing, Tang, Jiliang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Liu, Qidong
Hu, Jiaxi
Xiao, Yutian
Zhao, Xiangyu
Gao, Jingtong
Wang, Wanyu
Li, Qing
Tang, Jiliang
description The recommender system (RS) has been an integral toolkit of online services. They are equipped with various deep learning techniques to model user preference based on identifier and attribute information. With the emergence of multimedia services, such as short videos, news and etc., understanding these contents while recommending becomes critical. Besides, multimodal features are also helpful in alleviating the problem of data sparsity in RS. Thus, Multimodal Recommender System (MRS) has attracted much attention from both academia and industry recently. In this paper, we will give a comprehensive survey of the MRS models, mainly from technical views. First, we conclude the general procedures and major challenges for MRS. Then, we introduce the existing MRS models according to four categories, i.e., Modality Encoder, Feature Interaction, Feature Enhancement and Model Optimization. Besides, to make it convenient for those who want to research this field, we also summarize the dataset and code resources. Finally, we discuss some promising future directions of MRS and conclude this paper. To access more details of the surveyed papers, such as implementation code, we open source a repository.
doi_str_mv 10.48550/arxiv.2302.03883
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2302_03883</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2302_03883</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-3dabb18aebf7910a4c1c530bd5b0ebc3ae099ec7e56dccb5249cae4ae99a1dfe3</originalsourceid><addsrcrecordid>eNotzrsKwjAUgOEsDqI-gJMdXVqTnqZt3ES8gSJo93KSnEKhUUlV7NuLl-nffj7GxoJHSS4ln6F_1c8oBh5HHPIc-mx6eDT32l0tNsGJzNU5uljywblr7-TaebAIzg__pG7IehU2LY3-HbBivSqW23B_3OyWi32IaQYhWNRa5Ei6ypTgmBhhJHBtpeakDSBxpchkJFNrjJZxogxSgqQUClsRDNjkt_1Sy5uvHfqu_JDLLxneTdU8rw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Multimodal Recommender Systems: A Survey</title><source>arXiv.org</source><creator>Liu, Qidong ; Hu, Jiaxi ; Xiao, Yutian ; Zhao, Xiangyu ; Gao, Jingtong ; Wang, Wanyu ; Li, Qing ; Tang, Jiliang</creator><creatorcontrib>Liu, Qidong ; Hu, Jiaxi ; Xiao, Yutian ; Zhao, Xiangyu ; Gao, Jingtong ; Wang, Wanyu ; Li, Qing ; Tang, Jiliang</creatorcontrib><description>The recommender system (RS) has been an integral toolkit of online services. They are equipped with various deep learning techniques to model user preference based on identifier and attribute information. With the emergence of multimedia services, such as short videos, news and etc., understanding these contents while recommending becomes critical. Besides, multimodal features are also helpful in alleviating the problem of data sparsity in RS. Thus, Multimodal Recommender System (MRS) has attracted much attention from both academia and industry recently. In this paper, we will give a comprehensive survey of the MRS models, mainly from technical views. First, we conclude the general procedures and major challenges for MRS. Then, we introduce the existing MRS models according to four categories, i.e., Modality Encoder, Feature Interaction, Feature Enhancement and Model Optimization. Besides, to make it convenient for those who want to research this field, we also summarize the dataset and code resources. Finally, we discuss some promising future directions of MRS and conclude this paper. To access more details of the surveyed papers, such as implementation code, we open source a repository.</description><identifier>DOI: 10.48550/arxiv.2302.03883</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Information Retrieval</subject><creationdate>2023-02</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2302.03883$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2302.03883$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Liu, Qidong</creatorcontrib><creatorcontrib>Hu, Jiaxi</creatorcontrib><creatorcontrib>Xiao, Yutian</creatorcontrib><creatorcontrib>Zhao, Xiangyu</creatorcontrib><creatorcontrib>Gao, Jingtong</creatorcontrib><creatorcontrib>Wang, Wanyu</creatorcontrib><creatorcontrib>Li, Qing</creatorcontrib><creatorcontrib>Tang, Jiliang</creatorcontrib><title>Multimodal Recommender Systems: A Survey</title><description>The recommender system (RS) has been an integral toolkit of online services. They are equipped with various deep learning techniques to model user preference based on identifier and attribute information. With the emergence of multimedia services, such as short videos, news and etc., understanding these contents while recommending becomes critical. Besides, multimodal features are also helpful in alleviating the problem of data sparsity in RS. Thus, Multimodal Recommender System (MRS) has attracted much attention from both academia and industry recently. In this paper, we will give a comprehensive survey of the MRS models, mainly from technical views. First, we conclude the general procedures and major challenges for MRS. Then, we introduce the existing MRS models according to four categories, i.e., Modality Encoder, Feature Interaction, Feature Enhancement and Model Optimization. Besides, to make it convenient for those who want to research this field, we also summarize the dataset and code resources. Finally, we discuss some promising future directions of MRS and conclude this paper. To access more details of the surveyed papers, such as implementation code, we open source a repository.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Information Retrieval</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzrsKwjAUgOEsDqI-gJMdXVqTnqZt3ES8gSJo93KSnEKhUUlV7NuLl-nffj7GxoJHSS4ln6F_1c8oBh5HHPIc-mx6eDT32l0tNsGJzNU5uljywblr7-TaebAIzg__pG7IehU2LY3-HbBivSqW23B_3OyWi32IaQYhWNRa5Ei6ypTgmBhhJHBtpeakDSBxpchkJFNrjJZxogxSgqQUClsRDNjkt_1Sy5uvHfqu_JDLLxneTdU8rw</recordid><startdate>20230208</startdate><enddate>20230208</enddate><creator>Liu, Qidong</creator><creator>Hu, Jiaxi</creator><creator>Xiao, Yutian</creator><creator>Zhao, Xiangyu</creator><creator>Gao, Jingtong</creator><creator>Wang, Wanyu</creator><creator>Li, Qing</creator><creator>Tang, Jiliang</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230208</creationdate><title>Multimodal Recommender Systems: A Survey</title><author>Liu, Qidong ; Hu, Jiaxi ; Xiao, Yutian ; Zhao, Xiangyu ; Gao, Jingtong ; Wang, Wanyu ; Li, Qing ; Tang, Jiliang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-3dabb18aebf7910a4c1c530bd5b0ebc3ae099ec7e56dccb5249cae4ae99a1dfe3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Information Retrieval</topic><toplevel>online_resources</toplevel><creatorcontrib>Liu, Qidong</creatorcontrib><creatorcontrib>Hu, Jiaxi</creatorcontrib><creatorcontrib>Xiao, Yutian</creatorcontrib><creatorcontrib>Zhao, Xiangyu</creatorcontrib><creatorcontrib>Gao, Jingtong</creatorcontrib><creatorcontrib>Wang, Wanyu</creatorcontrib><creatorcontrib>Li, Qing</creatorcontrib><creatorcontrib>Tang, Jiliang</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Liu, Qidong</au><au>Hu, Jiaxi</au><au>Xiao, Yutian</au><au>Zhao, Xiangyu</au><au>Gao, Jingtong</au><au>Wang, Wanyu</au><au>Li, Qing</au><au>Tang, Jiliang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multimodal Recommender Systems: A Survey</atitle><date>2023-02-08</date><risdate>2023</risdate><abstract>The recommender system (RS) has been an integral toolkit of online services. They are equipped with various deep learning techniques to model user preference based on identifier and attribute information. With the emergence of multimedia services, such as short videos, news and etc., understanding these contents while recommending becomes critical. Besides, multimodal features are also helpful in alleviating the problem of data sparsity in RS. Thus, Multimodal Recommender System (MRS) has attracted much attention from both academia and industry recently. In this paper, we will give a comprehensive survey of the MRS models, mainly from technical views. First, we conclude the general procedures and major challenges for MRS. Then, we introduce the existing MRS models according to four categories, i.e., Modality Encoder, Feature Interaction, Feature Enhancement and Model Optimization. Besides, to make it convenient for those who want to research this field, we also summarize the dataset and code resources. Finally, we discuss some promising future directions of MRS and conclude this paper. To access more details of the surveyed papers, such as implementation code, we open source a repository.</abstract><doi>10.48550/arxiv.2302.03883</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2302.03883
ispartof
issn
language eng
recordid cdi_arxiv_primary_2302_03883
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Information Retrieval
title Multimodal Recommender Systems: A Survey
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T17%3A35%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multimodal%20Recommender%20Systems:%20A%20Survey&rft.au=Liu,%20Qidong&rft.date=2023-02-08&rft_id=info:doi/10.48550/arxiv.2302.03883&rft_dat=%3Carxiv_GOX%3E2302_03883%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true