A Hybrid Recommendation Method with Reduced Data for Large-Scale Application

Most recommendation algorithms attempt to alleviate information overload by identifying which items a user will find worthwhile. Content-based (CB) filtering uses the features of items, whereas collaborative filtering (CF) relies on the opinions of similar customers to recommend items. In addition t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on human-machine systems 2010-09, Vol.40 (5), p.557-566
Hauptverfasser: Sang Hyun Choi, Young-Seon Jeong, Jeong, Myong K
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 566
container_issue 5
container_start_page 557
container_title IEEE transactions on human-machine systems
container_volume 40
creator Sang Hyun Choi
Young-Seon Jeong
Jeong, Myong K
description Most recommendation algorithms attempt to alleviate information overload by identifying which items a user will find worthwhile. Content-based (CB) filtering uses the features of items, whereas collaborative filtering (CF) relies on the opinions of similar customers to recommend items. In addition to these techniques, hybrid methods have also been suggested to improve the performance of recommendation algorithms. However, even though recent hybrid methods have helped to avoid certain limitations of CB and CF, scalability and sparsity are still major problems in large-scale recommendation systems. In order to overcome these problems, this paper proposes a novel hybrid recommendation algorithm HYRED, which combines CF using the modified Pearson's binary correlation coefficients with CB filtering using the generalized distance-to-boundary-based rating. In the proposed recommendation system, the nearest and farthest neighbors of a target customer are utilized to yield a reduced dataset of useful information by avoiding scalability and sparsity problem when confronted by tremendous volumes of data. The use of reduced datasets enables us not only to lessen the computing effort, but also to improve the performance of recommendations. In addition, a generalized method to combine CF and CB system into a hybrid recommendation system is proposed by developing on the normalization metric. We have used this HYRED algorithm to experiment with all possible combination of CF and statistical-learning-based CB filtering. These experiments have shown that the use of reduced datasets saves computational time, and neighbor information improves performance.
doi_str_mv 10.1109/TSMCC.2010.2046036
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_pascalfrancis_primary_23173730</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5456227</ieee_id><sourcerecordid>1283696068</sourcerecordid><originalsourceid>FETCH-LOGICAL-c358t-e6bf0d08a3fb485a00270aac17413929c66e86b5973d5480f244e83af01528473</originalsourceid><addsrcrecordid>eNpdkFtLw0AQhYMoWKt_QF8CIviSOnvN7mOplwotgq3PYbuZ2JQ0qbsJ0n_v9kIffNpZ5jtnZk4U3RIYEAL6aT6bjkYDCuFPgUtg8izqESFUQjmn56EGzROp0_QyuvJ-BUA416wXTYbxeLtwZR5_om3Wa6xz05ZNHU-xXTZ5_Fu2y9DKO4t5_GxaExeNiyfGfWMys6bCeLjZVKXdi66ji8JUHm-Obz_6en2Zj8bJ5OPtfTScJJYJ1SYoFwXkoAwrFlwJA0BTMMaSlBOmqbZSopILoVOWC66gCDegYqYAIqjiKetHjwffjWt-OvRtti69xaoyNTadzwhVTGoJUgX0_h-6ajpXh-0yEsZqRhjdGdIDZV3jvcMi27hybdw2QNku4GwfcLYLODsGHEQPR2vjQxKFM7Ut_UlJGUlZyiBwdweuRMRTW3AhaRj9B87CgRE</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1027931327</pqid></control><display><type>article</type><title>A Hybrid Recommendation Method with Reduced Data for Large-Scale Application</title><source>IEEE Electronic Library (IEL)</source><creator>Sang Hyun Choi ; Young-Seon Jeong ; Jeong, Myong K</creator><creatorcontrib>Sang Hyun Choi ; Young-Seon Jeong ; Jeong, Myong K</creatorcontrib><description>Most recommendation algorithms attempt to alleviate information overload by identifying which items a user will find worthwhile. Content-based (CB) filtering uses the features of items, whereas collaborative filtering (CF) relies on the opinions of similar customers to recommend items. In addition to these techniques, hybrid methods have also been suggested to improve the performance of recommendation algorithms. However, even though recent hybrid methods have helped to avoid certain limitations of CB and CF, scalability and sparsity are still major problems in large-scale recommendation systems. In order to overcome these problems, this paper proposes a novel hybrid recommendation algorithm HYRED, which combines CF using the modified Pearson's binary correlation coefficients with CB filtering using the generalized distance-to-boundary-based rating. In the proposed recommendation system, the nearest and farthest neighbors of a target customer are utilized to yield a reduced dataset of useful information by avoiding scalability and sparsity problem when confronted by tremendous volumes of data. The use of reduced datasets enables us not only to lessen the computing effort, but also to improve the performance of recommendations. In addition, a generalized method to combine CF and CB system into a hybrid recommendation system is proposed by developing on the normalization metric. We have used this HYRED algorithm to experiment with all possible combination of CF and statistical-learning-based CB filtering. These experiments have shown that the use of reduced datasets saves computational time, and neighbor information improves performance.</description><identifier>ISSN: 1094-6977</identifier><identifier>ISSN: 2168-2291</identifier><identifier>EISSN: 1558-2442</identifier><identifier>EISSN: 2168-2305</identifier><identifier>DOI: 10.1109/TSMCC.2010.2046036</identifier><identifier>CODEN: ITCRFH</identifier><language>eng</language><publisher>New-York, NY: IEEE</publisher><subject>Algorithms ; Applied sciences ; Collaboration ; Computation ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Correlation coefficients ; Customers ; Cybernetics ; Data mining ; Data processing. List processing. Character string processing ; Educational technology ; Electronic commerce ; Exact sciences and technology ; Filtering ; Filtering algorithms ; Filtration ; hybrid recommendation ; Information filtering ; Information filters ; Large-scale systems ; Memory organisation. Data processing ; Performance enhancement ; Scalability ; Software ; Sparsity ; Studies ; Systems engineering and theory ; Terminology</subject><ispartof>IEEE transactions on human-machine systems, 2010-09, Vol.40 (5), p.557-566</ispartof><rights>2015 INIST-CNRS</rights><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Sep 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c358t-e6bf0d08a3fb485a00270aac17413929c66e86b5973d5480f244e83af01528473</citedby><cites>FETCH-LOGICAL-c358t-e6bf0d08a3fb485a00270aac17413929c66e86b5973d5480f244e83af01528473</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5456227$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27915,27916,54749</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5456227$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=23173730$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Sang Hyun Choi</creatorcontrib><creatorcontrib>Young-Seon Jeong</creatorcontrib><creatorcontrib>Jeong, Myong K</creatorcontrib><title>A Hybrid Recommendation Method with Reduced Data for Large-Scale Application</title><title>IEEE transactions on human-machine systems</title><addtitle>TSMCC</addtitle><description>Most recommendation algorithms attempt to alleviate information overload by identifying which items a user will find worthwhile. Content-based (CB) filtering uses the features of items, whereas collaborative filtering (CF) relies on the opinions of similar customers to recommend items. In addition to these techniques, hybrid methods have also been suggested to improve the performance of recommendation algorithms. However, even though recent hybrid methods have helped to avoid certain limitations of CB and CF, scalability and sparsity are still major problems in large-scale recommendation systems. In order to overcome these problems, this paper proposes a novel hybrid recommendation algorithm HYRED, which combines CF using the modified Pearson's binary correlation coefficients with CB filtering using the generalized distance-to-boundary-based rating. In the proposed recommendation system, the nearest and farthest neighbors of a target customer are utilized to yield a reduced dataset of useful information by avoiding scalability and sparsity problem when confronted by tremendous volumes of data. The use of reduced datasets enables us not only to lessen the computing effort, but also to improve the performance of recommendations. In addition, a generalized method to combine CF and CB system into a hybrid recommendation system is proposed by developing on the normalization metric. We have used this HYRED algorithm to experiment with all possible combination of CF and statistical-learning-based CB filtering. These experiments have shown that the use of reduced datasets saves computational time, and neighbor information improves performance.</description><subject>Algorithms</subject><subject>Applied sciences</subject><subject>Collaboration</subject><subject>Computation</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. User interface</subject><subject>Correlation coefficients</subject><subject>Customers</subject><subject>Cybernetics</subject><subject>Data mining</subject><subject>Data processing. List processing. Character string processing</subject><subject>Educational technology</subject><subject>Electronic commerce</subject><subject>Exact sciences and technology</subject><subject>Filtering</subject><subject>Filtering algorithms</subject><subject>Filtration</subject><subject>hybrid recommendation</subject><subject>Information filtering</subject><subject>Information filters</subject><subject>Large-scale systems</subject><subject>Memory organisation. Data processing</subject><subject>Performance enhancement</subject><subject>Scalability</subject><subject>Software</subject><subject>Sparsity</subject><subject>Studies</subject><subject>Systems engineering and theory</subject><subject>Terminology</subject><issn>1094-6977</issn><issn>2168-2291</issn><issn>1558-2442</issn><issn>2168-2305</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkFtLw0AQhYMoWKt_QF8CIviSOnvN7mOplwotgq3PYbuZ2JQ0qbsJ0n_v9kIffNpZ5jtnZk4U3RIYEAL6aT6bjkYDCuFPgUtg8izqESFUQjmn56EGzROp0_QyuvJ-BUA416wXTYbxeLtwZR5_om3Wa6xz05ZNHU-xXTZ5_Fu2y9DKO4t5_GxaExeNiyfGfWMys6bCeLjZVKXdi66ji8JUHm-Obz_6en2Zj8bJ5OPtfTScJJYJ1SYoFwXkoAwrFlwJA0BTMMaSlBOmqbZSopILoVOWC66gCDegYqYAIqjiKetHjwffjWt-OvRtti69xaoyNTadzwhVTGoJUgX0_h-6ajpXh-0yEsZqRhjdGdIDZV3jvcMi27hybdw2QNku4GwfcLYLODsGHEQPR2vjQxKFM7Ut_UlJGUlZyiBwdweuRMRTW3AhaRj9B87CgRE</recordid><startdate>20100901</startdate><enddate>20100901</enddate><creator>Sang Hyun Choi</creator><creator>Young-Seon Jeong</creator><creator>Jeong, Myong K</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7TB</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>F28</scope></search><sort><creationdate>20100901</creationdate><title>A Hybrid Recommendation Method with Reduced Data for Large-Scale Application</title><author>Sang Hyun Choi ; Young-Seon Jeong ; Jeong, Myong K</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c358t-e6bf0d08a3fb485a00270aac17413929c66e86b5973d5480f244e83af01528473</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Algorithms</topic><topic>Applied sciences</topic><topic>Collaboration</topic><topic>Computation</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. User interface</topic><topic>Correlation coefficients</topic><topic>Customers</topic><topic>Cybernetics</topic><topic>Data mining</topic><topic>Data processing. List processing. Character string processing</topic><topic>Educational technology</topic><topic>Electronic commerce</topic><topic>Exact sciences and technology</topic><topic>Filtering</topic><topic>Filtering algorithms</topic><topic>Filtration</topic><topic>hybrid recommendation</topic><topic>Information filtering</topic><topic>Information filters</topic><topic>Large-scale systems</topic><topic>Memory organisation. Data processing</topic><topic>Performance enhancement</topic><topic>Scalability</topic><topic>Software</topic><topic>Sparsity</topic><topic>Studies</topic><topic>Systems engineering and theory</topic><topic>Terminology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sang Hyun Choi</creatorcontrib><creatorcontrib>Young-Seon Jeong</creatorcontrib><creatorcontrib>Jeong, Myong K</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><jtitle>IEEE transactions on human-machine systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Sang Hyun Choi</au><au>Young-Seon Jeong</au><au>Jeong, Myong K</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Hybrid Recommendation Method with Reduced Data for Large-Scale Application</atitle><jtitle>IEEE transactions on human-machine systems</jtitle><stitle>TSMCC</stitle><date>2010-09-01</date><risdate>2010</risdate><volume>40</volume><issue>5</issue><spage>557</spage><epage>566</epage><pages>557-566</pages><issn>1094-6977</issn><issn>2168-2291</issn><eissn>1558-2442</eissn><eissn>2168-2305</eissn><coden>ITCRFH</coden><abstract>Most recommendation algorithms attempt to alleviate information overload by identifying which items a user will find worthwhile. Content-based (CB) filtering uses the features of items, whereas collaborative filtering (CF) relies on the opinions of similar customers to recommend items. In addition to these techniques, hybrid methods have also been suggested to improve the performance of recommendation algorithms. However, even though recent hybrid methods have helped to avoid certain limitations of CB and CF, scalability and sparsity are still major problems in large-scale recommendation systems. In order to overcome these problems, this paper proposes a novel hybrid recommendation algorithm HYRED, which combines CF using the modified Pearson's binary correlation coefficients with CB filtering using the generalized distance-to-boundary-based rating. In the proposed recommendation system, the nearest and farthest neighbors of a target customer are utilized to yield a reduced dataset of useful information by avoiding scalability and sparsity problem when confronted by tremendous volumes of data. The use of reduced datasets enables us not only to lessen the computing effort, but also to improve the performance of recommendations. In addition, a generalized method to combine CF and CB system into a hybrid recommendation system is proposed by developing on the normalization metric. We have used this HYRED algorithm to experiment with all possible combination of CF and statistical-learning-based CB filtering. These experiments have shown that the use of reduced datasets saves computational time, and neighbor information improves performance.</abstract><cop>New-York, NY</cop><pub>IEEE</pub><doi>10.1109/TSMCC.2010.2046036</doi><tpages>10</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1094-6977
ispartof IEEE transactions on human-machine systems, 2010-09, Vol.40 (5), p.557-566
issn 1094-6977
2168-2291
1558-2442
2168-2305
language eng
recordid cdi_pascalfrancis_primary_23173730
source IEEE Electronic Library (IEL)
subjects Algorithms
Applied sciences
Collaboration
Computation
Computer science
control theory
systems
Computer systems and distributed systems. User interface
Correlation coefficients
Customers
Cybernetics
Data mining
Data processing. List processing. Character string processing
Educational technology
Electronic commerce
Exact sciences and technology
Filtering
Filtering algorithms
Filtration
hybrid recommendation
Information filtering
Information filters
Large-scale systems
Memory organisation. Data processing
Performance enhancement
Scalability
Software
Sparsity
Studies
Systems engineering and theory
Terminology
title A Hybrid Recommendation Method with Reduced Data for Large-Scale Application
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T05%3A21%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Hybrid%20Recommendation%20Method%20with%20Reduced%20Data%20for%20Large-Scale%20Application&rft.jtitle=IEEE%20transactions%20on%20human-machine%20systems&rft.au=Sang%20Hyun%20Choi&rft.date=2010-09-01&rft.volume=40&rft.issue=5&rft.spage=557&rft.epage=566&rft.pages=557-566&rft.issn=1094-6977&rft.eissn=1558-2442&rft.coden=ITCRFH&rft_id=info:doi/10.1109/TSMCC.2010.2046036&rft_dat=%3Cproquest_RIE%3E1283696068%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1027931327&rft_id=info:pmid/&rft_ieee_id=5456227&rfr_iscdi=true