Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing

By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on image processing 2016-11, Vol.25 (11), p.5427-5440
Hauptverfasser:	Ding, Guiguang, Guo, Yuchen, Zhou, Jile, Gao, Yue
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithm design and analysis Binary codes collective matrix factorization cross-modality search Hamming distance Hashing Matrix decomposition multimodal data Optimization scalability Semantics
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	5440
container_issue	11
container_start_page	5427
container_title	IEEE transactions on image processing
container_volume	25
creator	Ding, Guiguang Guo, Yuchen Zhou, Jile Gao, Yue
description	By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH.
doi_str_mv	10.1109/TIP.2016.2607421
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_1859722256</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7563341</ieee_id><sourcerecordid>1859722256</sourcerecordid><originalsourceid>FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</originalsourceid><addsrcrecordid>eNo9kE1PwkAQhjdGI4jeTUxMj16Ks9vttj0aIkIC0QQ8b6bbKawpFHcLEX-9JSCnmeT9mMzD2D2HPueQPc_HH30BXPWFgkQKfsG6PJM8BJDist0hTsKEy6zDbrz_AuAy5uqadUSiRBSnssumE3QLCmcGKwoGrvY-nNYFVrbZBzNCZ5bBzmIwqKuKTGN3FEyxcfYnGKJpamd_sbH1OhihX9r14pZdlVh5ujvNHvscvs4Ho3Dy_jYevExCIyPVhFGachCkJMqCDBgpEoKyyEmUHEnleZHKVi8zlJSmEOUIBQEVcVwILDOIeuzp2Ltx9feWfKNX1huqKlxTvfWap3GWCCFi1VrhaDWH5xyVeuPsCt1ec9AHiLqFqA8Q9QliG3k8tW_zFRXnwD-11vBwNFgiOstJey2SPPoDlPt2GA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1859722256</pqid></control><display><type>article</type><title>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</title><source>IEEE Electronic Library (IEL)</source><creator>Ding, Guiguang ; Guo, Yuchen ; Zhou, Jile ; Gao, Yue</creator><creatorcontrib>Ding, Guiguang ; Guo, Yuchen ; Zhou, Jile ; Gao, Yue</creatorcontrib><description>By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH.</description><identifier>ISSN: 1057-7149</identifier><identifier>EISSN: 1941-0042</identifier><identifier>DOI: 10.1109/TIP.2016.2607421</identifier><identifier>PMID: 27623584</identifier><identifier>CODEN: IIPRE4</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>Algorithm design and analysis ; Binary codes ; collective matrix factorization ; cross-modality search ; Hamming distance ; Hashing ; Matrix decomposition ; multimodal data ; Optimization ; scalability ; Semantics</subject><ispartof>IEEE transactions on image processing, 2016-11, Vol.25 (11), p.5427-5440</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</citedby><cites>FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</cites><orcidid>0000-0001-9808-9805</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7563341$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/7563341$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27623584$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Ding, Guiguang</creatorcontrib><creatorcontrib>Guo, Yuchen</creatorcontrib><creatorcontrib>Zhou, Jile</creatorcontrib><creatorcontrib>Gao, Yue</creatorcontrib><title>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</title><title>IEEE transactions on image processing</title><addtitle>TIP</addtitle><addtitle>IEEE Trans Image Process</addtitle><description>By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH.</description><subject>Algorithm design and analysis</subject><subject>Binary codes</subject><subject>collective matrix factorization</subject><subject>cross-modality search</subject><subject>Hamming distance</subject><subject>Hashing</subject><subject>Matrix decomposition</subject><subject>multimodal data</subject><subject>Optimization</subject><subject>scalability</subject><subject>Semantics</subject><issn>1057-7149</issn><issn>1941-0042</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kE1PwkAQhjdGI4jeTUxMj16Ks9vttj0aIkIC0QQ8b6bbKawpFHcLEX-9JSCnmeT9mMzD2D2HPueQPc_HH30BXPWFgkQKfsG6PJM8BJDist0hTsKEy6zDbrz_AuAy5uqadUSiRBSnssumE3QLCmcGKwoGrvY-nNYFVrbZBzNCZ5bBzmIwqKuKTGN3FEyxcfYnGKJpamd_sbH1OhihX9r14pZdlVh5ujvNHvscvs4Ho3Dy_jYevExCIyPVhFGachCkJMqCDBgpEoKyyEmUHEnleZHKVi8zlJSmEOUIBQEVcVwILDOIeuzp2Ltx9feWfKNX1huqKlxTvfWap3GWCCFi1VrhaDWH5xyVeuPsCt1ec9AHiLqFqA8Q9QliG3k8tW_zFRXnwD-11vBwNFgiOstJey2SPPoDlPt2GA</recordid><startdate>20161101</startdate><enddate>20161101</enddate><creator>Ding, Guiguang</creator><creator>Guo, Yuchen</creator><creator>Zhou, Jile</creator><creator>Gao, Yue</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-9808-9805</orcidid></search><sort><creationdate>20161101</creationdate><title>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</title><author>Ding, Guiguang ; Guo, Yuchen ; Zhou, Jile ; Gao, Yue</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Algorithm design and analysis</topic><topic>Binary codes</topic><topic>collective matrix factorization</topic><topic>cross-modality search</topic><topic>Hamming distance</topic><topic>Hashing</topic><topic>Matrix decomposition</topic><topic>multimodal data</topic><topic>Optimization</topic><topic>scalability</topic><topic>Semantics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ding, Guiguang</creatorcontrib><creatorcontrib>Guo, Yuchen</creatorcontrib><creatorcontrib>Zhou, Jile</creatorcontrib><creatorcontrib>Gao, Yue</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ding, Guiguang</au><au>Guo, Yuchen</au><au>Zhou, Jile</au><au>Gao, Yue</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</atitle><jtitle>IEEE transactions on image processing</jtitle><stitle>TIP</stitle><addtitle>IEEE Trans Image Process</addtitle><date>2016-11-01</date><risdate>2016</risdate><volume>25</volume><issue>11</issue><spage>5427</spage><epage>5440</epage><pages>5427-5440</pages><issn>1057-7149</issn><eissn>1941-0042</eissn><coden>IIPRE4</coden><abstract>By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH.</abstract><cop>United States</cop><pub>IEEE</pub><pmid>27623584</pmid><doi>10.1109/TIP.2016.2607421</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0001-9808-9805</orcidid></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1057-7149
ispartof	IEEE transactions on image processing, 2016-11, Vol.25 (11), p.5427-5440
issn	1057-7149 1941-0042
language	eng
recordid	cdi_proquest_miscellaneous_1859722256
source	IEEE Electronic Library (IEL)
subjects	Algorithm design and analysis Binary codes collective matrix factorization cross-modality search Hamming distance Hashing Matrix decomposition multimodal data Optimization scalability Semantics
title	Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T13%3A10%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Large-Scale%20Cross-Modality%20Search%20via%20Collective%20Matrix%20Factorization%20Hashing&rft.jtitle=IEEE%20transactions%20on%20image%20processing&rft.au=Ding,%20Guiguang&rft.date=2016-11-01&rft.volume=25&rft.issue=11&rft.spage=5427&rft.epage=5440&rft.pages=5427-5440&rft.issn=1057-7149&rft.eissn=1941-0042&rft.coden=IIPRE4&rft_id=info:doi/10.1109/TIP.2016.2607421&rft_dat=%3Cproquest_RIE%3E1859722256%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1859722256&rft_id=info:pmid/27623584&rft_ieee_id=7563341&rfr_iscdi=true