Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing

By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on image processing 2016-11, Vol.25 (11), p.5427-5440
Hauptverfasser: Ding, Guiguang, Guo, Yuchen, Zhou, Jile, Gao, Yue
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 5440
container_issue 11
container_start_page 5427
container_title IEEE transactions on image processing
container_volume 25
creator Ding, Guiguang
Guo, Yuchen
Zhou, Jile
Gao, Yue
description By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH.
doi_str_mv 10.1109/TIP.2016.2607421
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_1859722256</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7563341</ieee_id><sourcerecordid>1859722256</sourcerecordid><originalsourceid>FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</originalsourceid><addsrcrecordid>eNo9kE1PwkAQhjdGI4jeTUxMj16Ks9vttj0aIkIC0QQ8b6bbKawpFHcLEX-9JSCnmeT9mMzD2D2HPueQPc_HH30BXPWFgkQKfsG6PJM8BJDist0hTsKEy6zDbrz_AuAy5uqadUSiRBSnssumE3QLCmcGKwoGrvY-nNYFVrbZBzNCZ5bBzmIwqKuKTGN3FEyxcfYnGKJpamd_sbH1OhihX9r14pZdlVh5ujvNHvscvs4Ho3Dy_jYevExCIyPVhFGachCkJMqCDBgpEoKyyEmUHEnleZHKVi8zlJSmEOUIBQEVcVwILDOIeuzp2Ltx9feWfKNX1huqKlxTvfWap3GWCCFi1VrhaDWH5xyVeuPsCt1ec9AHiLqFqA8Q9QliG3k8tW_zFRXnwD-11vBwNFgiOstJey2SPPoDlPt2GA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1859722256</pqid></control><display><type>article</type><title>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</title><source>IEEE Electronic Library (IEL)</source><creator>Ding, Guiguang ; Guo, Yuchen ; Zhou, Jile ; Gao, Yue</creator><creatorcontrib>Ding, Guiguang ; Guo, Yuchen ; Zhou, Jile ; Gao, Yue</creatorcontrib><description>By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH.</description><identifier>ISSN: 1057-7149</identifier><identifier>EISSN: 1941-0042</identifier><identifier>DOI: 10.1109/TIP.2016.2607421</identifier><identifier>PMID: 27623584</identifier><identifier>CODEN: IIPRE4</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>Algorithm design and analysis ; Binary codes ; collective matrix factorization ; cross-modality search ; Hamming distance ; Hashing ; Matrix decomposition ; multimodal data ; Optimization ; scalability ; Semantics</subject><ispartof>IEEE transactions on image processing, 2016-11, Vol.25 (11), p.5427-5440</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</citedby><cites>FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</cites><orcidid>0000-0001-9808-9805</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7563341$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/7563341$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27623584$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Ding, Guiguang</creatorcontrib><creatorcontrib>Guo, Yuchen</creatorcontrib><creatorcontrib>Zhou, Jile</creatorcontrib><creatorcontrib>Gao, Yue</creatorcontrib><title>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</title><title>IEEE transactions on image processing</title><addtitle>TIP</addtitle><addtitle>IEEE Trans Image Process</addtitle><description>By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH.</description><subject>Algorithm design and analysis</subject><subject>Binary codes</subject><subject>collective matrix factorization</subject><subject>cross-modality search</subject><subject>Hamming distance</subject><subject>Hashing</subject><subject>Matrix decomposition</subject><subject>multimodal data</subject><subject>Optimization</subject><subject>scalability</subject><subject>Semantics</subject><issn>1057-7149</issn><issn>1941-0042</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kE1PwkAQhjdGI4jeTUxMj16Ks9vttj0aIkIC0QQ8b6bbKawpFHcLEX-9JSCnmeT9mMzD2D2HPueQPc_HH30BXPWFgkQKfsG6PJM8BJDist0hTsKEy6zDbrz_AuAy5uqadUSiRBSnssumE3QLCmcGKwoGrvY-nNYFVrbZBzNCZ5bBzmIwqKuKTGN3FEyxcfYnGKJpamd_sbH1OhihX9r14pZdlVh5ujvNHvscvs4Ho3Dy_jYevExCIyPVhFGachCkJMqCDBgpEoKyyEmUHEnleZHKVi8zlJSmEOUIBQEVcVwILDOIeuzp2Ltx9feWfKNX1huqKlxTvfWap3GWCCFi1VrhaDWH5xyVeuPsCt1ec9AHiLqFqA8Q9QliG3k8tW_zFRXnwD-11vBwNFgiOstJey2SPPoDlPt2GA</recordid><startdate>20161101</startdate><enddate>20161101</enddate><creator>Ding, Guiguang</creator><creator>Guo, Yuchen</creator><creator>Zhou, Jile</creator><creator>Gao, Yue</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-9808-9805</orcidid></search><sort><creationdate>20161101</creationdate><title>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</title><author>Ding, Guiguang ; Guo, Yuchen ; Zhou, Jile ; Gao, Yue</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Algorithm design and analysis</topic><topic>Binary codes</topic><topic>collective matrix factorization</topic><topic>cross-modality search</topic><topic>Hamming distance</topic><topic>Hashing</topic><topic>Matrix decomposition</topic><topic>multimodal data</topic><topic>Optimization</topic><topic>scalability</topic><topic>Semantics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ding, Guiguang</creatorcontrib><creatorcontrib>Guo, Yuchen</creatorcontrib><creatorcontrib>Zhou, Jile</creatorcontrib><creatorcontrib>Gao, Yue</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ding, Guiguang</au><au>Guo, Yuchen</au><au>Zhou, Jile</au><au>Gao, Yue</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</atitle><jtitle>IEEE transactions on image processing</jtitle><stitle>TIP</stitle><addtitle>IEEE Trans Image Process</addtitle><date>2016-11-01</date><risdate>2016</risdate><volume>25</volume><issue>11</issue><spage>5427</spage><epage>5440</epage><pages>5427-5440</pages><issn>1057-7149</issn><eissn>1941-0042</eissn><coden>IIPRE4</coden><abstract>By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH.</abstract><cop>United States</cop><pub>IEEE</pub><pmid>27623584</pmid><doi>10.1109/TIP.2016.2607421</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0001-9808-9805</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1057-7149
ispartof IEEE transactions on image processing, 2016-11, Vol.25 (11), p.5427-5440
issn 1057-7149
1941-0042
language eng
recordid cdi_proquest_miscellaneous_1859722256
source IEEE Electronic Library (IEL)
subjects Algorithm design and analysis
Binary codes
collective matrix factorization
cross-modality search
Hamming distance
Hashing
Matrix decomposition
multimodal data
Optimization
scalability
Semantics
title Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T13%3A10%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Large-Scale%20Cross-Modality%20Search%20via%20Collective%20Matrix%20Factorization%20Hashing&rft.jtitle=IEEE%20transactions%20on%20image%20processing&rft.au=Ding,%20Guiguang&rft.date=2016-11-01&rft.volume=25&rft.issue=11&rft.spage=5427&rft.epage=5440&rft.pages=5427-5440&rft.issn=1057-7149&rft.eissn=1941-0042&rft.coden=IIPRE4&rft_id=info:doi/10.1109/TIP.2016.2607421&rft_dat=%3Cproquest_RIE%3E1859722256%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1859722256&rft_id=info:pmid/27623584&rft_ieee_id=7563341&rfr_iscdi=true