Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing
By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on image processing 2016-11, Vol.25 (11), p.5427-5440 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 5440 |
---|---|
container_issue | 11 |
container_start_page | 5427 |
container_title | IEEE transactions on image processing |
container_volume | 25 |
creator | Ding, Guiguang Guo, Yuchen Zhou, Jile Gao, Yue |
description | By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH. |
doi_str_mv | 10.1109/TIP.2016.2607421 |
format | Article |
fullrecord | <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_1859722256</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7563341</ieee_id><sourcerecordid>1859722256</sourcerecordid><originalsourceid>FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</originalsourceid><addsrcrecordid>eNo9kE1PwkAQhjdGI4jeTUxMj16Ks9vttj0aIkIC0QQ8b6bbKawpFHcLEX-9JSCnmeT9mMzD2D2HPueQPc_HH30BXPWFgkQKfsG6PJM8BJDist0hTsKEy6zDbrz_AuAy5uqadUSiRBSnssumE3QLCmcGKwoGrvY-nNYFVrbZBzNCZ5bBzmIwqKuKTGN3FEyxcfYnGKJpamd_sbH1OhihX9r14pZdlVh5ujvNHvscvs4Ho3Dy_jYevExCIyPVhFGachCkJMqCDBgpEoKyyEmUHEnleZHKVi8zlJSmEOUIBQEVcVwILDOIeuzp2Ltx9feWfKNX1huqKlxTvfWap3GWCCFi1VrhaDWH5xyVeuPsCt1ec9AHiLqFqA8Q9QliG3k8tW_zFRXnwD-11vBwNFgiOstJey2SPPoDlPt2GA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1859722256</pqid></control><display><type>article</type><title>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</title><source>IEEE Electronic Library (IEL)</source><creator>Ding, Guiguang ; Guo, Yuchen ; Zhou, Jile ; Gao, Yue</creator><creatorcontrib>Ding, Guiguang ; Guo, Yuchen ; Zhou, Jile ; Gao, Yue</creatorcontrib><description>By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH.</description><identifier>ISSN: 1057-7149</identifier><identifier>EISSN: 1941-0042</identifier><identifier>DOI: 10.1109/TIP.2016.2607421</identifier><identifier>PMID: 27623584</identifier><identifier>CODEN: IIPRE4</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>Algorithm design and analysis ; Binary codes ; collective matrix factorization ; cross-modality search ; Hamming distance ; Hashing ; Matrix decomposition ; multimodal data ; Optimization ; scalability ; Semantics</subject><ispartof>IEEE transactions on image processing, 2016-11, Vol.25 (11), p.5427-5440</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</citedby><cites>FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</cites><orcidid>0000-0001-9808-9805</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7563341$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/7563341$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27623584$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Ding, Guiguang</creatorcontrib><creatorcontrib>Guo, Yuchen</creatorcontrib><creatorcontrib>Zhou, Jile</creatorcontrib><creatorcontrib>Gao, Yue</creatorcontrib><title>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</title><title>IEEE transactions on image processing</title><addtitle>TIP</addtitle><addtitle>IEEE Trans Image Process</addtitle><description>By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH.</description><subject>Algorithm design and analysis</subject><subject>Binary codes</subject><subject>collective matrix factorization</subject><subject>cross-modality search</subject><subject>Hamming distance</subject><subject>Hashing</subject><subject>Matrix decomposition</subject><subject>multimodal data</subject><subject>Optimization</subject><subject>scalability</subject><subject>Semantics</subject><issn>1057-7149</issn><issn>1941-0042</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kE1PwkAQhjdGI4jeTUxMj16Ks9vttj0aIkIC0QQ8b6bbKawpFHcLEX-9JSCnmeT9mMzD2D2HPueQPc_HH30BXPWFgkQKfsG6PJM8BJDist0hTsKEy6zDbrz_AuAy5uqadUSiRBSnssumE3QLCmcGKwoGrvY-nNYFVrbZBzNCZ5bBzmIwqKuKTGN3FEyxcfYnGKJpamd_sbH1OhihX9r14pZdlVh5ujvNHvscvs4Ho3Dy_jYevExCIyPVhFGachCkJMqCDBgpEoKyyEmUHEnleZHKVi8zlJSmEOUIBQEVcVwILDOIeuzp2Ltx9feWfKNX1huqKlxTvfWap3GWCCFi1VrhaDWH5xyVeuPsCt1ec9AHiLqFqA8Q9QliG3k8tW_zFRXnwD-11vBwNFgiOstJey2SPPoDlPt2GA</recordid><startdate>20161101</startdate><enddate>20161101</enddate><creator>Ding, Guiguang</creator><creator>Guo, Yuchen</creator><creator>Zhou, Jile</creator><creator>Gao, Yue</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-9808-9805</orcidid></search><sort><creationdate>20161101</creationdate><title>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</title><author>Ding, Guiguang ; Guo, Yuchen ; Zhou, Jile ; Gao, Yue</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c436t-388102e64a4dec0c427e0fdbe2f1ae6bbd8402ef9a4e8803ba0de0ed55d2af903</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Algorithm design and analysis</topic><topic>Binary codes</topic><topic>collective matrix factorization</topic><topic>cross-modality search</topic><topic>Hamming distance</topic><topic>Hashing</topic><topic>Matrix decomposition</topic><topic>multimodal data</topic><topic>Optimization</topic><topic>scalability</topic><topic>Semantics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ding, Guiguang</creatorcontrib><creatorcontrib>Guo, Yuchen</creatorcontrib><creatorcontrib>Zhou, Jile</creatorcontrib><creatorcontrib>Gao, Yue</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Ding, Guiguang</au><au>Guo, Yuchen</au><au>Zhou, Jile</au><au>Gao, Yue</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing</atitle><jtitle>IEEE transactions on image processing</jtitle><stitle>TIP</stitle><addtitle>IEEE Trans Image Process</addtitle><date>2016-11-01</date><risdate>2016</risdate><volume>25</volume><issue>11</issue><spage>5427</spage><epage>5440</epage><pages>5427-5440</pages><issn>1057-7149</issn><eissn>1941-0042</eissn><coden>IIPRE4</coden><abstract>By transforming data into binary representation, i.e., Hashing, we can perform high-speed search with low storage cost, and thus, Hashing has collected increasing research interest in the recent years. Recently, how to generate Hashcode for multimodal data (e.g., images with textual tags, documents with photos, and so on) for large-scale cross-modality search (e.g., searching semantically related images in database for a document query) is an important research issue because of the fast growth of multimodal data in the Web. To address this issue, a novel framework for multimodal Hashing is proposed, termed as Collective Matrix Factorization Hashing (CMFH). The key idea of CMFH is to learn unified Hashcodes for different modalities of one multimodal instance in the shared latent semantic space in which different modalities can be effectively connected. Therefore, accurate cross-modality search is supported. Based on the general framework, we extend it in the unsupervised scenario where it tries to preserve the Euclidean structure, and in the supervised scenario where it fully exploits the label information of data. The corresponding theoretical analysis and the optimization algorithms are given. We conducted comprehensive experiments on three benchmark data sets for cross-modality search. The experimental results demonstrate that CMFH can significantly outperform several state-of-the-art cross-modality Hashing methods, which validates the effectiveness of the proposed CMFH.</abstract><cop>United States</cop><pub>IEEE</pub><pmid>27623584</pmid><doi>10.1109/TIP.2016.2607421</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0001-9808-9805</orcidid></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1057-7149 |
ispartof | IEEE transactions on image processing, 2016-11, Vol.25 (11), p.5427-5440 |
issn | 1057-7149 1941-0042 |
language | eng |
recordid | cdi_proquest_miscellaneous_1859722256 |
source | IEEE Electronic Library (IEL) |
subjects | Algorithm design and analysis Binary codes collective matrix factorization cross-modality search Hamming distance Hashing Matrix decomposition multimodal data Optimization scalability Semantics |
title | Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T13%3A10%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Large-Scale%20Cross-Modality%20Search%20via%20Collective%20Matrix%20Factorization%20Hashing&rft.jtitle=IEEE%20transactions%20on%20image%20processing&rft.au=Ding,%20Guiguang&rft.date=2016-11-01&rft.volume=25&rft.issue=11&rft.spage=5427&rft.epage=5440&rft.pages=5427-5440&rft.issn=1057-7149&rft.eissn=1941-0042&rft.coden=IIPRE4&rft_id=info:doi/10.1109/TIP.2016.2607421&rft_dat=%3Cproquest_RIE%3E1859722256%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1859722256&rft_id=info:pmid/27623584&rft_ieee_id=7563341&rfr_iscdi=true |