Combination of Multiple Global Descriptors for Image Retrieval

Recent studies in image retrieval task have shown that ensembling different models and combining multiple global descriptors lead to performance improvement. However, training different models for the ensemble is not only difficult but also inefficient with respect to time and memory. In this paper,...

Full description

Bibliographic details
Main authors:	Jun, HeeJae; Ko, Byungsoo; Kim, Youngjoon; Kim, Insik; Kim, Jongtack
Format:	Article
Language:	eng
Subjects:	Computer Science - Computer Vision and Pattern Recognition; Computer Science - Information Retrieval; Computer Science - Learning
Online access:	Order full text

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Jun, HeeJae; Ko, Byungsoo; Kim, Youngjoon; Kim, Insik; Kim, Jongtack
description	Recent studies in image retrieval task have shown that ensembling different models and combining multiple global descriptors lead to performance improvement. However, training different models for the ensemble is not only difficult but also inefficient with respect to time and memory. In this paper, we propose a novel framework that exploits multiple global descriptors to get an ensemble effect while it can be trained in an end-to-end manner. The proposed framework is flexible and expandable by the global descriptor, CNN backbone, loss, and dataset. Moreover, we investigate the effectiveness of combining multiple global descriptors with quantitative and qualitative analysis. Our extensive experiments show that the combined descriptor outperforms a single global descriptor, as it can utilize different types of feature properties. In the benchmark evaluation, the proposed framework achieves the state-of-the-art performance on the CARS196, CUB200-2011, In-shop Clothes, and Stanford Online Products on image retrieval tasks. Our model implementations and pretrained models are publicly available.
doi_str_mv	10.48550/arxiv.1903.10663
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1903_10663</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1903_10663</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-12e18f3b9df0f5f17ddedec182d6e426182b55389916bb54c5bb33a0276ac8463</originalsourceid><addsrcrecordid>eNotz0FLwzAYxvFcPMj0A3gyX6A16duk6UWQqnOwMRi7lzfNGwmkS0nr0G-vTk_P__TAj7E7KcraKCUeMH-GcylbAaUUWsM1e-zSaMMJl5BOPHm--4hLmCLxdUwWI3-mechhWlKeuU-Zb0Z8J36gJQc6Y7xhVx7jTLf_u2LH15dj91Zs9-tN97QtUDdQyIqk8WBb54VXXjbOkaNBmsppqiv9E1YpMG0rtbWqHpS1ACiqRuNgag0rdv93ewH0Uw4j5q_-F9JfIPANeKFC8g</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Combination of Multiple Global Descriptors for Image Retrieval</title><source>arXiv.org</source><creator>Jun, HeeJae ; Ko, Byungsoo ; Kim, Youngjoon ; Kim, Insik ; Kim, Jongtack</creator><creatorcontrib>Jun, HeeJae ; Ko, Byungsoo ; Kim, Youngjoon ; Kim, Insik ; Kim, Jongtack</creatorcontrib><description>Recent studies in image retrieval task have shown that ensembling different models and combining multiple global descriptors lead to performance improvement. However, training different models for the ensemble is not only difficult but also inefficient with respect to time and memory. In this paper, we propose a novel framework that exploits multiple global descriptors to get an ensemble effect while it can be trained in an end-to-end manner. The proposed framework is flexible and expandable by the global descriptor, CNN backbone, loss, and dataset. Moreover, we investigate the effectiveness of combining multiple global descriptors with quantitative and qualitative analysis. Our extensive experiments show that the combined descriptor outperforms a single global descriptor, as it can utilize different types of feature properties. 
In the benchmark evaluation, the proposed framework achieves the state-of-the-art performance on the CARS196, CUB200-2011, In-shop Clothes, and Stanford Online Products on image retrieval tasks. Our model implementations and pretrained models are publicly available.</description><identifier>DOI: 10.48550/arxiv.1903.10663</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Information Retrieval ; Computer Science - Learning</subject><creationdate>2019-03</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1903.10663$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1903.10663$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Jun, HeeJae</creatorcontrib><creatorcontrib>Ko, Byungsoo</creatorcontrib><creatorcontrib>Kim, Youngjoon</creatorcontrib><creatorcontrib>Kim, Insik</creatorcontrib><creatorcontrib>Kim, Jongtack</creatorcontrib><title>Combination of Multiple Global Descriptors for Image Retrieval</title><description>Recent studies in image retrieval task have shown that ensembling different models and combining multiple global descriptors lead to performance improvement. However, training different models for the ensemble is not only difficult but also inefficient with respect to time and memory. In this paper, we propose a novel framework that exploits multiple global descriptors to get an ensemble effect while it can be trained in an end-to-end manner. 
The proposed framework is flexible and expandable by the global descriptor, CNN backbone, loss, and dataset. Moreover, we investigate the effectiveness of combining multiple global descriptors with quantitative and qualitative analysis. Our extensive experiments show that the combined descriptor outperforms a single global descriptor, as it can utilize different types of feature properties. In the benchmark evaluation, the proposed framework achieves the state-of-the-art performance on the CARS196, CUB200-2011, In-shop Clothes, and Stanford Online Products on image retrieval tasks. Our model implementations and pretrained models are publicly available.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Information Retrieval</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz0FLwzAYxvFcPMj0A3gyX6A16duk6UWQqnOwMRi7lzfNGwmkS0nr0G-vTk_P__TAj7E7KcraKCUeMH-GcylbAaUUWsM1e-zSaMMJl5BOPHm--4hLmCLxdUwWI3-mechhWlKeuU-Zb0Z8J36gJQc6Y7xhVx7jTLf_u2LH15dj91Zs9-tN97QtUDdQyIqk8WBb54VXXjbOkaNBmsppqiv9E1YpMG0rtbWqHpS1ACiqRuNgag0rdv93ewH0Uw4j5q_-F9JfIPANeKFC8g</recordid><startdate>20190325</startdate><enddate>20190325</enddate><creator>Jun, HeeJae</creator><creator>Ko, Byungsoo</creator><creator>Kim, Youngjoon</creator><creator>Kim, Insik</creator><creator>Kim, Jongtack</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20190325</creationdate><title>Combination of Multiple Global Descriptors for Image Retrieval</title><author>Jun, HeeJae ; Ko, Byungsoo ; Kim, Youngjoon ; Kim, Insik ; Kim, 
Jongtack</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-12e18f3b9df0f5f17ddedec182d6e426182b55389916bb54c5bb33a0276ac8463</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Information Retrieval</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Jun, HeeJae</creatorcontrib><creatorcontrib>Ko, Byungsoo</creatorcontrib><creatorcontrib>Kim, Youngjoon</creatorcontrib><creatorcontrib>Kim, Insik</creatorcontrib><creatorcontrib>Kim, Jongtack</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Jun, HeeJae</au><au>Ko, Byungsoo</au><au>Kim, Youngjoon</au><au>Kim, Insik</au><au>Kim, Jongtack</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Combination of Multiple Global Descriptors for Image Retrieval</atitle><date>2019-03-25</date><risdate>2019</risdate><abstract>Recent studies in image retrieval task have shown that ensembling different models and combining multiple global descriptors lead to performance improvement. However, training different models for the ensemble is not only difficult but also inefficient with respect to time and memory. In this paper, we propose a novel framework that exploits multiple global descriptors to get an ensemble effect while it can be trained in an end-to-end manner. The proposed framework is flexible and expandable by the global descriptor, CNN backbone, loss, and dataset. Moreover, we investigate the effectiveness of combining multiple global descriptors with quantitative and qualitative analysis. 
Our extensive experiments show that the combined descriptor outperforms a single global descriptor, as it can utilize different types of feature properties. In the benchmark evaluation, the proposed framework achieves the state-of-the-art performance on the CARS196, CUB200-2011, In-shop Clothes, and Stanford Online Products on image retrieval tasks. Our model implementations and pretrained models are publicly available.</abstract><doi>10.48550/arxiv.1903.10663</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.1903.10663
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_1903_10663
source	arXiv.org
subjects	Computer Science - Computer Vision and Pattern Recognition; Computer Science - Information Retrieval; Computer Science - Learning
title	Combination of Multiple Global Descriptors for Image Retrieval
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T20%3A19%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Combination%20of%20Multiple%20Global%20Descriptors%20for%20Image%20Retrieval&rft.au=Jun,%20HeeJae&rft.date=2019-03-25&rft_id=info:doi/10.48550/arxiv.1903.10663&rft_dat=%3Carxiv_GOX%3E1903_10663%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true