Combination of Multiple Global Descriptors for Image Retrieval

Recent studies on image retrieval have shown that ensembling different models and combining multiple global descriptors improves performance. However, training different models for the ensemble is not only difficult but also inefficient with respect to time and memory. In this paper,...

Detailed description

Bibliographic details
Main authors: Jun, HeeJae; Ko, Byungsoo; Kim, Youngjoon; Kim, Insik; Kim, Jongtack
Format: Article
Language: eng
creator Jun, HeeJae
Ko, Byungsoo
Kim, Youngjoon
Kim, Insik
Kim, Jongtack
description Recent studies on image retrieval have shown that ensembling different models and combining multiple global descriptors improves performance. However, training different models for the ensemble is not only difficult but also inefficient with respect to time and memory. In this paper, we propose a novel framework that exploits multiple global descriptors to obtain an ensemble effect while remaining trainable end to end. The proposed framework is flexible and extensible with respect to the global descriptor, CNN backbone, loss, and dataset. Moreover, we investigate the effectiveness of combining multiple global descriptors through quantitative and qualitative analysis. Our extensive experiments show that the combined descriptor outperforms a single global descriptor, as it can utilize different types of feature properties. In the benchmark evaluation, the proposed framework achieves state-of-the-art image retrieval performance on CARS196, CUB200-2011, In-shop Clothes, and Stanford Online Products. Our model implementations and pretrained models are publicly available.
doi_str_mv 10.48550/arxiv.1903.10663
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1903_10663</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1903_10663</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-12e18f3b9df0f5f17ddedec182d6e426182b55389916bb54c5bb33a0276ac8463</originalsourceid><addsrcrecordid>eNotz0FLwzAYxvFcPMj0A3gyX6A16duk6UWQqnOwMRi7lzfNGwmkS0nr0G-vTk_P__TAj7E7KcraKCUeMH-GcylbAaUUWsM1e-zSaMMJl5BOPHm--4hLmCLxdUwWI3-mechhWlKeuU-Zb0Z8J36gJQc6Y7xhVx7jTLf_u2LH15dj91Zs9-tN97QtUDdQyIqk8WBb54VXXjbOkaNBmsppqiv9E1YpMG0rtbWqHpS1ACiqRuNgag0rdv93ewH0Uw4j5q_-F9JfIPANeKFC8g</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Combination of Multiple Global Descriptors for Image Retrieval</title><source>arXiv.org</source><creator>Jun, HeeJae ; Ko, Byungsoo ; Kim, Youngjoon ; Kim, Insik ; Kim, Jongtack</creator><creatorcontrib>Jun, HeeJae ; Ko, Byungsoo ; Kim, Youngjoon ; Kim, Insik ; Kim, Jongtack</creatorcontrib><description>Recent studies in image retrieval task have shown that ensembling different models and combining multiple global descriptors lead to performance improvement. However, training different models for the ensemble is not only difficult but also inefficient with respect to time and memory. In this paper, we propose a novel framework that exploits multiple global descriptors to get an ensemble effect while it can be trained in an end-to-end manner. The proposed framework is flexible and expandable by the global descriptor, CNN backbone, loss, and dataset. Moreover, we investigate the effectiveness of combining multiple global descriptors with quantitative and qualitative analysis. Our extensive experiments show that the combined descriptor outperforms a single global descriptor, as it can utilize different types of feature properties. 
In the benchmark evaluation, the proposed framework achieves the state-of-the-art performance on the CARS196, CUB200-2011, In-shop Clothes, and Stanford Online Products on image retrieval tasks. Our model implementations and pretrained models are publicly available.</description><identifier>DOI: 10.48550/arxiv.1903.10663</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Information Retrieval ; Computer Science - Learning</subject><creationdate>2019-03</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1903.10663$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1903.10663$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Jun, HeeJae</creatorcontrib><creatorcontrib>Ko, Byungsoo</creatorcontrib><creatorcontrib>Kim, Youngjoon</creatorcontrib><creatorcontrib>Kim, Insik</creatorcontrib><creatorcontrib>Kim, Jongtack</creatorcontrib><title>Combination of Multiple Global Descriptors for Image Retrieval</title><description>Recent studies in image retrieval task have shown that ensembling different models and combining multiple global descriptors lead to performance improvement. However, training different models for the ensemble is not only difficult but also inefficient with respect to time and memory. In this paper, we propose a novel framework that exploits multiple global descriptors to get an ensemble effect while it can be trained in an end-to-end manner. 
The proposed framework is flexible and expandable by the global descriptor, CNN backbone, loss, and dataset. Moreover, we investigate the effectiveness of combining multiple global descriptors with quantitative and qualitative analysis. Our extensive experiments show that the combined descriptor outperforms a single global descriptor, as it can utilize different types of feature properties. In the benchmark evaluation, the proposed framework achieves the state-of-the-art performance on the CARS196, CUB200-2011, In-shop Clothes, and Stanford Online Products on image retrieval tasks. Our model implementations and pretrained models are publicly available.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Information Retrieval</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz0FLwzAYxvFcPMj0A3gyX6A16duk6UWQqnOwMRi7lzfNGwmkS0nr0G-vTk_P__TAj7E7KcraKCUeMH-GcylbAaUUWsM1e-zSaMMJl5BOPHm--4hLmCLxdUwWI3-mechhWlKeuU-Zb0Z8J36gJQc6Y7xhVx7jTLf_u2LH15dj91Zs9-tN97QtUDdQyIqk8WBb54VXXjbOkaNBmsppqiv9E1YpMG0rtbWqHpS1ACiqRuNgag0rdv93ewH0Uw4j5q_-F9JfIPANeKFC8g</recordid><startdate>20190325</startdate><enddate>20190325</enddate><creator>Jun, HeeJae</creator><creator>Ko, Byungsoo</creator><creator>Kim, Youngjoon</creator><creator>Kim, Insik</creator><creator>Kim, Jongtack</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20190325</creationdate><title>Combination of Multiple Global Descriptors for Image Retrieval</title><author>Jun, HeeJae ; Ko, Byungsoo ; Kim, Youngjoon ; Kim, Insik ; Kim, 
Jongtack</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-12e18f3b9df0f5f17ddedec182d6e426182b55389916bb54c5bb33a0276ac8463</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Information Retrieval</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Jun, HeeJae</creatorcontrib><creatorcontrib>Ko, Byungsoo</creatorcontrib><creatorcontrib>Kim, Youngjoon</creatorcontrib><creatorcontrib>Kim, Insik</creatorcontrib><creatorcontrib>Kim, Jongtack</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Jun, HeeJae</au><au>Ko, Byungsoo</au><au>Kim, Youngjoon</au><au>Kim, Insik</au><au>Kim, Jongtack</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Combination of Multiple Global Descriptors for Image Retrieval</atitle><date>2019-03-25</date><risdate>2019</risdate><abstract>Recent studies in image retrieval task have shown that ensembling different models and combining multiple global descriptors lead to performance improvement. However, training different models for the ensemble is not only difficult but also inefficient with respect to time and memory. In this paper, we propose a novel framework that exploits multiple global descriptors to get an ensemble effect while it can be trained in an end-to-end manner. The proposed framework is flexible and expandable by the global descriptor, CNN backbone, loss, and dataset. Moreover, we investigate the effectiveness of combining multiple global descriptors with quantitative and qualitative analysis. 
Our extensive experiments show that the combined descriptor outperforms a single global descriptor, as it can utilize different types of feature properties. In the benchmark evaluation, the proposed framework achieves the state-of-the-art performance on the CARS196, CUB200-2011, In-shop Clothes, and Stanford Online Products on image retrieval tasks. Our model implementations and pretrained models are publicly available.</abstract><doi>10.48550/arxiv.1903.10663</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1903.10663
language eng
recordid cdi_arxiv_primary_1903_10663
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
Computer Science - Information Retrieval
Computer Science - Learning
title Combination of Multiple Global Descriptors for Image Retrieval
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T20%3A19%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Combination%20of%20Multiple%20Global%20Descriptors%20for%20Image%20Retrieval&rft.au=Jun,%20HeeJae&rft.date=2019-03-25&rft_id=info:doi/10.48550/arxiv.1903.10663&rft_dat=%3Carxiv_GOX%3E1903_10663%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
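
The abstract in this record describes combining multiple global descriptors computed from one CNN backbone. The following is a minimal sketch of that general idea, not the authors' implementation. It assumes three common pooling-based descriptors (average, max, and generalized-mean pooling), a feature map given as a list of channels holding non-negative activations, and combination by concatenating L2-normalized branches. All function names and the toy input are hypothetical, and learned layers, losses, and training are omitted.

```python
import math


def avg_pool(fmap):
    """Average-pooled descriptor: mean activation per channel."""
    return [sum(ch) / len(ch) for ch in fmap]


def max_pool(fmap):
    """Max-pooled descriptor: strongest activation per channel."""
    return [max(ch) for ch in fmap]


def gem_pool(fmap, p=3.0):
    """Generalized-mean pooling; assumes non-negative activations."""
    return [(sum(x ** p for x in ch) / len(ch)) ** (1.0 / p) for ch in fmap]


def l2_normalize(vec, eps=1e-12):
    """Scale a vector to unit Euclidean length (eps guards against zero)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / max(norm, eps) for x in vec]


def combine_descriptors(fmap, poolers):
    """Concatenate the L2-normalized output of each pooler, then
    L2-normalize the concatenated vector once more."""
    parts = []
    for pool in poolers:
        parts.extend(l2_normalize(pool(fmap)))
    return l2_normalize(parts)


# Hypothetical toy feature map: 2 channels x 4 spatial positions.
feature_map = [[0.0, 1.0, 2.0, 3.0], [1.0, 1.0, 1.0, 1.0]]
combined = combine_descriptors(feature_map, [avg_pool, max_pool, gem_pool])
```

Normalizing each branch before concatenation keeps any single descriptor from dominating the combined vector by scale alone. The final normalization makes a dot product between two combined descriptors behave as a cosine similarity.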