Combination of Multiple Global Descriptors for Image Retrieval
Recent studies in image retrieval task have shown that ensembling different models and combining multiple global descriptors lead to performance improvement. However, training different models for the ensemble is not only difficult but also inefficient with respect to time and memory. In this paper,...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Jun, HeeJae Ko, Byungsoo Kim, Youngjoon Kim, Insik Kim, Jongtack |
description | Recent studies in image retrieval task have shown that ensembling different
models and combining multiple global descriptors lead to performance
improvement. However, training different models for the ensemble is not only
difficult but also inefficient with respect to time and memory. In this paper,
we propose a novel framework that exploits multiple global descriptors to get
an ensemble effect while it can be trained in an end-to-end manner. The
proposed framework is flexible and expandable by the global descriptor, CNN
backbone, loss, and dataset. Moreover, we investigate the effectiveness of
combining multiple global descriptors with quantitative and qualitative
analysis. Our extensive experiments show that the combined descriptor
outperforms a single global descriptor, as it can utilize different types of
feature properties. In the benchmark evaluation, the proposed framework
achieves the state-of-the-art performance on the CARS196, CUB200-2011, In-shop
Clothes, and Stanford Online Products on image retrieval tasks. Our model
implementations and pretrained models are publicly available. |
doi_str_mv | 10.48550/arxiv.1903.10663 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1903_10663</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1903_10663</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-12e18f3b9df0f5f17ddedec182d6e426182b55389916bb54c5bb33a0276ac8463</originalsourceid><addsrcrecordid>eNotz0FLwzAYxvFcPMj0A3gyX6A16duk6UWQqnOwMRi7lzfNGwmkS0nr0G-vTk_P__TAj7E7KcraKCUeMH-GcylbAaUUWsM1e-zSaMMJl5BOPHm--4hLmCLxdUwWI3-mechhWlKeuU-Zb0Z8J36gJQc6Y7xhVx7jTLf_u2LH15dj91Zs9-tN97QtUDdQyIqk8WBb54VXXjbOkaNBmsppqiv9E1YpMG0rtbWqHpS1ACiqRuNgag0rdv93ewH0Uw4j5q_-F9JfIPANeKFC8g</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Combination of Multiple Global Descriptors for Image Retrieval</title><source>arXiv.org</source><creator>Jun, HeeJae ; Ko, Byungsoo ; Kim, Youngjoon ; Kim, Insik ; Kim, Jongtack</creator><creatorcontrib>Jun, HeeJae ; Ko, Byungsoo ; Kim, Youngjoon ; Kim, Insik ; Kim, Jongtack</creatorcontrib><description>Recent studies in image retrieval task have shown that ensembling different
models and combining multiple global descriptors lead to performance
improvement. However, training different models for the ensemble is not only
difficult but also inefficient with respect to time and memory. In this paper,
we propose a novel framework that exploits multiple global descriptors to get
an ensemble effect while it can be trained in an end-to-end manner. The
proposed framework is flexible and expandable by the global descriptor, CNN
backbone, loss, and dataset. Moreover, we investigate the effectiveness of
combining multiple global descriptors with quantitative and qualitative
analysis. Our extensive experiments show that the combined descriptor
outperforms a single global descriptor, as it can utilize different types of
feature properties. In the benchmark evaluation, the proposed framework
achieves the state-of-the-art performance on the CARS196, CUB200-2011, In-shop
Clothes, and Stanford Online Products on image retrieval tasks. Our model
implementations and pretrained models are publicly available.</description><identifier>DOI: 10.48550/arxiv.1903.10663</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition ; Computer Science - Information Retrieval ; Computer Science - Learning</subject><creationdate>2019-03</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1903.10663$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1903.10663$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Jun, HeeJae</creatorcontrib><creatorcontrib>Ko, Byungsoo</creatorcontrib><creatorcontrib>Kim, Youngjoon</creatorcontrib><creatorcontrib>Kim, Insik</creatorcontrib><creatorcontrib>Kim, Jongtack</creatorcontrib><title>Combination of Multiple Global Descriptors for Image Retrieval</title><description>Recent studies in image retrieval task have shown that ensembling different
models and combining multiple global descriptors lead to performance
improvement. However, training different models for the ensemble is not only
difficult but also inefficient with respect to time and memory. In this paper,
we propose a novel framework that exploits multiple global descriptors to get
an ensemble effect while it can be trained in an end-to-end manner. The
proposed framework is flexible and expandable by the global descriptor, CNN
backbone, loss, and dataset. Moreover, we investigate the effectiveness of
combining multiple global descriptors with quantitative and qualitative
analysis. Our extensive experiments show that the combined descriptor
outperforms a single global descriptor, as it can utilize different types of
feature properties. In the benchmark evaluation, the proposed framework
achieves the state-of-the-art performance on the CARS196, CUB200-2011, In-shop
Clothes, and Stanford Online Products on image retrieval tasks. Our model
implementations and pretrained models are publicly available.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Computer Science - Information Retrieval</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz0FLwzAYxvFcPMj0A3gyX6A16duk6UWQqnOwMRi7lzfNGwmkS0nr0G-vTk_P__TAj7E7KcraKCUeMH-GcylbAaUUWsM1e-zSaMMJl5BOPHm--4hLmCLxdUwWI3-mechhWlKeuU-Zb0Z8J36gJQc6Y7xhVx7jTLf_u2LH15dj91Zs9-tN97QtUDdQyIqk8WBb54VXXjbOkaNBmsppqiv9E1YpMG0rtbWqHpS1ACiqRuNgag0rdv93ewH0Uw4j5q_-F9JfIPANeKFC8g</recordid><startdate>20190325</startdate><enddate>20190325</enddate><creator>Jun, HeeJae</creator><creator>Ko, Byungsoo</creator><creator>Kim, Youngjoon</creator><creator>Kim, Insik</creator><creator>Kim, Jongtack</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20190325</creationdate><title>Combination of Multiple Global Descriptors for Image Retrieval</title><author>Jun, HeeJae ; Ko, Byungsoo ; Kim, Youngjoon ; Kim, Insik ; Kim, Jongtack</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-12e18f3b9df0f5f17ddedec182d6e426182b55389916bb54c5bb33a0276ac8463</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Computer Science - Information Retrieval</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Jun, HeeJae</creatorcontrib><creatorcontrib>Ko, Byungsoo</creatorcontrib><creatorcontrib>Kim, Youngjoon</creatorcontrib><creatorcontrib>Kim, Insik</creatorcontrib><creatorcontrib>Kim, Jongtack</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Jun, HeeJae</au><au>Ko, Byungsoo</au><au>Kim, Youngjoon</au><au>Kim, Insik</au><au>Kim, Jongtack</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Combination of Multiple Global Descriptors for Image Retrieval</atitle><date>2019-03-25</date><risdate>2019</risdate><abstract>Recent studies in image retrieval task have shown that ensembling different
models and combining multiple global descriptors lead to performance
improvement. However, training different models for the ensemble is not only
difficult but also inefficient with respect to time and memory. In this paper,
we propose a novel framework that exploits multiple global descriptors to get
an ensemble effect while it can be trained in an end-to-end manner. The
proposed framework is flexible and expandable by the global descriptor, CNN
backbone, loss, and dataset. Moreover, we investigate the effectiveness of
combining multiple global descriptors with quantitative and qualitative
analysis. Our extensive experiments show that the combined descriptor
outperforms a single global descriptor, as it can utilize different types of
feature properties. In the benchmark evaluation, the proposed framework
achieves the state-of-the-art performance on the CARS196, CUB200-2011, In-shop
Clothes, and Stanford Online Products on image retrieval tasks. Our model
implementations and pretrained models are publicly available.</abstract><doi>10.48550/arxiv.1903.10663</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.1903.10663 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_1903_10663 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition Computer Science - Information Retrieval Computer Science - Learning |
title | Combination of Multiple Global Descriptors for Image Retrieval |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T20%3A19%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Combination%20of%20Multiple%20Global%20Descriptors%20for%20Image%20Retrieval&rft.au=Jun,%20HeeJae&rft.date=2019-03-25&rft_id=info:doi/10.48550/arxiv.1903.10663&rft_dat=%3Carxiv_GOX%3E1903_10663%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |