Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification
The difficulty of fine-grained image classification mainly comes from a shared overall appearance across classes. Thus, recognizing discriminative details, such as the eyes and beaks of birds, is a key to the task. However, this is particularly challenging when training data is limited. To address t...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence 2024-11, p.1-16 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 16 |
---|---|
container_issue | |
container_start_page | 1 |
container_title | IEEE transactions on pattern analysis and machine intelligence |
container_volume | |
creator | Lee, SuBeen Moon, WonJun Seong, Hyun Seok Heo, Jae-Pil |
description | The difficulty of fine-grained image classification mainly comes from a shared overall appearance across classes. Thus, recognizing discriminative details, such as the eyes and beaks of birds, is a key to the task. However, this is particularly challenging when training data is limited. To address this, we propose Task Discrepancy Maximization (TDM), a task-oriented channel attention method tailored for fine-grained few-shot classification with two novel modules Support Attention Module (SAM) and Query Attention Module (QAM). SAM highlights channels encoding class-wise discriminative features, while QAM assigns higher weights to object-relevant channels of the query. Based on these submodules, TDM produces task-adaptive features by focusing on channels encoding class-discriminative details and possessed by the query at the same time, for accurate class-sensitive similarity measure between support and query instances. While TDM influences high-level feature maps by task-adaptive calibration of channel-wise importance, we further introduce Instance Attention Module (IAM) operating in intermediate layers of feature extractors to instance-wisely highlight object-relevant channels, by extending QAM. The merits of TDM and IAM and their complementary benefits are experimentally validated in fine-grained few-shot classification tasks. Moreover, IAM is also effective in coarse-grained and cross-domain few-shot classifications. |
doi_str_mv | 10.1109/TPAMI.2024.3504537 |
format | Article |
fullrecord | <record><control><sourceid>crossref_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TPAMI_2024_3504537</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10763467</ieee_id><sourcerecordid>10_1109_TPAMI_2024_3504537</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1087-bf64da8737533775de53f98f7a816d7de8ad8384a6ea708db42a27435a7821e83</originalsourceid><addsrcrecordid>eNpNkMtKAzEYhYMoWKsvIC7mBVJzT2ZZBnuBSgXH9fB38odG64wkA-LbO7VduDpwON9ZfITcczbjnJWP9cv8eT0TTKiZ1ExpaS_IRHDDaClKcUkmjBtBnRPumtzk_M4YV5rJCVnVkD_oNkXsBvRFtYeuw0MxH4axiH1XhD4Vi9ghXSYYwxcL_Kav-34oqgPkHENs4Ti8JVcBDhnvzjklb4unulrRzXa5ruYb2nLmLN0Fozw4K62W0lrtUctQumDBceOtRwfeSafAIFjm_E4JEFZJDdYJjk5OiTj9tqnPOWFovlL8hPTTcNYcXTR_Lpqji-bsYoQeTlBExH-ANVIZK38BFBVaiw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification</title><source>IEEE</source><creator>Lee, SuBeen ; Moon, WonJun ; Seong, Hyun Seok ; Heo, Jae-Pil</creator><creatorcontrib>Lee, SuBeen ; Moon, WonJun ; Seong, Hyun Seok ; Heo, Jae-Pil</creatorcontrib><description>The difficulty of fine-grained image classification mainly comes from a shared overall appearance across classes. Thus, recognizing discriminative details, such as the eyes and beaks of birds, is a key to the task. However, this is particularly challenging when training data is limited. To address this, we propose Task Discrepancy Maximization (TDM), a task-oriented channel attention method tailored for fine-grained few-shot classification with two novel modules Support Attention Module (SAM) and Query Attention Module (QAM). SAM highlights channels encoding class-wise discriminative features, while QAM assigns higher weights to object-relevant channels of the query. Based on these submodules, TDM produces task-adaptive features by focusing on channels encoding class-discriminative details and possessed by the query at the same time, for accurate class-sensitive similarity measure between support and query instances. While TDM influences high-level feature maps by task-adaptive calibration of channel-wise importance, we further introduce Instance Attention Module (IAM) operating in intermediate layers of feature extractors to instance-wisely highlight object-relevant channels, by extending QAM. The merits of TDM and IAM and their complementary benefits are experimentally validated in fine-grained few-shot classification tasks. Moreover, IAM is also effective in coarse-grained and cross-domain few-shot classifications.</description><identifier>ISSN: 0162-8828</identifier><identifier>EISSN: 2160-9292</identifier><identifier>DOI: 10.1109/TPAMI.2024.3504537</identifier><identifier>CODEN: ITPIDJ</identifier><language>eng</language><publisher>IEEE</publisher><subject>Attention Module ; Costs ; Feature Alignment ; Feature extraction ; Few-Shot Classification ; Fine-grained Classification ; Moon ; Optimization ; Quadrature amplitude modulation ; Spatial resolution ; Streams ; Time division multiplexing ; Training ; Vectors</subject><ispartof>IEEE transactions on pattern analysis and machine intelligence, 2024-11, p.1-16</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0001-9684-7641 ; 0009-0005-1470-1160 ; 0000-0003-2805-0926 ; 0000-0002-7952-2017</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10763467$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10763467$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Lee, SuBeen</creatorcontrib><creatorcontrib>Moon, WonJun</creatorcontrib><creatorcontrib>Seong, Hyun Seok</creatorcontrib><creatorcontrib>Heo, Jae-Pil</creatorcontrib><title>Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification</title><title>IEEE transactions on pattern analysis and machine intelligence</title><addtitle>TPAMI</addtitle><description>The difficulty of fine-grained image classification mainly comes from a shared overall appearance across classes. Thus, recognizing discriminative details, such as the eyes and beaks of birds, is a key to the task. However, this is particularly challenging when training data is limited. To address this, we propose Task Discrepancy Maximization (TDM), a task-oriented channel attention method tailored for fine-grained few-shot classification with two novel modules Support Attention Module (SAM) and Query Attention Module (QAM). SAM highlights channels encoding class-wise discriminative features, while QAM assigns higher weights to object-relevant channels of the query. Based on these submodules, TDM produces task-adaptive features by focusing on channels encoding class-discriminative details and possessed by the query at the same time, for accurate class-sensitive similarity measure between support and query instances. While TDM influences high-level feature maps by task-adaptive calibration of channel-wise importance, we further introduce Instance Attention Module (IAM) operating in intermediate layers of feature extractors to instance-wisely highlight object-relevant channels, by extending QAM. The merits of TDM and IAM and their complementary benefits are experimentally validated in fine-grained few-shot classification tasks. Moreover, IAM is also effective in coarse-grained and cross-domain few-shot classifications.</description><subject>Attention Module</subject><subject>Costs</subject><subject>Feature Alignment</subject><subject>Feature extraction</subject><subject>Few-Shot Classification</subject><subject>Fine-grained Classification</subject><subject>Moon</subject><subject>Optimization</subject><subject>Quadrature amplitude modulation</subject><subject>Spatial resolution</subject><subject>Streams</subject><subject>Time division multiplexing</subject><subject>Training</subject><subject>Vectors</subject><issn>0162-8828</issn><issn>2160-9292</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkMtKAzEYhYMoWKsvIC7mBVJzT2ZZBnuBSgXH9fB38odG64wkA-LbO7VduDpwON9ZfITcczbjnJWP9cv8eT0TTKiZ1ExpaS_IRHDDaClKcUkmjBtBnRPumtzk_M4YV5rJCVnVkD_oNkXsBvRFtYeuw0MxH4axiH1XhD4Vi9ghXSYYwxcL_Kav-34oqgPkHENs4Ti8JVcBDhnvzjklb4unulrRzXa5ruYb2nLmLN0Fozw4K62W0lrtUctQumDBceOtRwfeSafAIFjm_E4JEFZJDdYJjk5OiTj9tqnPOWFovlL8hPTTcNYcXTR_Lpqji-bsYoQeTlBExH-ANVIZK38BFBVaiw</recordid><startdate>20241121</startdate><enddate>20241121</enddate><creator>Lee, SuBeen</creator><creator>Moon, WonJun</creator><creator>Seong, Hyun Seok</creator><creator>Heo, Jae-Pil</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-9684-7641</orcidid><orcidid>https://orcid.org/0009-0005-1470-1160</orcidid><orcidid>https://orcid.org/0000-0003-2805-0926</orcidid><orcidid>https://orcid.org/0000-0002-7952-2017</orcidid></search><sort><creationdate>20241121</creationdate><title>Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification</title><author>Lee, SuBeen ; Moon, WonJun ; Seong, Hyun Seok ; Heo, Jae-Pil</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1087-bf64da8737533775de53f98f7a816d7de8ad8384a6ea708db42a27435a7821e83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Attention Module</topic><topic>Costs</topic><topic>Feature Alignment</topic><topic>Feature extraction</topic><topic>Few-Shot Classification</topic><topic>Fine-grained Classification</topic><topic>Moon</topic><topic>Optimization</topic><topic>Quadrature amplitude modulation</topic><topic>Spatial resolution</topic><topic>Streams</topic><topic>Time division multiplexing</topic><topic>Training</topic><topic>Vectors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lee, SuBeen</creatorcontrib><creatorcontrib>Moon, WonJun</creatorcontrib><creatorcontrib>Seong, Hyun Seok</creatorcontrib><creatorcontrib>Heo, Jae-Pil</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE</collection><collection>CrossRef</collection><jtitle>IEEE transactions on pattern analysis and machine intelligence</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Lee, SuBeen</au><au>Moon, WonJun</au><au>Seong, Hyun Seok</au><au>Heo, Jae-Pil</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification</atitle><jtitle>IEEE transactions on pattern analysis and machine intelligence</jtitle><stitle>TPAMI</stitle><date>2024-11-21</date><risdate>2024</risdate><spage>1</spage><epage>16</epage><pages>1-16</pages><issn>0162-8828</issn><eissn>2160-9292</eissn><coden>ITPIDJ</coden><abstract>The difficulty of fine-grained image classification mainly comes from a shared overall appearance across classes. Thus, recognizing discriminative details, such as the eyes and beaks of birds, is a key to the task. However, this is particularly challenging when training data is limited. To address this, we propose Task Discrepancy Maximization (TDM), a task-oriented channel attention method tailored for fine-grained few-shot classification with two novel modules Support Attention Module (SAM) and Query Attention Module (QAM). SAM highlights channels encoding class-wise discriminative features, while QAM assigns higher weights to object-relevant channels of the query. Based on these submodules, TDM produces task-adaptive features by focusing on channels encoding class-discriminative details and possessed by the query at the same time, for accurate class-sensitive similarity measure between support and query instances. While TDM influences high-level feature maps by task-adaptive calibration of channel-wise importance, we further introduce Instance Attention Module (IAM) operating in intermediate layers of feature extractors to instance-wisely highlight object-relevant channels, by extending QAM. The merits of TDM and IAM and their complementary benefits are experimentally validated in fine-grained few-shot classification tasks. Moreover, IAM is also effective in coarse-grained and cross-domain few-shot classifications.</abstract><pub>IEEE</pub><doi>10.1109/TPAMI.2024.3504537</doi><tpages>16</tpages><orcidid>https://orcid.org/0000-0001-9684-7641</orcidid><orcidid>https://orcid.org/0009-0005-1470-1160</orcidid><orcidid>https://orcid.org/0000-0003-2805-0926</orcidid><orcidid>https://orcid.org/0000-0002-7952-2017</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 0162-8828 |
ispartof | IEEE transactions on pattern analysis and machine intelligence, 2024-11, p.1-16 |
issn | 0162-8828 2160-9292 |
language | eng |
recordid | cdi_crossref_primary_10_1109_TPAMI_2024_3504537 |
source | IEEE |
subjects | Attention Module Costs Feature Alignment Feature extraction Few-Shot Classification Fine-grained Classification Moon Optimization Quadrature amplitude modulation Spatial resolution Streams Time division multiplexing Training Vectors |
title | Task-Oriented Channel Attention for Fine-Grained Few-Shot Classification |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T14%3A33%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Task-Oriented%20Channel%20Attention%20for%20Fine-Grained%20Few-Shot%20Classification&rft.jtitle=IEEE%20transactions%20on%20pattern%20analysis%20and%20machine%20intelligence&rft.au=Lee,%20SuBeen&rft.date=2024-11-21&rft.spage=1&rft.epage=16&rft.pages=1-16&rft.issn=0162-8828&rft.eissn=2160-9292&rft.coden=ITPIDJ&rft_id=info:doi/10.1109/TPAMI.2024.3504537&rft_dat=%3Ccrossref_RIE%3E10_1109_TPAMI_2024_3504537%3C/crossref_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10763467&rfr_iscdi=true |