InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution
Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolu...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on circuits and systems for video technology 2024-11, p.1-1 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1 |
---|---|
container_issue | |
container_start_page | 1 |
container_title | IEEE transactions on circuits and systems for video technology |
container_volume | |
creator | Fan, Yuanting Liu, Chengxu Tian, Ruhao Qian, Xueming |
description | Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition. |
doi_str_mv | 10.1109/TCSVT.2024.3496664 |
format | Article |
fullrecord | <record><control><sourceid>crossref_RIE</sourceid><recordid>TN_cdi_ieee_primary_10750855</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10750855</ieee_id><sourcerecordid>10_1109_TCSVT_2024_3496664</sourcerecordid><originalsourceid>FETCH-LOGICAL-c645-3bf700e67b35ceb44d69ba9cfb013efe6772a60a16f8c1208a7094a7d7012e413</originalsourceid><addsrcrecordid>eNpNkM1Kw0AUhQdRsFZfQFzMC0y9M5mfxJ3UqoVCoQluw2R6R6ekSZmkim9vaiu4uhfO-c7iI-SWw4RzyO6Laf5WTAQIOUlkprWWZ2TElUqZEKDOhx8UZ6ng6pJcdd0GgMtUmhFx86brbeMwXz3QmffBBWx6ukLXDkHcuz407zTf2rqmy2qDrqdfof-gT8F7jEM12Jr-bbAaP7Gm-X6Hka2wa-t9H9rmmlx4W3d4c7pjUjzPiukrWyxf5tPHBXNaKpZU3gCgNlWiHFZSrnVW2cz5CniCfgiMsBos1z51XEBqDWTSmrUBLlDyZEzEcdbFtusi-nIXw9bG75JDebBU_loqD5bKk6UBujtCARH_AUZBqlTyA9wlZaw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</title><source>IEEE Electronic Library (IEL)</source><creator>Fan, Yuanting ; Liu, Chengxu ; Tian, Ruhao ; Qian, Xueming</creator><creatorcontrib>Fan, Yuanting ; Liu, Chengxu ; Tian, Ruhao ; Qian, Xueming</creatorcontrib><description>Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition.</description><identifier>ISSN: 1051-8215</identifier><identifier>EISSN: 1558-2205</identifier><identifier>DOI: 10.1109/TCSVT.2024.3496664</identifier><identifier>CODEN: ITCTEM</identifier><language>eng</language><publisher>IEEE</publisher><subject>Decoding ; Electronic mail ; Feature extraction ; Generative adversarial networks ; Image recognition ; Low-Level Vision ; Machine Recognition ; Object detection ; Pipelines ; Semantics ; Super Resolution ; Superresolution ; Visualization</subject><ispartof>IEEE transactions on circuits and systems for video technology, 2024-11, p.1-1</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0009-0008-6507-666X ; 0009-0005-5834-4927 ; 0000-0001-8023-9465 ; 0000-0002-3173-6307</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10750855$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27922,27923,54756</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10750855$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Fan, Yuanting</creatorcontrib><creatorcontrib>Liu, Chengxu</creatorcontrib><creatorcontrib>Tian, Ruhao</creatorcontrib><creatorcontrib>Qian, Xueming</creatorcontrib><title>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</title><title>IEEE transactions on circuits and systems for video technology</title><addtitle>TCSVT</addtitle><description>Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition.</description><subject>Decoding</subject><subject>Electronic mail</subject><subject>Feature extraction</subject><subject>Generative adversarial networks</subject><subject>Image recognition</subject><subject>Low-Level Vision</subject><subject>Machine Recognition</subject><subject>Object detection</subject><subject>Pipelines</subject><subject>Semantics</subject><subject>Super Resolution</subject><subject>Superresolution</subject><subject>Visualization</subject><issn>1051-8215</issn><issn>1558-2205</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkM1Kw0AUhQdRsFZfQFzMC0y9M5mfxJ3UqoVCoQluw2R6R6ekSZmkim9vaiu4uhfO-c7iI-SWw4RzyO6Laf5WTAQIOUlkprWWZ2TElUqZEKDOhx8UZ6ng6pJcdd0GgMtUmhFx86brbeMwXz3QmffBBWx6ukLXDkHcuz407zTf2rqmy2qDrqdfof-gT8F7jEM12Jr-bbAaP7Gm-X6Hka2wa-t9H9rmmlx4W3d4c7pjUjzPiukrWyxf5tPHBXNaKpZU3gCgNlWiHFZSrnVW2cz5CniCfgiMsBos1z51XEBqDWTSmrUBLlDyZEzEcdbFtusi-nIXw9bG75JDebBU_loqD5bKk6UBujtCARH_AUZBqlTyA9wlZaw</recordid><startdate>20241111</startdate><enddate>20241111</enddate><creator>Fan, Yuanting</creator><creator>Liu, Chengxu</creator><creator>Tian, Ruhao</creator><creator>Qian, Xueming</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0009-0008-6507-666X</orcidid><orcidid>https://orcid.org/0009-0005-5834-4927</orcidid><orcidid>https://orcid.org/0000-0001-8023-9465</orcidid><orcidid>https://orcid.org/0000-0002-3173-6307</orcidid></search><sort><creationdate>20241111</creationdate><title>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</title><author>Fan, Yuanting ; Liu, Chengxu ; Tian, Ruhao ; Qian, Xueming</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c645-3bf700e67b35ceb44d69ba9cfb013efe6772a60a16f8c1208a7094a7d7012e413</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Decoding</topic><topic>Electronic mail</topic><topic>Feature extraction</topic><topic>Generative adversarial networks</topic><topic>Image recognition</topic><topic>Low-Level Vision</topic><topic>Machine Recognition</topic><topic>Object detection</topic><topic>Pipelines</topic><topic>Semantics</topic><topic>Super Resolution</topic><topic>Superresolution</topic><topic>Visualization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fan, Yuanting</creatorcontrib><creatorcontrib>Liu, Chengxu</creatorcontrib><creatorcontrib>Tian, Ruhao</creatorcontrib><creatorcontrib>Qian, Xueming</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><jtitle>IEEE transactions on circuits and systems for video technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Fan, Yuanting</au><au>Liu, Chengxu</au><au>Tian, Ruhao</au><au>Qian, Xueming</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</atitle><jtitle>IEEE transactions on circuits and systems for video technology</jtitle><stitle>TCSVT</stitle><date>2024-11-11</date><risdate>2024</risdate><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>1051-8215</issn><eissn>1558-2205</eissn><coden>ITCTEM</coden><abstract>Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition.</abstract><pub>IEEE</pub><doi>10.1109/TCSVT.2024.3496664</doi><tpages>1</tpages><orcidid>https://orcid.org/0009-0008-6507-666X</orcidid><orcidid>https://orcid.org/0009-0005-5834-4927</orcidid><orcidid>https://orcid.org/0000-0001-8023-9465</orcidid><orcidid>https://orcid.org/0000-0002-3173-6307</orcidid></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1051-8215 |
ispartof | IEEE transactions on circuits and systems for video technology, 2024-11, p.1-1 |
issn | 1051-8215 1558-2205 |
language | eng |
recordid | cdi_ieee_primary_10750855 |
source | IEEE Electronic Library (IEL) |
subjects | Decoding Electronic mail Feature extraction Generative adversarial networks Image recognition Low-Level Vision Machine Recognition Object detection Pipelines Semantics Super Resolution Superresolution Visualization |
title | InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T02%3A22%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=InstanceSR:%20Efficient%20Reconstructing%20Small%20Object%20with%20Differential%20Instance-level%20Super-Resolution&rft.jtitle=IEEE%20transactions%20on%20circuits%20and%20systems%20for%20video%20technology&rft.au=Fan,%20Yuanting&rft.date=2024-11-11&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=1051-8215&rft.eissn=1558-2205&rft.coden=ITCTEM&rft_id=info:doi/10.1109/TCSVT.2024.3496664&rft_dat=%3Ccrossref_RIE%3E10_1109_TCSVT_2024_3496664%3C/crossref_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10750855&rfr_iscdi=true |