InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution

Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolu...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on circuits and systems for video technology 2024-11, p.1-1
Hauptverfasser: Fan, Yuanting, Liu, Chengxu, Tian, Ruhao, Qian, Xueming
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1
container_issue
container_start_page 1
container_title IEEE transactions on circuits and systems for video technology
container_volume
creator Fan, Yuanting
Liu, Chengxu
Tian, Ruhao
Qian, Xueming
description Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition.
doi_str_mv 10.1109/TCSVT.2024.3496664
format Article
fullrecord <record><control><sourceid>crossref_RIE</sourceid><recordid>TN_cdi_ieee_primary_10750855</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10750855</ieee_id><sourcerecordid>10_1109_TCSVT_2024_3496664</sourcerecordid><originalsourceid>FETCH-LOGICAL-c645-3bf700e67b35ceb44d69ba9cfb013efe6772a60a16f8c1208a7094a7d7012e413</originalsourceid><addsrcrecordid>eNpNkM1Kw0AUhQdRsFZfQFzMC0y9M5mfxJ3UqoVCoQluw2R6R6ekSZmkim9vaiu4uhfO-c7iI-SWw4RzyO6Laf5WTAQIOUlkprWWZ2TElUqZEKDOhx8UZ6ng6pJcdd0GgMtUmhFx86brbeMwXz3QmffBBWx6ukLXDkHcuz407zTf2rqmy2qDrqdfof-gT8F7jEM12Jr-bbAaP7Gm-X6Hka2wa-t9H9rmmlx4W3d4c7pjUjzPiukrWyxf5tPHBXNaKpZU3gCgNlWiHFZSrnVW2cz5CniCfgiMsBos1z51XEBqDWTSmrUBLlDyZEzEcdbFtusi-nIXw9bG75JDebBU_loqD5bKk6UBujtCARH_AUZBqlTyA9wlZaw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</title><source>IEEE Electronic Library (IEL)</source><creator>Fan, Yuanting ; Liu, Chengxu ; Tian, Ruhao ; Qian, Xueming</creator><creatorcontrib>Fan, Yuanting ; Liu, Chengxu ; Tian, Ruhao ; Qian, Xueming</creatorcontrib><description>Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition.</description><identifier>ISSN: 1051-8215</identifier><identifier>EISSN: 1558-2205</identifier><identifier>DOI: 10.1109/TCSVT.2024.3496664</identifier><identifier>CODEN: ITCTEM</identifier><language>eng</language><publisher>IEEE</publisher><subject>Decoding ; Electronic mail ; Feature extraction ; Generative adversarial networks ; Image recognition ; Low-Level Vision ; Machine Recognition ; Object detection ; Pipelines ; Semantics ; Super Resolution ; Superresolution ; Visualization</subject><ispartof>IEEE transactions on circuits and systems for video technology, 2024-11, p.1-1</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0009-0008-6507-666X ; 0009-0005-5834-4927 ; 0000-0001-8023-9465 ; 0000-0002-3173-6307</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10750855$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27922,27923,54756</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10750855$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Fan, Yuanting</creatorcontrib><creatorcontrib>Liu, Chengxu</creatorcontrib><creatorcontrib>Tian, Ruhao</creatorcontrib><creatorcontrib>Qian, Xueming</creatorcontrib><title>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</title><title>IEEE transactions on circuits and systems for video technology</title><addtitle>TCSVT</addtitle><description>Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition.</description><subject>Decoding</subject><subject>Electronic mail</subject><subject>Feature extraction</subject><subject>Generative adversarial networks</subject><subject>Image recognition</subject><subject>Low-Level Vision</subject><subject>Machine Recognition</subject><subject>Object detection</subject><subject>Pipelines</subject><subject>Semantics</subject><subject>Super Resolution</subject><subject>Superresolution</subject><subject>Visualization</subject><issn>1051-8215</issn><issn>1558-2205</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkM1Kw0AUhQdRsFZfQFzMC0y9M5mfxJ3UqoVCoQluw2R6R6ekSZmkim9vaiu4uhfO-c7iI-SWw4RzyO6Laf5WTAQIOUlkprWWZ2TElUqZEKDOhx8UZ6ng6pJcdd0GgMtUmhFx86brbeMwXz3QmffBBWx6ukLXDkHcuz407zTf2rqmy2qDrqdfof-gT8F7jEM12Jr-bbAaP7Gm-X6Hka2wa-t9H9rmmlx4W3d4c7pjUjzPiukrWyxf5tPHBXNaKpZU3gCgNlWiHFZSrnVW2cz5CniCfgiMsBos1z51XEBqDWTSmrUBLlDyZEzEcdbFtusi-nIXw9bG75JDebBU_loqD5bKk6UBujtCARH_AUZBqlTyA9wlZaw</recordid><startdate>20241111</startdate><enddate>20241111</enddate><creator>Fan, Yuanting</creator><creator>Liu, Chengxu</creator><creator>Tian, Ruhao</creator><creator>Qian, Xueming</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0009-0008-6507-666X</orcidid><orcidid>https://orcid.org/0009-0005-5834-4927</orcidid><orcidid>https://orcid.org/0000-0001-8023-9465</orcidid><orcidid>https://orcid.org/0000-0002-3173-6307</orcidid></search><sort><creationdate>20241111</creationdate><title>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</title><author>Fan, Yuanting ; Liu, Chengxu ; Tian, Ruhao ; Qian, Xueming</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c645-3bf700e67b35ceb44d69ba9cfb013efe6772a60a16f8c1208a7094a7d7012e413</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Decoding</topic><topic>Electronic mail</topic><topic>Feature extraction</topic><topic>Generative adversarial networks</topic><topic>Image recognition</topic><topic>Low-Level Vision</topic><topic>Machine Recognition</topic><topic>Object detection</topic><topic>Pipelines</topic><topic>Semantics</topic><topic>Super Resolution</topic><topic>Superresolution</topic><topic>Visualization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fan, Yuanting</creatorcontrib><creatorcontrib>Liu, Chengxu</creatorcontrib><creatorcontrib>Tian, Ruhao</creatorcontrib><creatorcontrib>Qian, Xueming</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><jtitle>IEEE transactions on circuits and systems for video technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Fan, Yuanting</au><au>Liu, Chengxu</au><au>Tian, Ruhao</au><au>Qian, Xueming</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</atitle><jtitle>IEEE transactions on circuits and systems for video technology</jtitle><stitle>TCSVT</stitle><date>2024-11-11</date><risdate>2024</risdate><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>1051-8215</issn><eissn>1558-2205</eissn><coden>ITCTEM</coden><abstract>Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition.</abstract><pub>IEEE</pub><doi>10.1109/TCSVT.2024.3496664</doi><tpages>1</tpages><orcidid>https://orcid.org/0009-0008-6507-666X</orcidid><orcidid>https://orcid.org/0009-0005-5834-4927</orcidid><orcidid>https://orcid.org/0000-0001-8023-9465</orcidid><orcidid>https://orcid.org/0000-0002-3173-6307</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1051-8215
ispartof IEEE transactions on circuits and systems for video technology, 2024-11, p.1-1
issn 1051-8215
1558-2205
language eng
recordid cdi_ieee_primary_10750855
source IEEE Electronic Library (IEL)
subjects Decoding
Electronic mail
Feature extraction
Generative adversarial networks
Image recognition
Low-Level Vision
Machine Recognition
Object detection
Pipelines
Semantics
Super Resolution
Superresolution
Visualization
title InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T02%3A22%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=InstanceSR:%20Efficient%20Reconstructing%20Small%20Object%20with%20Differential%20Instance-level%20Super-Resolution&rft.jtitle=IEEE%20transactions%20on%20circuits%20and%20systems%20for%20video%20technology&rft.au=Fan,%20Yuanting&rft.date=2024-11-11&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=1051-8215&rft.eissn=1558-2205&rft.coden=ITCTEM&rft_id=info:doi/10.1109/TCSVT.2024.3496664&rft_dat=%3Ccrossref_RIE%3E10_1109_TCSVT_2024_3496664%3C/crossref_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10750855&rfr_iscdi=true