InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution

Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolu...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on circuits and systems for video technology 2024-11, p.1-1
Hauptverfasser:	Fan, Yuanting, Liu, Chengxu, Tian, Ruhao, Qian, Xueming
Format:	Artikel
Sprache:	eng
Schlagworte:	Decoding Electronic mail Feature extraction Generative adversarial networks Image recognition Low-Level Vision Machine Recognition Object detection Pipelines Semantics Super Resolution Superresolution Visualization
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1
container_issue
container_start_page	1
container_title	IEEE transactions on circuits and systems for video technology
container_volume
creator	Fan, Yuanting Liu, Chengxu Tian, Ruhao Qian, Xueming
description	Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition.
doi_str_mv	10.1109/TCSVT.2024.3496664
format	Article
fullrecord	<record><control><sourceid>crossref_RIE</sourceid><recordid>TN_cdi_ieee_primary_10750855</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10750855</ieee_id><sourcerecordid>10_1109_TCSVT_2024_3496664</sourcerecordid><originalsourceid>FETCH-LOGICAL-c645-3bf700e67b35ceb44d69ba9cfb013efe6772a60a16f8c1208a7094a7d7012e413</originalsourceid><addsrcrecordid>eNpNkM1Kw0AUhQdRsFZfQFzMC0y9M5mfxJ3UqoVCoQluw2R6R6ekSZmkim9vaiu4uhfO-c7iI-SWw4RzyO6Laf5WTAQIOUlkprWWZ2TElUqZEKDOhx8UZ6ng6pJcdd0GgMtUmhFx86brbeMwXz3QmffBBWx6ukLXDkHcuz407zTf2rqmy2qDrqdfof-gT8F7jEM12Jr-bbAaP7Gm-X6Hka2wa-t9H9rmmlx4W3d4c7pjUjzPiukrWyxf5tPHBXNaKpZU3gCgNlWiHFZSrnVW2cz5CniCfgiMsBos1z51XEBqDWTSmrUBLlDyZEzEcdbFtusi-nIXw9bG75JDebBU_loqD5bKk6UBujtCARH_AUZBqlTyA9wlZaw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</title><source>IEEE Electronic Library (IEL)</source><creator>Fan, Yuanting ; Liu, Chengxu ; Tian, Ruhao ; Qian, Xueming</creator><creatorcontrib>Fan, Yuanting ; Liu, Chengxu ; Tian, Ruhao ; Qian, Xueming</creatorcontrib><description>Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition.</description><identifier>ISSN: 1051-8215</identifier><identifier>EISSN: 1558-2205</identifier><identifier>DOI: 10.1109/TCSVT.2024.3496664</identifier><identifier>CODEN: ITCTEM</identifier><language>eng</language><publisher>IEEE</publisher><subject>Decoding ; Electronic mail ; Feature extraction ; Generative adversarial networks ; Image recognition ; Low-Level Vision ; Machine Recognition ; Object detection ; Pipelines ; Semantics ; Super Resolution ; Superresolution ; Visualization</subject><ispartof>IEEE transactions on circuits and systems for video technology, 2024-11, p.1-1</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0009-0008-6507-666X ; 0009-0005-5834-4927 ; 0000-0001-8023-9465 ; 0000-0002-3173-6307</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10750855$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27922,27923,54756</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10750855$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Fan, Yuanting</creatorcontrib><creatorcontrib>Liu, Chengxu</creatorcontrib><creatorcontrib>Tian, Ruhao</creatorcontrib><creatorcontrib>Qian, Xueming</creatorcontrib><title>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</title><title>IEEE transactions on circuits and systems for video technology</title><addtitle>TCSVT</addtitle><description>Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition.</description><subject>Decoding</subject><subject>Electronic mail</subject><subject>Feature extraction</subject><subject>Generative adversarial networks</subject><subject>Image recognition</subject><subject>Low-Level Vision</subject><subject>Machine Recognition</subject><subject>Object detection</subject><subject>Pipelines</subject><subject>Semantics</subject><subject>Super Resolution</subject><subject>Superresolution</subject><subject>Visualization</subject><issn>1051-8215</issn><issn>1558-2205</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkM1Kw0AUhQdRsFZfQFzMC0y9M5mfxJ3UqoVCoQluw2R6R6ekSZmkim9vaiu4uhfO-c7iI-SWw4RzyO6Laf5WTAQIOUlkprWWZ2TElUqZEKDOhx8UZ6ng6pJcdd0GgMtUmhFx86brbeMwXz3QmffBBWx6ukLXDkHcuz407zTf2rqmy2qDrqdfof-gT8F7jEM12Jr-bbAaP7Gm-X6Hka2wa-t9H9rmmlx4W3d4c7pjUjzPiukrWyxf5tPHBXNaKpZU3gCgNlWiHFZSrnVW2cz5CniCfgiMsBos1z51XEBqDWTSmrUBLlDyZEzEcdbFtusi-nIXw9bG75JDebBU_loqD5bKk6UBujtCARH_AUZBqlTyA9wlZaw</recordid><startdate>20241111</startdate><enddate>20241111</enddate><creator>Fan, Yuanting</creator><creator>Liu, Chengxu</creator><creator>Tian, Ruhao</creator><creator>Qian, Xueming</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0009-0008-6507-666X</orcidid><orcidid>https://orcid.org/0009-0005-5834-4927</orcidid><orcidid>https://orcid.org/0000-0001-8023-9465</orcidid><orcidid>https://orcid.org/0000-0002-3173-6307</orcidid></search><sort><creationdate>20241111</creationdate><title>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</title><author>Fan, Yuanting ; Liu, Chengxu ; Tian, Ruhao ; Qian, Xueming</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c645-3bf700e67b35ceb44d69ba9cfb013efe6772a60a16f8c1208a7094a7d7012e413</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Decoding</topic><topic>Electronic mail</topic><topic>Feature extraction</topic><topic>Generative adversarial networks</topic><topic>Image recognition</topic><topic>Low-Level Vision</topic><topic>Machine Recognition</topic><topic>Object detection</topic><topic>Pipelines</topic><topic>Semantics</topic><topic>Super Resolution</topic><topic>Superresolution</topic><topic>Visualization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fan, Yuanting</creatorcontrib><creatorcontrib>Liu, Chengxu</creatorcontrib><creatorcontrib>Tian, Ruhao</creatorcontrib><creatorcontrib>Qian, Xueming</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><jtitle>IEEE transactions on circuits and systems for video technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Fan, Yuanting</au><au>Liu, Chengxu</au><au>Tian, Ruhao</au><au>Qian, Xueming</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution</atitle><jtitle>IEEE transactions on circuits and systems for video technology</jtitle><stitle>TCSVT</stitle><date>2024-11-11</date><risdate>2024</risdate><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>1051-8215</issn><eissn>1558-2205</eissn><coden>ITCTEM</coden><abstract>Super-resolution (SR) aims to restore a high-resolution (HR) image from its low-resolution (LR) counterpart. Existing works try to achieve an overall average recovery over all regions to provide better visual quality for human viewing. If we desire to explore the potential that performs super-resolution for machine recognition instead of human viewing, the solution should change accordingly. From this insight, we propose a new SR pipeline, called InstanceSR, which treats each region in the LR image differentially and consumes more resources to focus on the recovery of the foreground region where the instances exist. In particular, InstanceSR consists of an encoder that formulates the LR image into a set of various difficulty tokens according to the instances distribution in each sub-region, and a decoder based on a multi-exit network structure to recover the sub-regions corresponding to various difficulty tokens by consuming different computational resources. Experimental results demonstrate the superiority of the proposed InstanceSR over state-of-the-art models, especially the recovery of regions where instances exist, by extensive quantitative and qualitative evaluations on three widely used benchmarks containing small instances. Besides, the comparisons using SR results on three challenging small object detection benchmarks verify that our InstanceSR can consistently boost the detection accuracy and has great potential for subsequent machine recognition.</abstract><pub>IEEE</pub><doi>10.1109/TCSVT.2024.3496664</doi><tpages>1</tpages><orcidid>https://orcid.org/0009-0008-6507-666X</orcidid><orcidid>https://orcid.org/0009-0005-5834-4927</orcidid><orcidid>https://orcid.org/0000-0001-8023-9465</orcidid><orcidid>https://orcid.org/0000-0002-3173-6307</orcidid></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1051-8215
ispartof	IEEE transactions on circuits and systems for video technology, 2024-11, p.1-1
issn	1051-8215 1558-2205
language	eng
recordid	cdi_ieee_primary_10750855
source	IEEE Electronic Library (IEL)
subjects	Decoding Electronic mail Feature extraction Generative adversarial networks Image recognition Low-Level Vision Machine Recognition Object detection Pipelines Semantics Super Resolution Superresolution Visualization
title	InstanceSR: Efficient Reconstructing Small Object with Differential Instance-level Super-Resolution
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T02%3A22%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=InstanceSR:%20Efficient%20Reconstructing%20Small%20Object%20with%20Differential%20Instance-level%20Super-Resolution&rft.jtitle=IEEE%20transactions%20on%20circuits%20and%20systems%20for%20video%20technology&rft.au=Fan,%20Yuanting&rft.date=2024-11-11&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=1051-8215&rft.eissn=1558-2205&rft.coden=ITCTEM&rft_id=info:doi/10.1109/TCSVT.2024.3496664&rft_dat=%3Ccrossref_RIE%3E10_1109_TCSVT_2024_3496664%3C/crossref_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=10750855&rfr_iscdi=true