Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection

Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detec...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ACM transactions on multimedia computing communications and applications 2023-06, Vol.19 (2s), p.1-26, Article 87
Hauptverfasser: Jonna, Sankaraganesh, Medhi, Moushumi, Sahay, Rajiv Ranjan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 26
container_issue 2s
container_start_page 1
container_title ACM transactions on multimedia computing communications and applications
container_volume 19
creator Jonna, Sankaraganesh
Medhi, Moushumi
Sahay, Rajiv Ranjan
description Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred in resource-constraint devices. Despite remarkable advances made by the recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, indistinguishable low-contrast focused regions from out-of-focus blur, and especially high computational cost and memory requirement. To address the first three challenges, we develop a novel deep network that efficiently detects blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously modeled the non-local structural correlations in the high-level blur features. To handle the last two issues, we eventually frame our DBD algorithm to perform knowledge distillation by transferring information from the larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The codes, pre-trained model weights, and the results will be made publicly available.
doi_str_mv 10.1145/3557897
format Article
fullrecord <record><control><sourceid>acm_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1145_3557897</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3557897</sourcerecordid><originalsourceid>FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</originalsourceid><addsrcrecordid>eNo9kDFPwzAQhS0EEqUgdiZvTIE49tkpW9vQgohggTm6OucqkCbITlvx7yFq6HTv9L73hsfYtYjvhFBwLwFMOjEnbCQARKRTDadHDeacXYTwGcdSg9IjVmZV6Kq6jrJZtpy-PvCXpt3XVK6JDw52VdtwbEo-LXfkA_oKa54T-qZq1nzhcUP71n9x13qekWvtNvBZve2fjmyfvmRnDutAV8Mds4_F4_v8Kcrfls_zaR6hMKqLLKBLVSJBlSVKArLauYmyYNDZGFdJqqWJXSK0oDQBjTYxpFZakk7AGZJjdnvotb4NwZMrvn21Qf9TiLjoxymGcf7ImwOJdnOE_s1fRdBfdQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</title><source>ACM Digital Library Complete</source><creator>Jonna, Sankaraganesh ; Medhi, Moushumi ; Sahay, Rajiv Ranjan</creator><creatorcontrib>Jonna, Sankaraganesh ; Medhi, Moushumi ; Sahay, Rajiv Ranjan</creatorcontrib><description>Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred in resource-constraint devices. Despite remarkable advances made by the recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, indistinguishable low-contrast focused regions from out-of-focus blur, and especially high computational cost and memory requirement. To address the first three challenges, we develop a novel deep network that efficiently detects blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously modeled the non-local structural correlations in the high-level blur features. To handle the last two issues, we eventually frame our DBD algorithm to perform knowledge distillation by transferring information from the larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The codes, pre-trained model weights, and the results will be made publicly available.</description><identifier>ISSN: 1551-6857</identifier><identifier>EISSN: 1551-6865</identifier><identifier>DOI: 10.1145/3557897</identifier><language>eng</language><publisher>New York, NY: ACM</publisher><subject>Computer vision problems ; Computing methodologies</subject><ispartof>ACM transactions on multimedia computing communications and applications, 2023-06, Vol.19 (2s), p.1-26, Article 87</ispartof><rights>Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</citedby><cites>FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</cites><orcidid>0000-0001-9069-8833 ; 0000-0003-0820-0616 ; 0000-0003-4929-4516</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://dl.acm.org/doi/pdf/10.1145/3557897$$EPDF$$P50$$Gacm$$H</linktopdf><link.rule.ids>314,776,780,2276,27901,27902,40172,76197</link.rule.ids></links><search><creatorcontrib>Jonna, Sankaraganesh</creatorcontrib><creatorcontrib>Medhi, Moushumi</creatorcontrib><creatorcontrib>Sahay, Rajiv Ranjan</creatorcontrib><title>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</title><title>ACM transactions on multimedia computing communications and applications</title><addtitle>ACM TOMM</addtitle><description>Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred in resource-constraint devices. Despite remarkable advances made by the recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, indistinguishable low-contrast focused regions from out-of-focus blur, and especially high computational cost and memory requirement. To address the first three challenges, we develop a novel deep network that efficiently detects blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously modeled the non-local structural correlations in the high-level blur features. To handle the last two issues, we eventually frame our DBD algorithm to perform knowledge distillation by transferring information from the larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The codes, pre-trained model weights, and the results will be made publicly available.</description><subject>Computer vision problems</subject><subject>Computing methodologies</subject><issn>1551-6857</issn><issn>1551-6865</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNo9kDFPwzAQhS0EEqUgdiZvTIE49tkpW9vQgohggTm6OucqkCbITlvx7yFq6HTv9L73hsfYtYjvhFBwLwFMOjEnbCQARKRTDadHDeacXYTwGcdSg9IjVmZV6Kq6jrJZtpy-PvCXpt3XVK6JDw52VdtwbEo-LXfkA_oKa54T-qZq1nzhcUP71n9x13qekWvtNvBZve2fjmyfvmRnDutAV8Mds4_F4_v8Kcrfls_zaR6hMKqLLKBLVSJBlSVKArLauYmyYNDZGFdJqqWJXSK0oDQBjTYxpFZakk7AGZJjdnvotb4NwZMrvn21Qf9TiLjoxymGcf7ImwOJdnOE_s1fRdBfdQ</recordid><startdate>20230630</startdate><enddate>20230630</enddate><creator>Jonna, Sankaraganesh</creator><creator>Medhi, Moushumi</creator><creator>Sahay, Rajiv Ranjan</creator><general>ACM</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-9069-8833</orcidid><orcidid>https://orcid.org/0000-0003-0820-0616</orcidid><orcidid>https://orcid.org/0000-0003-4929-4516</orcidid></search><sort><creationdate>20230630</creationdate><title>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</title><author>Jonna, Sankaraganesh ; Medhi, Moushumi ; Sahay, Rajiv Ranjan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer vision problems</topic><topic>Computing methodologies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jonna, Sankaraganesh</creatorcontrib><creatorcontrib>Medhi, Moushumi</creatorcontrib><creatorcontrib>Sahay, Rajiv Ranjan</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on multimedia computing communications and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jonna, Sankaraganesh</au><au>Medhi, Moushumi</au><au>Sahay, Rajiv Ranjan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</atitle><jtitle>ACM transactions on multimedia computing communications and applications</jtitle><stitle>ACM TOMM</stitle><date>2023-06-30</date><risdate>2023</risdate><volume>19</volume><issue>2s</issue><spage>1</spage><epage>26</epage><pages>1-26</pages><artnum>87</artnum><issn>1551-6857</issn><eissn>1551-6865</eissn><abstract>Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred in resource-constraint devices. Despite remarkable advances made by the recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, indistinguishable low-contrast focused regions from out-of-focus blur, and especially high computational cost and memory requirement. To address the first three challenges, we develop a novel deep network that efficiently detects blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously modeled the non-local structural correlations in the high-level blur features. To handle the last two issues, we eventually frame our DBD algorithm to perform knowledge distillation by transferring information from the larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The codes, pre-trained model weights, and the results will be made publicly available.</abstract><cop>New York, NY</cop><pub>ACM</pub><doi>10.1145/3557897</doi><tpages>26</tpages><orcidid>https://orcid.org/0000-0001-9069-8833</orcidid><orcidid>https://orcid.org/0000-0003-0820-0616</orcidid><orcidid>https://orcid.org/0000-0003-4929-4516</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1551-6857
ispartof ACM transactions on multimedia computing communications and applications, 2023-06, Vol.19 (2s), p.1-26, Article 87
issn 1551-6857
1551-6865
language eng
recordid cdi_crossref_primary_10_1145_3557897
source ACM Digital Library Complete
subjects Computer vision problems
Computing methodologies
title Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T11%3A01%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-acm_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Distill-DBDGAN:%20Knowledge%20Distillation%20and%20Adversarial%20Learning%20Framework%20for%20Defocus%20Blur%20Detection&rft.jtitle=ACM%20transactions%20on%20multimedia%20computing%20communications%20and%20applications&rft.au=Jonna,%20Sankaraganesh&rft.date=2023-06-30&rft.volume=19&rft.issue=2s&rft.spage=1&rft.epage=26&rft.pages=1-26&rft.artnum=87&rft.issn=1551-6857&rft.eissn=1551-6865&rft_id=info:doi/10.1145/3557897&rft_dat=%3Cacm_cross%3E3557897%3C/acm_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true