Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection

Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detec...

Detailed description

Bibliographic details
Published in:	ACM transactions on multimedia computing communications and applications 2023-06, Vol.19 (2s), p.1-26, Article 87
Main authors:	Jonna, Sankaraganesh; Medhi, Moushumi; Sahay, Rajiv Ranjan
Format:	Article
Language:	eng
Subjects:	Computer vision problems; Computing methodologies
Online access:	Full text

container_end_page	26
container_issue	2s
container_start_page	1
container_title	ACM transactions on multimedia computing communications and applications
container_volume	19
creator	Jonna, Sankaraganesh; Medhi, Moushumi; Sahay, Rajiv Ranjan
description	Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred on resource-constrained devices. Despite remarkable advances made by recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, low-contrast focused regions that are indistinguishable from out-of-focus blur, and especially high computational cost and memory requirements. To address the first three challenges, we develop a novel deep network that efficiently detects the blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously model the non-local structural correlations in the high-level blur features. To handle the last two issues, we frame our DBD algorithm to perform knowledge distillation by transferring information from a larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher-order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The code, pre-trained model weights, and the results will be made publicly available.
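The teacher-to-student transfer described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the record gives no loss formulas, so the per-pixel binary cross-entropy, the weighting parameter `alpha`, and the names `bce` and `distillation_loss` are all assumptions chosen for illustration, and the adversarial term is left out entirely.

```python
import math

def bce(p, t, eps=1e-7):
    """Binary cross-entropy between a predicted probability p and a target t in [0, 1]."""
    p = min(max(p, eps), 1.0 - eps)  # clamp so that log() never receives 0
    return -(t * math.log(p) + (1.0 - t) * math.log(1.0 - p))

def distillation_loss(student, teacher, target, alpha=0.5):
    """Mean per-pixel loss over flat lists of blur probabilities.

    alpha weights the ground-truth term; (1 - alpha) weights the term that
    pulls the student toward the teacher's soft output. Both the form of the
    loss and alpha are hypothetical, not taken from the paper.
    """
    if not (len(student) == len(teacher) == len(target)):
        raise ValueError("all maps must have the same number of pixels")
    total = 0.0
    for s, q, y in zip(student, teacher, target):
        total += alpha * bce(s, y) + (1.0 - alpha) * bce(s, q)
    return total / len(student)

# Toy 4-pixel blur maps (1.0 = blurred, 0.0 = focused); the values are made up.
target = [1.0, 1.0, 0.0, 0.0]
teacher = [0.9, 0.8, 0.2, 0.1]
good_student = [0.85, 0.75, 0.25, 0.15]
bad_student = [0.3, 0.4, 0.7, 0.6]

print(distillation_loss(good_student, teacher, target))
print(distillation_loss(bad_student, teacher, target))
```

A student whose map stays close to both the teacher output and the ground truth receives a lower loss than one that disagrees with them, which is the behaviour a compact student network would be trained toward under these assumptions.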
doi_str_mv	10.1145/3557897
format	Article
fullrecord	<record><control><sourceid>acm_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1145_3557897</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3557897</sourcerecordid><originalsourceid>FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</originalsourceid><addsrcrecordid>eNo9kDFPwzAQhS0EEqUgdiZvTIE49tkpW9vQgohggTm6OucqkCbITlvx7yFq6HTv9L73hsfYtYjvhFBwLwFMOjEnbCQARKRTDadHDeacXYTwGcdSg9IjVmZV6Kq6jrJZtpy-PvCXpt3XVK6JDw52VdtwbEo-LXfkA_oKa54T-qZq1nzhcUP71n9x13qekWvtNvBZve2fjmyfvmRnDutAV8Mds4_F4_v8Kcrfls_zaR6hMKqLLKBLVSJBlSVKArLauYmyYNDZGFdJqqWJXSK0oDQBjTYxpFZakk7AGZJjdnvotb4NwZMrvn21Qf9TiLjoxymGcf7ImwOJdnOE_s1fRdBfdQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</title><source>ACM Digital Library Complete</source><creator>Jonna, Sankaraganesh ; Medhi, Moushumi ; Sahay, Rajiv Ranjan</creator><creatorcontrib>Jonna, Sankaraganesh ; Medhi, Moushumi ; Sahay, Rajiv Ranjan</creatorcontrib><description>Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred in resource-constraint devices. Despite remarkable advances made by the recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, indistinguishable low-contrast focused regions from out-of-focus blur, and especially high computational cost and memory requirement. 
To address the first three challenges, we develop a novel deep network that efficiently detects blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously modeled the non-local structural correlations in the high-level blur features. To handle the last two issues, we eventually frame our DBD algorithm to perform knowledge distillation by transferring information from the larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The codes, pre-trained model weights, and the results will be made publicly available.</description><identifier>ISSN: 1551-6857</identifier><identifier>EISSN: 1551-6865</identifier><identifier>DOI: 10.1145/3557897</identifier><language>eng</language><publisher>New York, NY: ACM</publisher><subject>Computer vision problems ; Computing methodologies</subject><ispartof>ACM transactions on multimedia computing communications and applications, 2023-06, Vol.19 (2s), p.1-26, Article 87</ispartof><rights>Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. 
Request permissions from</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</citedby><cites>FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</cites><orcidid>0000-0001-9069-8833 ; 0000-0003-0820-0616 ; 0000-0003-4929-4516</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://dl.acm.org/doi/pdf/10.1145/3557897$$EPDF$$P50$$Gacm$$H</linktopdf><link.rule.ids>314,776,780,2276,27901,27902,40172,76197</link.rule.ids></links><search><creatorcontrib>Jonna, Sankaraganesh</creatorcontrib><creatorcontrib>Medhi, Moushumi</creatorcontrib><creatorcontrib>Sahay, Rajiv Ranjan</creatorcontrib><title>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</title><title>ACM transactions on multimedia computing communications and applications</title><addtitle>ACM TOMM</addtitle><description>Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred in resource-constraint devices. Despite remarkable advances made by the recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, indistinguishable low-contrast focused regions from out-of-focus blur, and especially high computational cost and memory requirement. 
To address the first three challenges, we develop a novel deep network that efficiently detects blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously modeled the non-local structural correlations in the high-level blur features. To handle the last two issues, we eventually frame our DBD algorithm to perform knowledge distillation by transferring information from the larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The codes, pre-trained model weights, and the results will be made publicly available.</description><subject>Computer vision problems</subject><subject>Computing methodologies</subject><issn>1551-6857</issn><issn>1551-6865</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNo9kDFPwzAQhS0EEqUgdiZvTIE49tkpW9vQgohggTm6OucqkCbITlvx7yFq6HTv9L73hsfYtYjvhFBwLwFMOjEnbCQARKRTDadHDeacXYTwGcdSg9IjVmZV6Kq6jrJZtpy-PvCXpt3XVK6JDw52VdtwbEo-LXfkA_oKa54T-qZq1nzhcUP71n9x13qekWvtNvBZve2fjmyfvmRnDutAV8Mds4_F4_v8Kcrfls_zaR6hMKqLLKBLVSJBlSVKArLauYmyYNDZGFdJqqWJXSK0oDQBjTYxpFZakk7AGZJjdnvotb4NwZMrvn21Qf9TiLjoxymGcf7ImwOJdnOE_s1fRdBfdQ</recordid><startdate>20230630</startdate><enddate>20230630</enddate><creator>Jonna, Sankaraganesh</creator><creator>Medhi, Moushumi</creator><creator>Sahay, Rajiv 
Ranjan</creator><general>ACM</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-9069-8833</orcidid><orcidid>https://orcid.org/0000-0003-0820-0616</orcidid><orcidid>https://orcid.org/0000-0003-4929-4516</orcidid></search><sort><creationdate>20230630</creationdate><title>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</title><author>Jonna, Sankaraganesh ; Medhi, Moushumi ; Sahay, Rajiv Ranjan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer vision problems</topic><topic>Computing methodologies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jonna, Sankaraganesh</creatorcontrib><creatorcontrib>Medhi, Moushumi</creatorcontrib><creatorcontrib>Sahay, Rajiv Ranjan</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on multimedia computing communications and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jonna, Sankaraganesh</au><au>Medhi, Moushumi</au><au>Sahay, Rajiv Ranjan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</atitle><jtitle>ACM transactions on multimedia computing communications and applications</jtitle><stitle>ACM TOMM</stitle><date>2023-06-30</date><risdate>2023</risdate><volume>19</volume><issue>2s</issue><spage>1</spage><epage>26</epage><pages>1-26</pages><artnum>87</artnum><issn>1551-6857</issn><eissn>1551-6865</eissn><abstract>Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus 
blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred in resource-constraint devices. Despite remarkable advances made by the recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, indistinguishable low-contrast focused regions from out-of-focus blur, and especially high computational cost and memory requirement. To address the first three challenges, we develop a novel deep network that efficiently detects blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously modeled the non-local structural correlations in the high-level blur features. To handle the last two issues, we eventually frame our DBD algorithm to perform knowledge distillation by transferring information from the larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The codes, pre-trained model weights, and the results will be made publicly available.</abstract><cop>New York, NY</cop><pub>ACM</pub><doi>10.1145/3557897</doi><tpages>26</tpages><orcidid>https://orcid.org/0000-0001-9069-8833</orcidid><orcidid>https://orcid.org/0000-0003-0820-0616</orcidid><orcidid>https://orcid.org/0000-0003-4929-4516</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 1551-6857
ispartof	ACM transactions on multimedia computing communications and applications, 2023-06, Vol.19 (2s), p.1-26, Article 87
issn	1551-6857 1551-6865
language	eng
recordid	cdi_crossref_primary_10_1145_3557897
source	ACM Digital Library Complete
subjects	Computer vision problems; Computing methodologies
title	Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T11%3A01%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-acm_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Distill-DBDGAN:%20Knowledge%20Distillation%20and%20Adversarial%20Learning%20Framework%20for%20Defocus%20Blur%20Detection&rft.jtitle=ACM%20transactions%20on%20multimedia%20computing%20communications%20and%20applications&rft.au=Jonna,%20Sankaraganesh&rft.date=2023-06-30&rft.volume=19&rft.issue=2s&rft.spage=1&rft.epage=26&rft.pages=1-26&rft.artnum=87&rft.issn=1551-6857&rft.eissn=1551-6865&rft_id=info:doi/10.1145/3557897&rft_dat=%3Cacm_cross%3E3557897%3C/acm_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true