Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection
Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detec...
Gespeichert in:
Veröffentlicht in: | ACM transactions on multimedia computing communications and applications 2023-06, Vol.19 (2s), p.1-26, Article 87 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 26 |
---|---|
container_issue | 2s |
container_start_page | 1 |
container_title | ACM transactions on multimedia computing communications and applications |
container_volume | 19 |
creator | Jonna, Sankaraganesh Medhi, Moushumi Sahay, Rajiv Ranjan |
description | Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred in resource-constraint devices. Despite remarkable advances made by the recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, indistinguishable low-contrast focused regions from out-of-focus blur, and especially high computational cost and memory requirement. To address the first three challenges, we develop a novel deep network that efficiently detects blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously modeled the non-local structural correlations in the high-level blur features. To handle the last two issues, we eventually frame our DBD algorithm to perform knowledge distillation by transferring information from the larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The codes, pre-trained model weights, and the results will be made publicly available. |
doi_str_mv | 10.1145/3557897 |
format | Article |
fullrecord | <record><control><sourceid>acm_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1145_3557897</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3557897</sourcerecordid><originalsourceid>FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</originalsourceid><addsrcrecordid>eNo9kDFPwzAQhS0EEqUgdiZvTIE49tkpW9vQgohggTm6OucqkCbITlvx7yFq6HTv9L73hsfYtYjvhFBwLwFMOjEnbCQARKRTDadHDeacXYTwGcdSg9IjVmZV6Kq6jrJZtpy-PvCXpt3XVK6JDw52VdtwbEo-LXfkA_oKa54T-qZq1nzhcUP71n9x13qekWvtNvBZve2fjmyfvmRnDutAV8Mds4_F4_v8Kcrfls_zaR6hMKqLLKBLVSJBlSVKArLauYmyYNDZGFdJqqWJXSK0oDQBjTYxpFZakk7AGZJjdnvotb4NwZMrvn21Qf9TiLjoxymGcf7ImwOJdnOE_s1fRdBfdQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</title><source>ACM Digital Library Complete</source><creator>Jonna, Sankaraganesh ; Medhi, Moushumi ; Sahay, Rajiv Ranjan</creator><creatorcontrib>Jonna, Sankaraganesh ; Medhi, Moushumi ; Sahay, Rajiv Ranjan</creatorcontrib><description>Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred in resource-constraint devices. Despite remarkable advances made by the recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, indistinguishable low-contrast focused regions from out-of-focus blur, and especially high computational cost and memory requirement. To address the first three challenges, we develop a novel deep network that efficiently detects blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously modeled the non-local structural correlations in the high-level blur features. To handle the last two issues, we eventually frame our DBD algorithm to perform knowledge distillation by transferring information from the larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The codes, pre-trained model weights, and the results will be made publicly available.</description><identifier>ISSN: 1551-6857</identifier><identifier>EISSN: 1551-6865</identifier><identifier>DOI: 10.1145/3557897</identifier><language>eng</language><publisher>New York, NY: ACM</publisher><subject>Computer vision problems ; Computing methodologies</subject><ispartof>ACM transactions on multimedia computing communications and applications, 2023-06, Vol.19 (2s), p.1-26, Article 87</ispartof><rights>Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</citedby><cites>FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</cites><orcidid>0000-0001-9069-8833 ; 0000-0003-0820-0616 ; 0000-0003-4929-4516</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://dl.acm.org/doi/pdf/10.1145/3557897$$EPDF$$P50$$Gacm$$H</linktopdf><link.rule.ids>314,776,780,2276,27901,27902,40172,76197</link.rule.ids></links><search><creatorcontrib>Jonna, Sankaraganesh</creatorcontrib><creatorcontrib>Medhi, Moushumi</creatorcontrib><creatorcontrib>Sahay, Rajiv Ranjan</creatorcontrib><title>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</title><title>ACM transactions on multimedia computing communications and applications</title><addtitle>ACM TOMM</addtitle><description>Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred in resource-constraint devices. Despite remarkable advances made by the recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, indistinguishable low-contrast focused regions from out-of-focus blur, and especially high computational cost and memory requirement. To address the first three challenges, we develop a novel deep network that efficiently detects blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously modeled the non-local structural correlations in the high-level blur features. To handle the last two issues, we eventually frame our DBD algorithm to perform knowledge distillation by transferring information from the larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The codes, pre-trained model weights, and the results will be made publicly available.</description><subject>Computer vision problems</subject><subject>Computing methodologies</subject><issn>1551-6857</issn><issn>1551-6865</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNo9kDFPwzAQhS0EEqUgdiZvTIE49tkpW9vQgohggTm6OucqkCbITlvx7yFq6HTv9L73hsfYtYjvhFBwLwFMOjEnbCQARKRTDadHDeacXYTwGcdSg9IjVmZV6Kq6jrJZtpy-PvCXpt3XVK6JDw52VdtwbEo-LXfkA_oKa54T-qZq1nzhcUP71n9x13qekWvtNvBZve2fjmyfvmRnDutAV8Mds4_F4_v8Kcrfls_zaR6hMKqLLKBLVSJBlSVKArLauYmyYNDZGFdJqqWJXSK0oDQBjTYxpFZakk7AGZJjdnvotb4NwZMrvn21Qf9TiLjoxymGcf7ImwOJdnOE_s1fRdBfdQ</recordid><startdate>20230630</startdate><enddate>20230630</enddate><creator>Jonna, Sankaraganesh</creator><creator>Medhi, Moushumi</creator><creator>Sahay, Rajiv Ranjan</creator><general>ACM</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-9069-8833</orcidid><orcidid>https://orcid.org/0000-0003-0820-0616</orcidid><orcidid>https://orcid.org/0000-0003-4929-4516</orcidid></search><sort><creationdate>20230630</creationdate><title>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</title><author>Jonna, Sankaraganesh ; Medhi, Moushumi ; Sahay, Rajiv Ranjan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a174t-c5af842354dda3e5ec6ff94c57afc0ab286370f2161e8256ac27e4b63e625f7e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer vision problems</topic><topic>Computing methodologies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jonna, Sankaraganesh</creatorcontrib><creatorcontrib>Medhi, Moushumi</creatorcontrib><creatorcontrib>Sahay, Rajiv Ranjan</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on multimedia computing communications and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jonna, Sankaraganesh</au><au>Medhi, Moushumi</au><au>Sahay, Rajiv Ranjan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection</atitle><jtitle>ACM transactions on multimedia computing communications and applications</jtitle><stitle>ACM TOMM</stitle><date>2023-06-30</date><risdate>2023</risdate><volume>19</volume><issue>2s</issue><spage>1</spage><epage>26</epage><pages>1-26</pages><artnum>87</artnum><issn>1551-6857</issn><eissn>1551-6865</eissn><abstract>Defocus blur detection (DBD) aims to segment the blurred regions from a given image affected by defocus blur. It is a crucial pre-processing step for various computer vision tasks. With the increasing popularity of small mobile devices, there is a need for a computationally efficient method to detect defocus blur accurately. We propose an efficient defocus blur detection method that estimates the probability of each pixel being focused or blurred in resource-constraint devices. Despite remarkable advances made by the recent deep learning-based methods, they still suffer from several challenges such as background clutter, scale sensitivity, indistinguishable low-contrast focused regions from out-of-focus blur, and especially high computational cost and memory requirement. To address the first three challenges, we develop a novel deep network that efficiently detects blur map from the input blurred image. Specifically, we integrate multi-scale features in the deep network to resolve the scale ambiguities and simultaneously modeled the non-local structural correlations in the high-level blur features. To handle the last two issues, we eventually frame our DBD algorithm to perform knowledge distillation by transferring information from the larger teacher network to a compact student network. All the networks are adversarially trained in an end-to-end manner to enforce higher order consistencies between the output and the target distributions. Experimental results demonstrate the state-of-the-art performance of the larger teacher network, while our proposed lightweight DBD model imitates the output of the teacher network without significant loss in accuracy. The codes, pre-trained model weights, and the results will be made publicly available.</abstract><cop>New York, NY</cop><pub>ACM</pub><doi>10.1145/3557897</doi><tpages>26</tpages><orcidid>https://orcid.org/0000-0001-9069-8833</orcidid><orcidid>https://orcid.org/0000-0003-0820-0616</orcidid><orcidid>https://orcid.org/0000-0003-4929-4516</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1551-6857 |
ispartof | ACM transactions on multimedia computing communications and applications, 2023-06, Vol.19 (2s), p.1-26, Article 87 |
issn | 1551-6857 1551-6865 |
language | eng |
recordid | cdi_crossref_primary_10_1145_3557897 |
source | ACM Digital Library Complete |
subjects | Computer vision problems Computing methodologies |
title | Distill-DBDGAN: Knowledge Distillation and Adversarial Learning Framework for Defocus Blur Detection |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T11%3A01%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-acm_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Distill-DBDGAN:%20Knowledge%20Distillation%20and%20Adversarial%20Learning%20Framework%20for%20Defocus%20Blur%20Detection&rft.jtitle=ACM%20transactions%20on%20multimedia%20computing%20communications%20and%20applications&rft.au=Jonna,%20Sankaraganesh&rft.date=2023-06-30&rft.volume=19&rft.issue=2s&rft.spage=1&rft.epage=26&rft.pages=1-26&rft.artnum=87&rft.issn=1551-6857&rft.eissn=1551-6865&rft_id=info:doi/10.1145/3557897&rft_dat=%3Cacm_cross%3E3557897%3C/acm_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |