Image Inpainting by End-to-End Cascaded Refinement with Mask Awareness

Inpainting arbitrary missing regions is challenging because learning valid features for various masked regions is nontrivial. Though U-shaped encoder-decoder frameworks have been witnessed to be successful, most of them share a common drawback of mask unawareness in feature extraction because all co...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on image processing 2021-01, Vol.PP, p.1-1
Hauptverfasser:	Zhu, Manyu, He, Dongliang, Li, Xin, Lia, Chao, Lib, Fu, Liu, Xiao, Ding, Errui, Zhang, Zhaoxiang
Format:	Artikel
Sprache:	eng
Schlagworte:	Cascaded Refinement Coders Convolution Decoding Dynamic Filtering Encoders-Decoders Feature extraction Image Inpainting Image reconstruction Kernel Mask Awareness Scaling factors Shape Task analysis
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1
container_issue
container_start_page	1
container_title	IEEE transactions on image processing
container_volume	PP
creator	Zhu, Manyu He, Dongliang Li, Xin Lia, Chao Lib, Fu Liu, Xiao Ding, Errui Zhang, Zhaoxiang
description	Inpainting arbitrary missing regions is challenging because learning valid features for various masked regions is nontrivial. Though U-shaped encoder-decoder frameworks have been witnessed to be successful, most of them share a common drawback of mask unawareness in feature extraction because all convolution windows (or regions), including those with various shapes of missing pixels, are treated equally and filtered with fixed learned kernels. To this end, we propose our novel mask-aware inpainting solution. Firstly, a Mask-Aware Dynamic Filtering (MADF) module is designed to effectively learn multi-scale features for missing regions in the encoding phase. Specifically, filters for each convolution window are generated from features of the corresponding region of the mask. The second fold of mask awareness is achieved by adopting Point-wise Normalization (PN) in our decoding phase, considering that statistical natures of features at masked points differentiate from those of unmasked points. The proposed PN can tackle this issue by dynamically assigning point-wise scaling factor and bias. Lastly, our model is designed to be an end-to-end cascaded refinement one. Supervision information such as reconstruction loss, perceptual loss and total variation loss is incrementally leveraged to boost the inpainting results from coarse to fine. Effectiveness of the proposed framework is validated both quantitatively and qualitatively via extensive experiments on three public datasets including Places2, CelebA and Paris StreetView.
doi_str_mv	10.1109/TIP.2021.3076310
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_9423556</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9423556</ieee_id><sourcerecordid>2522396901</sourcerecordid><originalsourceid>FETCH-LOGICAL-c347t-ae0f0f26c63717f4f9714c5346beb10a6dcca75df230a4a49d2fadb8c2e5d1953</originalsourceid><addsrcrecordid>eNpdkE1Lw0AQhhdRtH7cBUECXrykzn4l7lFKqwFFET2Hze6sRptNzSYU_71bW3vw9A7MM8PLQ8gphTGloK5eiqcxA0bHHPKMU9ghI6oETQEE240zyDzNqVAH5DCEDwAqJM32yQHnSkiRqxGZFY1-w6TwC137vvZvSfWdTL1N-zaNkUx0MNqiTZ7R1R4b9H2yrPv35EGHz-RmqTv0GMIx2XN6HvBkk0fkdTZ9mdyl94-3xeTmPjVc5H2qERw4lpmM5zR3wqnYzkgusgorCjqzxuhcWsc4aKGFssxpW10bhtJSJfkRuVz_XXTt14ChL5s6GJzPtcd2CCWTjHGVKaARvfiHfrRD52O7X0pFY0xFCtaU6doQOnTloqsb3X2XFMqV4zI6LleOy43jeHK-eTxUDdrtwZ_UCJytgRoRt2slGJcy4z_Z3n4f</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2522907629</pqid></control><display><type>article</type><title>Image Inpainting by End-to-End Cascaded Refinement with Mask Awareness</title><source>IEEE Electronic Library (IEL)</source><creator>Zhu, Manyu ; He, Dongliang ; Li, Xin ; Lia, Chao ; Lib, Fu ; Liu, Xiao ; Ding, Errui ; Zhang, Zhaoxiang</creator><creatorcontrib>Zhu, Manyu ; He, Dongliang ; Li, Xin ; Lia, Chao ; Lib, Fu ; Liu, Xiao ; Ding, Errui ; Zhang, Zhaoxiang</creatorcontrib><description>Inpainting arbitrary missing regions is challenging because learning valid features for various masked regions is nontrivial. Though U-shaped encoder-decoder frameworks have been witnessed to be successful, most of them share a common drawback of mask unawareness in feature extraction because all convolution windows (or regions), including those with various shapes of missing pixels, are treated equally and filtered with fixed learned kernels. To this end, we propose our novel mask-aware inpainting solution. Firstly, a Mask-Aware Dynamic Filtering (MADF) module is designed to effectively learn multi-scale features for missing regions in the encoding phase. Specifically, filters for each convolution window are generated from features of the corresponding region of the mask. The second fold of mask awareness is achieved by adopting Point-wise Normalization (PN) in our decoding phase, considering that statistical natures of features at masked points differentiate from those of unmasked points. The proposed PN can tackle this issue by dynamically assigning point-wise scaling factor and bias. Lastly, our model is designed to be an end-to-end cascaded refinement one. Supervision information such as reconstruction loss, perceptual loss and total variation loss is incrementally leveraged to boost the inpainting results from coarse to fine. Effectiveness of the proposed framework is validated both quantitatively and qualitatively via extensive experiments on three public datasets including Places2, CelebA and Paris StreetView.</description><identifier>ISSN: 1057-7149</identifier><identifier>EISSN: 1941-0042</identifier><identifier>DOI: 10.1109/TIP.2021.3076310</identifier><identifier>PMID: 33945479</identifier><identifier>CODEN: IIPRE4</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>Cascaded Refinement ; Coders ; Convolution ; Decoding ; Dynamic Filtering ; Encoders-Decoders ; Feature extraction ; Image Inpainting ; Image reconstruction ; Kernel ; Mask Awareness ; Scaling factors ; Shape ; Task analysis</subject><ispartof>IEEE transactions on image processing, 2021-01, Vol.PP, p.1-1</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c347t-ae0f0f26c63717f4f9714c5346beb10a6dcca75df230a4a49d2fadb8c2e5d1953</citedby><cites>FETCH-LOGICAL-c347t-ae0f0f26c63717f4f9714c5346beb10a6dcca75df230a4a49d2fadb8c2e5d1953</cites><orcidid>0000-0002-1129-8610 ; 0000-0002-1867-5378 ; 0000-0003-2648-3875 ; 0000-0001-5547-3671</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9423556$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,777,781,793,27905,27906,54739</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9423556$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33945479$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zhu, Manyu</creatorcontrib><creatorcontrib>He, Dongliang</creatorcontrib><creatorcontrib>Li, Xin</creatorcontrib><creatorcontrib>Lia, Chao</creatorcontrib><creatorcontrib>Lib, Fu</creatorcontrib><creatorcontrib>Liu, Xiao</creatorcontrib><creatorcontrib>Ding, Errui</creatorcontrib><creatorcontrib>Zhang, Zhaoxiang</creatorcontrib><title>Image Inpainting by End-to-End Cascaded Refinement with Mask Awareness</title><title>IEEE transactions on image processing</title><addtitle>TIP</addtitle><addtitle>IEEE Trans Image Process</addtitle><description>Inpainting arbitrary missing regions is challenging because learning valid features for various masked regions is nontrivial. Though U-shaped encoder-decoder frameworks have been witnessed to be successful, most of them share a common drawback of mask unawareness in feature extraction because all convolution windows (or regions), including those with various shapes of missing pixels, are treated equally and filtered with fixed learned kernels. To this end, we propose our novel mask-aware inpainting solution. Firstly, a Mask-Aware Dynamic Filtering (MADF) module is designed to effectively learn multi-scale features for missing regions in the encoding phase. Specifically, filters for each convolution window are generated from features of the corresponding region of the mask. The second fold of mask awareness is achieved by adopting Point-wise Normalization (PN) in our decoding phase, considering that statistical natures of features at masked points differentiate from those of unmasked points. The proposed PN can tackle this issue by dynamically assigning point-wise scaling factor and bias. Lastly, our model is designed to be an end-to-end cascaded refinement one. Supervision information such as reconstruction loss, perceptual loss and total variation loss is incrementally leveraged to boost the inpainting results from coarse to fine. Effectiveness of the proposed framework is validated both quantitatively and qualitatively via extensive experiments on three public datasets including Places2, CelebA and Paris StreetView.</description><subject>Cascaded Refinement</subject><subject>Coders</subject><subject>Convolution</subject><subject>Decoding</subject><subject>Dynamic Filtering</subject><subject>Encoders-Decoders</subject><subject>Feature extraction</subject><subject>Image Inpainting</subject><subject>Image reconstruction</subject><subject>Kernel</subject><subject>Mask Awareness</subject><subject>Scaling factors</subject><subject>Shape</subject><subject>Task analysis</subject><issn>1057-7149</issn><issn>1941-0042</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkE1Lw0AQhhdRtH7cBUECXrykzn4l7lFKqwFFET2Hze6sRptNzSYU_71bW3vw9A7MM8PLQ8gphTGloK5eiqcxA0bHHPKMU9ghI6oETQEE240zyDzNqVAH5DCEDwAqJM32yQHnSkiRqxGZFY1-w6TwC137vvZvSfWdTL1N-zaNkUx0MNqiTZ7R1R4b9H2yrPv35EGHz-RmqTv0GMIx2XN6HvBkk0fkdTZ9mdyl94-3xeTmPjVc5H2qERw4lpmM5zR3wqnYzkgusgorCjqzxuhcWsc4aKGFssxpW10bhtJSJfkRuVz_XXTt14ChL5s6GJzPtcd2CCWTjHGVKaARvfiHfrRD52O7X0pFY0xFCtaU6doQOnTloqsb3X2XFMqV4zI6LleOy43jeHK-eTxUDdrtwZ_UCJytgRoRt2slGJcy4z_Z3n4f</recordid><startdate>20210101</startdate><enddate>20210101</enddate><creator>Zhu, Manyu</creator><creator>He, Dongliang</creator><creator>Li, Xin</creator><creator>Lia, Chao</creator><creator>Lib, Fu</creator><creator>Liu, Xiao</creator><creator>Ding, Errui</creator><creator>Zhang, Zhaoxiang</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-1129-8610</orcidid><orcidid>https://orcid.org/0000-0002-1867-5378</orcidid><orcidid>https://orcid.org/0000-0003-2648-3875</orcidid><orcidid>https://orcid.org/0000-0001-5547-3671</orcidid></search><sort><creationdate>20210101</creationdate><title>Image Inpainting by End-to-End Cascaded Refinement with Mask Awareness</title><author>Zhu, Manyu ; He, Dongliang ; Li, Xin ; Lia, Chao ; Lib, Fu ; Liu, Xiao ; Ding, Errui ; Zhang, Zhaoxiang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c347t-ae0f0f26c63717f4f9714c5346beb10a6dcca75df230a4a49d2fadb8c2e5d1953</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Cascaded Refinement</topic><topic>Coders</topic><topic>Convolution</topic><topic>Decoding</topic><topic>Dynamic Filtering</topic><topic>Encoders-Decoders</topic><topic>Feature extraction</topic><topic>Image Inpainting</topic><topic>Image reconstruction</topic><topic>Kernel</topic><topic>Mask Awareness</topic><topic>Scaling factors</topic><topic>Shape</topic><topic>Task analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhu, Manyu</creatorcontrib><creatorcontrib>He, Dongliang</creatorcontrib><creatorcontrib>Li, Xin</creatorcontrib><creatorcontrib>Lia, Chao</creatorcontrib><creatorcontrib>Lib, Fu</creatorcontrib><creatorcontrib>Liu, Xiao</creatorcontrib><creatorcontrib>Ding, Errui</creatorcontrib><creatorcontrib>Zhang, Zhaoxiang</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998–Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Zhu, Manyu</au><au>He, Dongliang</au><au>Li, Xin</au><au>Lia, Chao</au><au>Lib, Fu</au><au>Liu, Xiao</au><au>Ding, Errui</au><au>Zhang, Zhaoxiang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Image Inpainting by End-to-End Cascaded Refinement with Mask Awareness</atitle><jtitle>IEEE transactions on image processing</jtitle><stitle>TIP</stitle><addtitle>IEEE Trans Image Process</addtitle><date>2021-01-01</date><risdate>2021</risdate><volume>PP</volume><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>1057-7149</issn><eissn>1941-0042</eissn><coden>IIPRE4</coden><abstract>Inpainting arbitrary missing regions is challenging because learning valid features for various masked regions is nontrivial. Though U-shaped encoder-decoder frameworks have been witnessed to be successful, most of them share a common drawback of mask unawareness in feature extraction because all convolution windows (or regions), including those with various shapes of missing pixels, are treated equally and filtered with fixed learned kernels. To this end, we propose our novel mask-aware inpainting solution. Firstly, a Mask-Aware Dynamic Filtering (MADF) module is designed to effectively learn multi-scale features for missing regions in the encoding phase. Specifically, filters for each convolution window are generated from features of the corresponding region of the mask. The second fold of mask awareness is achieved by adopting Point-wise Normalization (PN) in our decoding phase, considering that statistical natures of features at masked points differentiate from those of unmasked points. The proposed PN can tackle this issue by dynamically assigning point-wise scaling factor and bias. Lastly, our model is designed to be an end-to-end cascaded refinement one. Supervision information such as reconstruction loss, perceptual loss and total variation loss is incrementally leveraged to boost the inpainting results from coarse to fine. Effectiveness of the proposed framework is validated both quantitatively and qualitatively via extensive experiments on three public datasets including Places2, CelebA and Paris StreetView.</abstract><cop>United States</cop><pub>IEEE</pub><pmid>33945479</pmid><doi>10.1109/TIP.2021.3076310</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-1129-8610</orcidid><orcidid>https://orcid.org/0000-0002-1867-5378</orcidid><orcidid>https://orcid.org/0000-0003-2648-3875</orcidid><orcidid>https://orcid.org/0000-0001-5547-3671</orcidid></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1057-7149
ispartof	IEEE transactions on image processing, 2021-01, Vol.PP, p.1-1
issn	1057-7149 1941-0042
language	eng
recordid	cdi_ieee_primary_9423556
source	IEEE Electronic Library (IEL)
subjects	Cascaded Refinement Coders Convolution Decoding Dynamic Filtering Encoders-Decoders Feature extraction Image Inpainting Image reconstruction Kernel Mask Awareness Scaling factors Shape Task analysis
title	Image Inpainting by End-to-End Cascaded Refinement with Mask Awareness
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T13%3A43%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Image%20Inpainting%20by%20End-to-End%20Cascaded%20Refinement%20with%20Mask%20Awareness&rft.jtitle=IEEE%20transactions%20on%20image%20processing&rft.au=Zhu,%20Manyu&rft.date=2021-01-01&rft.volume=PP&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=1057-7149&rft.eissn=1941-0042&rft.coden=IIPRE4&rft_id=info:doi/10.1109/TIP.2021.3076310&rft_dat=%3Cproquest_RIE%3E2522396901%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2522907629&rft_id=info:pmid/33945479&rft_ieee_id=9423556&rfr_iscdi=true