Edge preserved universal pooling: novel strategies for pooling in convolutional neural networks

Anti-aliased Convolutional Neural Networks (CNNs) have been proposed to overcome the shift variant nature of the CNNs. The fundamental building block of the anti-aliased CNN has been the application of Gaussian or wavelet-based smoothing before the pooling operation. However, in all these approaches...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Multimedia systems 2023-06, Vol.29 (3), p.1277-1290
Hauptverfasser:	Sineesh, Adithya, Raveendranatha Panicker, Mahesh
Format:	Artikel
Sprache:	eng
Schlagworte:	Artificial neural networks Classification Coders Computer Communication Networks Computer Graphics Computer Science Cryptology Data Storage Representation Feature maps Multimedia Information Systems Neural networks Operating Systems Regular Paper Segmentation
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1290
container_issue	3
container_start_page	1277
container_title	Multimedia systems
container_volume	29
creator	Sineesh, Adithya Raveendranatha Panicker, Mahesh
description	Anti-aliased Convolutional Neural Networks (CNNs) have been proposed to overcome the shift variant nature of the CNNs. The fundamental building block of the anti-aliased CNN has been the application of Gaussian or wavelet-based smoothing before the pooling operation. However, in all these approaches, the feature maps’ edges are also smoothed while suppressing high-frequency components. In this work, two novel pooling approaches are presented, namely the Laplacian-Gaussian Concatenation with Attention (LGCA) pooling and Wavelet-based Approximate-Detailed Coefficient concatenation with Attention (WADCA) that can preserve the edges in the feature maps. The results suggest that the proposed pooling approaches outperform conventional as well as blur pooling for classification, segmentation and auto-encoders. In terms of average binary classification accuracy (cats vs dogs), the proposed LGCA approach outperforms the conventional pooling and blur pooling by 4% and 2%, 3% and 4%, 3% and 0.5% for MobileNetv2, DenseNet121 and ResNet50 respectively. On the other hand, the proposed WADCA approach outperforms the normal pooling and blur pooling by 5% and 3%, 2% and 3%, 2% and 0.17% for MobileNetv2, DenseNet121 and ResNet50 respectively. It is also observed from the results that edge-preserving pooling does not have any significance in segmentation tasks possibly due to high to low-resolution translation. Meanwhile, high-resolution reconstruction has been observed for the LGCA pooling in the case of convolutional auto-encoders.
doi_str_mv	10.1007/s00530-023-01053-6
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2821007590</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2821007590</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-bd62bfabc8ca49f68fd08a77f466b29672497a93e14f2f3dbc988b681ff3769f3</originalsourceid><addsrcrecordid>eNp9kE1LxDAQhoMouK7-AU8Bz9FJ0k0Tb7KsH7DgRc8hbZPStSY1aSv-e7tbxZunGZjnfRkehC4pXFOA_CYBrDgQYJwAnVYijtCCZpwRKiU7RgtQGSOZEuwUnaW0A6C54LBAelPVFnfRJhtHW-HBN6ONybS4C6FtfH2LfRhti1MfTW_rxibsQvy94sbjMvgxtEPfBD_FvB3iYfSfIb6lc3TiTJvsxc9cotf7zcv6kWyfH57Wd1tScqp6UlSCFc4UpSxNppyQrgJp8txlQhRMiZxlKjeKW5o55nhVlErKQkjqHM-FcnyJrubeLoaPwaZe78IQp4eSZpLtHa0UTBSbqTKGlKJ1uovNu4lfmoLeQ3oWqSeR-iBSiynE51CaYF_b-Ff9T-obzIR4Cg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2821007590</pqid></control><display><type>article</type><title>Edge preserved universal pooling: novel strategies for pooling in convolutional neural networks</title><source>SpringerLink Journals - AutoHoldings</source><creator>Sineesh, Adithya ; Raveendranatha Panicker, Mahesh</creator><creatorcontrib>Sineesh, Adithya ; Raveendranatha Panicker, Mahesh</creatorcontrib><description>Anti-aliased Convolutional Neural Networks (CNNs) have been proposed to overcome the shift variant nature of the CNNs. The fundamental building block of the anti-aliased CNN has been the application of Gaussian or wavelet-based smoothing before the pooling operation. However, in all these approaches, the feature maps’ edges are also smoothed while suppressing high-frequency components. In this work, two novel pooling approaches are presented, namely the Laplacian-Gaussian Concatenation with Attention (LGCA) pooling and Wavelet-based Approximate-Detailed Coefficient concatenation with Attention (WADCA) that can preserve the edges in the feature maps. The results suggest that the proposed pooling approaches outperform conventional as well as blur pooling for classification, segmentation and auto-encoders. In terms of average binary classification accuracy (cats vs dogs), the proposed LGCA approach outperforms the conventional pooling and blur pooling by 4% and 2%, 3% and 4%, 3% and 0.5% for MobileNetv2, DenseNet121 and ResNet50 respectively. On the other hand, the proposed WADCA approach outperforms the normal pooling and blur pooling by 5% and 3%, 2% and 3%, 2% and 0.17% for MobileNetv2, DenseNet121 and ResNet50 respectively. It is also observed from the results that edge-preserving pooling does not have any significance in segmentation tasks possibly due to high to low-resolution translation. Meanwhile, high-resolution reconstruction has been observed for the LGCA pooling in the case of convolutional auto-encoders.</description><identifier>ISSN: 0942-4962</identifier><identifier>EISSN: 1432-1882</identifier><identifier>DOI: 10.1007/s00530-023-01053-6</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Artificial neural networks ; Classification ; Coders ; Computer Communication Networks ; Computer Graphics ; Computer Science ; Cryptology ; Data Storage Representation ; Feature maps ; Multimedia Information Systems ; Neural networks ; Operating Systems ; Regular Paper ; Segmentation</subject><ispartof>Multimedia systems, 2023-06, Vol.29 (3), p.1277-1290</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-bd62bfabc8ca49f68fd08a77f466b29672497a93e14f2f3dbc988b681ff3769f3</citedby><cites>FETCH-LOGICAL-c319t-bd62bfabc8ca49f68fd08a77f466b29672497a93e14f2f3dbc988b681ff3769f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00530-023-01053-6$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00530-023-01053-6$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Sineesh, Adithya</creatorcontrib><creatorcontrib>Raveendranatha Panicker, Mahesh</creatorcontrib><title>Edge preserved universal pooling: novel strategies for pooling in convolutional neural networks</title><title>Multimedia systems</title><addtitle>Multimedia Systems</addtitle><description>Anti-aliased Convolutional Neural Networks (CNNs) have been proposed to overcome the shift variant nature of the CNNs. The fundamental building block of the anti-aliased CNN has been the application of Gaussian or wavelet-based smoothing before the pooling operation. However, in all these approaches, the feature maps’ edges are also smoothed while suppressing high-frequency components. In this work, two novel pooling approaches are presented, namely the Laplacian-Gaussian Concatenation with Attention (LGCA) pooling and Wavelet-based Approximate-Detailed Coefficient concatenation with Attention (WADCA) that can preserve the edges in the feature maps. The results suggest that the proposed pooling approaches outperform conventional as well as blur pooling for classification, segmentation and auto-encoders. In terms of average binary classification accuracy (cats vs dogs), the proposed LGCA approach outperforms the conventional pooling and blur pooling by 4% and 2%, 3% and 4%, 3% and 0.5% for MobileNetv2, DenseNet121 and ResNet50 respectively. On the other hand, the proposed WADCA approach outperforms the normal pooling and blur pooling by 5% and 3%, 2% and 3%, 2% and 0.17% for MobileNetv2, DenseNet121 and ResNet50 respectively. It is also observed from the results that edge-preserving pooling does not have any significance in segmentation tasks possibly due to high to low-resolution translation. Meanwhile, high-resolution reconstruction has been observed for the LGCA pooling in the case of convolutional auto-encoders.</description><subject>Artificial neural networks</subject><subject>Classification</subject><subject>Coders</subject><subject>Computer Communication Networks</subject><subject>Computer Graphics</subject><subject>Computer Science</subject><subject>Cryptology</subject><subject>Data Storage Representation</subject><subject>Feature maps</subject><subject>Multimedia Information Systems</subject><subject>Neural networks</subject><subject>Operating Systems</subject><subject>Regular Paper</subject><subject>Segmentation</subject><issn>0942-4962</issn><issn>1432-1882</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kE1LxDAQhoMouK7-AU8Bz9FJ0k0Tb7KsH7DgRc8hbZPStSY1aSv-e7tbxZunGZjnfRkehC4pXFOA_CYBrDgQYJwAnVYijtCCZpwRKiU7RgtQGSOZEuwUnaW0A6C54LBAelPVFnfRJhtHW-HBN6ONybS4C6FtfH2LfRhti1MfTW_rxibsQvy94sbjMvgxtEPfBD_FvB3iYfSfIb6lc3TiTJvsxc9cotf7zcv6kWyfH57Wd1tScqp6UlSCFc4UpSxNppyQrgJp8txlQhRMiZxlKjeKW5o55nhVlErKQkjqHM-FcnyJrubeLoaPwaZe78IQp4eSZpLtHa0UTBSbqTKGlKJ1uovNu4lfmoLeQ3oWqSeR-iBSiynE51CaYF_b-Ff9T-obzIR4Cg</recordid><startdate>20230601</startdate><enddate>20230601</enddate><creator>Sineesh, Adithya</creator><creator>Raveendranatha Panicker, Mahesh</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20230601</creationdate><title>Edge preserved universal pooling: novel strategies for pooling in convolutional neural networks</title><author>Sineesh, Adithya ; Raveendranatha Panicker, Mahesh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-bd62bfabc8ca49f68fd08a77f466b29672497a93e14f2f3dbc988b681ff3769f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial neural networks</topic><topic>Classification</topic><topic>Coders</topic><topic>Computer Communication Networks</topic><topic>Computer Graphics</topic><topic>Computer Science</topic><topic>Cryptology</topic><topic>Data Storage Representation</topic><topic>Feature maps</topic><topic>Multimedia Information Systems</topic><topic>Neural networks</topic><topic>Operating Systems</topic><topic>Regular Paper</topic><topic>Segmentation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sineesh, Adithya</creatorcontrib><creatorcontrib>Raveendranatha Panicker, Mahesh</creatorcontrib><collection>CrossRef</collection><jtitle>Multimedia systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sineesh, Adithya</au><au>Raveendranatha Panicker, Mahesh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Edge preserved universal pooling: novel strategies for pooling in convolutional neural networks</atitle><jtitle>Multimedia systems</jtitle><stitle>Multimedia Systems</stitle><date>2023-06-01</date><risdate>2023</risdate><volume>29</volume><issue>3</issue><spage>1277</spage><epage>1290</epage><pages>1277-1290</pages><issn>0942-4962</issn><eissn>1432-1882</eissn><abstract>Anti-aliased Convolutional Neural Networks (CNNs) have been proposed to overcome the shift variant nature of the CNNs. The fundamental building block of the anti-aliased CNN has been the application of Gaussian or wavelet-based smoothing before the pooling operation. However, in all these approaches, the feature maps’ edges are also smoothed while suppressing high-frequency components. In this work, two novel pooling approaches are presented, namely the Laplacian-Gaussian Concatenation with Attention (LGCA) pooling and Wavelet-based Approximate-Detailed Coefficient concatenation with Attention (WADCA) that can preserve the edges in the feature maps. The results suggest that the proposed pooling approaches outperform conventional as well as blur pooling for classification, segmentation and auto-encoders. In terms of average binary classification accuracy (cats vs dogs), the proposed LGCA approach outperforms the conventional pooling and blur pooling by 4% and 2%, 3% and 4%, 3% and 0.5% for MobileNetv2, DenseNet121 and ResNet50 respectively. On the other hand, the proposed WADCA approach outperforms the normal pooling and blur pooling by 5% and 3%, 2% and 3%, 2% and 0.17% for MobileNetv2, DenseNet121 and ResNet50 respectively. It is also observed from the results that edge-preserving pooling does not have any significance in segmentation tasks possibly due to high to low-resolution translation. Meanwhile, high-resolution reconstruction has been observed for the LGCA pooling in the case of convolutional auto-encoders.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00530-023-01053-6</doi><tpages>14</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 0942-4962
ispartof	Multimedia systems, 2023-06, Vol.29 (3), p.1277-1290
issn	0942-4962 1432-1882
language	eng
recordid	cdi_proquest_journals_2821007590
source	SpringerLink Journals - AutoHoldings
subjects	Artificial neural networks Classification Coders Computer Communication Networks Computer Graphics Computer Science Cryptology Data Storage Representation Feature maps Multimedia Information Systems Neural networks Operating Systems Regular Paper Segmentation
title	Edge preserved universal pooling: novel strategies for pooling in convolutional neural networks
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T19%3A29%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Edge%20preserved%20universal%20pooling:%20novel%20strategies%20for%20pooling%20in%20convolutional%20neural%20networks&rft.jtitle=Multimedia%20systems&rft.au=Sineesh,%20Adithya&rft.date=2023-06-01&rft.volume=29&rft.issue=3&rft.spage=1277&rft.epage=1290&rft.pages=1277-1290&rft.issn=0942-4962&rft.eissn=1432-1882&rft_id=info:doi/10.1007/s00530-023-01053-6&rft_dat=%3Cproquest_cross%3E2821007590%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2821007590&rft_id=info:pmid/&rfr_iscdi=true