Hybrid pooling with wavelets for convolutional neural networks

The need to detect and classify objects correctly is a constant challenge, being able to recognize them at different scales and scenarios, sometimes cropped or badly lit is not an easy task. Convolutional neural networks (CNN) have become a widely applied technique since they are completely trainabl...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of intelligent & fuzzy systems 2022-01, Vol.42 (5), p.4327-4336
Hauptverfasser: Trevino-Sanchez, Daniel, Alarcon-Aquino, Vicente
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 4336
container_issue 5
container_start_page 4327
container_title Journal of intelligent & fuzzy systems
container_volume 42
creator Trevino-Sanchez, Daniel
Alarcon-Aquino, Vicente
description The need to detect and classify objects correctly is a constant challenge, being able to recognize them at different scales and scenarios, sometimes cropped or badly lit is not an easy task. Convolutional neural networks (CNN) have become a widely applied technique since they are completely trainable and suitable to extract features. However, the growing number of convolutional neural networks applications constantly pushes their accuracy improvement. Initially, those improvements involved the use of large datasets, augmentation techniques, and complex algorithms. These methods may have a high computational cost. Nevertheless, feature extraction is known to be the heart of the problem. As a result, other approaches combine different technologies to extract better features to improve the accuracy without the need of more powerful hardware resources. In this paper, we propose a hybrid pooling method that incorporates multiresolution analysis within the CNN layers to reduce the feature map size without losing details. To prevent relevant information from losing during the downsampling process an existing pooling method is combined with wavelet transform technique, keeping those details "alive" and enriching other stages of the CNN. Achieving better quality characteristics improves CNN accuracy. To validate this study, ten pooling methods, including the proposed model, are tested using four benchmark datasets. The results are compared with four of the evaluated methods, which are also considered as the state-of-the-art.
doi_str_mv 10.3233/JIFS-219223
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2645877319</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2645877319</sourcerecordid><originalsourceid>FETCH-LOGICAL-c261t-c948a83e68ea7abee4793fed0d28c3da66cf049c4bb5f863bd05f1f6492020a13</originalsourceid><addsrcrecordid>eNotkMtKxDAARYMoOI6u_IGCS6nm1Tw2ggzOQwZcqOuQpol2rE1N0inz93asq3MXl8vlAHCN4B3BhNw_b5avOUYSY3ICZkjwIheS8dMxQ0ZzhCk7Bxcx7iBEvMBwBh7WhzLUVdZ539TtRzbU6TMb9N42NsXM-ZAZ3-5906fat7rJWtuHP6TBh694Cc6cbqK9-uccvC-f3hbrfPuy2iwet7nBDKXcSCq0IJYJq7kuraVcEmcrWGFhSKUZMw5SaWhZFk4wUlawcMgxKjHEUCMyBzfTbhf8T29jUjvfh_FQVJjRQnBOkBxbt1PLBB9jsE51of7W4aAQVEdB6ihITYLILwgAWPU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2645877319</pqid></control><display><type>article</type><title>Hybrid pooling with wavelets for convolutional neural networks</title><source>Business Source Complete (EB_SDU_P3)</source><creator>Trevino-Sanchez, Daniel ; Alarcon-Aquino, Vicente</creator><creatorcontrib>Trevino-Sanchez, Daniel ; Alarcon-Aquino, Vicente</creatorcontrib><description>The need to detect and classify objects correctly is a constant challenge, being able to recognize them at different scales and scenarios, sometimes cropped or badly lit is not an easy task. Convolutional neural networks (CNN) have become a widely applied technique since they are completely trainable and suitable to extract features. However, the growing number of convolutional neural networks applications constantly pushes their accuracy improvement. Initially, those improvements involved the use of large datasets, augmentation techniques, and complex algorithms. These methods may have a high computational cost. Nevertheless, feature extraction is known to be the heart of the problem. As a result, other approaches combine different technologies to extract better features to improve the accuracy without the need of more powerful hardware resources. In this paper, we propose a hybrid pooling method that incorporates multiresolution analysis within the CNN layers to reduce the feature map size without losing details. To prevent relevant information from losing during the downsampling process an existing pooling method is combined with wavelet transform technique, keeping those details "alive" and enriching other stages of the CNN. Achieving better quality characteristics improves CNN accuracy. To validate this study, ten pooling methods, including the proposed model, are tested using four benchmark datasets. The results are compared with four of the evaluated methods, which are also considered as the state-of-the-art.</description><identifier>ISSN: 1064-1246</identifier><identifier>EISSN: 1875-8967</identifier><identifier>DOI: 10.3233/JIFS-219223</identifier><language>eng</language><publisher>Amsterdam: IOS Press BV</publisher><subject>Accuracy ; Algorithms ; Artificial neural networks ; Datasets ; Feature extraction ; Feature maps ; Multiresolution analysis ; Neural networks ; Object recognition ; Wavelet transforms</subject><ispartof>Journal of intelligent &amp; fuzzy systems, 2022-01, Vol.42 (5), p.4327-4336</ispartof><rights>Copyright IOS Press BV 2022</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c261t-c948a83e68ea7abee4793fed0d28c3da66cf049c4bb5f863bd05f1f6492020a13</citedby><cites>FETCH-LOGICAL-c261t-c948a83e68ea7abee4793fed0d28c3da66cf049c4bb5f863bd05f1f6492020a13</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Trevino-Sanchez, Daniel</creatorcontrib><creatorcontrib>Alarcon-Aquino, Vicente</creatorcontrib><title>Hybrid pooling with wavelets for convolutional neural networks</title><title>Journal of intelligent &amp; fuzzy systems</title><description>The need to detect and classify objects correctly is a constant challenge, being able to recognize them at different scales and scenarios, sometimes cropped or badly lit is not an easy task. Convolutional neural networks (CNN) have become a widely applied technique since they are completely trainable and suitable to extract features. However, the growing number of convolutional neural networks applications constantly pushes their accuracy improvement. Initially, those improvements involved the use of large datasets, augmentation techniques, and complex algorithms. These methods may have a high computational cost. Nevertheless, feature extraction is known to be the heart of the problem. As a result, other approaches combine different technologies to extract better features to improve the accuracy without the need of more powerful hardware resources. In this paper, we propose a hybrid pooling method that incorporates multiresolution analysis within the CNN layers to reduce the feature map size without losing details. To prevent relevant information from losing during the downsampling process an existing pooling method is combined with wavelet transform technique, keeping those details "alive" and enriching other stages of the CNN. Achieving better quality characteristics improves CNN accuracy. To validate this study, ten pooling methods, including the proposed model, are tested using four benchmark datasets. The results are compared with four of the evaluated methods, which are also considered as the state-of-the-art.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Artificial neural networks</subject><subject>Datasets</subject><subject>Feature extraction</subject><subject>Feature maps</subject><subject>Multiresolution analysis</subject><subject>Neural networks</subject><subject>Object recognition</subject><subject>Wavelet transforms</subject><issn>1064-1246</issn><issn>1875-8967</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNotkMtKxDAARYMoOI6u_IGCS6nm1Tw2ggzOQwZcqOuQpol2rE1N0inz93asq3MXl8vlAHCN4B3BhNw_b5avOUYSY3ICZkjwIheS8dMxQ0ZzhCk7Bxcx7iBEvMBwBh7WhzLUVdZ539TtRzbU6TMb9N42NsXM-ZAZ3-5906fat7rJWtuHP6TBh694Cc6cbqK9-uccvC-f3hbrfPuy2iwet7nBDKXcSCq0IJYJq7kuraVcEmcrWGFhSKUZMw5SaWhZFk4wUlawcMgxKjHEUCMyBzfTbhf8T29jUjvfh_FQVJjRQnBOkBxbt1PLBB9jsE51of7W4aAQVEdB6ihITYLILwgAWPU</recordid><startdate>20220101</startdate><enddate>20220101</enddate><creator>Trevino-Sanchez, Daniel</creator><creator>Alarcon-Aquino, Vicente</creator><general>IOS Press BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20220101</creationdate><title>Hybrid pooling with wavelets for convolutional neural networks</title><author>Trevino-Sanchez, Daniel ; Alarcon-Aquino, Vicente</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c261t-c948a83e68ea7abee4793fed0d28c3da66cf049c4bb5f863bd05f1f6492020a13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Artificial neural networks</topic><topic>Datasets</topic><topic>Feature extraction</topic><topic>Feature maps</topic><topic>Multiresolution analysis</topic><topic>Neural networks</topic><topic>Object recognition</topic><topic>Wavelet transforms</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Trevino-Sanchez, Daniel</creatorcontrib><creatorcontrib>Alarcon-Aquino, Vicente</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Journal of intelligent &amp; fuzzy systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Trevino-Sanchez, Daniel</au><au>Alarcon-Aquino, Vicente</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Hybrid pooling with wavelets for convolutional neural networks</atitle><jtitle>Journal of intelligent &amp; fuzzy systems</jtitle><date>2022-01-01</date><risdate>2022</risdate><volume>42</volume><issue>5</issue><spage>4327</spage><epage>4336</epage><pages>4327-4336</pages><issn>1064-1246</issn><eissn>1875-8967</eissn><abstract>The need to detect and classify objects correctly is a constant challenge, being able to recognize them at different scales and scenarios, sometimes cropped or badly lit is not an easy task. Convolutional neural networks (CNN) have become a widely applied technique since they are completely trainable and suitable to extract features. However, the growing number of convolutional neural networks applications constantly pushes their accuracy improvement. Initially, those improvements involved the use of large datasets, augmentation techniques, and complex algorithms. These methods may have a high computational cost. Nevertheless, feature extraction is known to be the heart of the problem. As a result, other approaches combine different technologies to extract better features to improve the accuracy without the need of more powerful hardware resources. In this paper, we propose a hybrid pooling method that incorporates multiresolution analysis within the CNN layers to reduce the feature map size without losing details. To prevent relevant information from losing during the downsampling process an existing pooling method is combined with wavelet transform technique, keeping those details "alive" and enriching other stages of the CNN. Achieving better quality characteristics improves CNN accuracy. To validate this study, ten pooling methods, including the proposed model, are tested using four benchmark datasets. The results are compared with four of the evaluated methods, which are also considered as the state-of-the-art.</abstract><cop>Amsterdam</cop><pub>IOS Press BV</pub><doi>10.3233/JIFS-219223</doi><tpages>10</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1064-1246
ispartof Journal of intelligent & fuzzy systems, 2022-01, Vol.42 (5), p.4327-4336
issn 1064-1246
1875-8967
language eng
recordid cdi_proquest_journals_2645877319
source Business Source Complete (EB_SDU_P3)
subjects Accuracy
Algorithms
Artificial neural networks
Datasets
Feature extraction
Feature maps
Multiresolution analysis
Neural networks
Object recognition
Wavelet transforms
title Hybrid pooling with wavelets for convolutional neural networks
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T10%3A02%3A21IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Hybrid%20pooling%20with%20wavelets%20for%20convolutional%20neural%20networks&rft.jtitle=Journal%20of%20intelligent%20&%20fuzzy%20systems&rft.au=Trevino-Sanchez,%20Daniel&rft.date=2022-01-01&rft.volume=42&rft.issue=5&rft.spage=4327&rft.epage=4336&rft.pages=4327-4336&rft.issn=1064-1246&rft.eissn=1875-8967&rft_id=info:doi/10.3233/JIFS-219223&rft_dat=%3Cproquest_cross%3E2645877319%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2645877319&rft_id=info:pmid/&rfr_iscdi=true