Optimizing pulmonary chest x-ray classification with stacked feature ensemble and swin transformer integration

This research presents an integrated framework designed to automate the classification of pulmonary chest x-ray images. Leveraging convolutional neural networks (CNNs) with a focus on transformer architectures, the aim is to improve both the accuracy and efficiency of pulmonary chest x-ray image ana...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Biomedical physics & engineering express 2024-11, Vol.11 (1), p.15009
Hauptverfasser: Mohanty, Manas Ranjan, Mallick, Pradeep Kumar, Reddy, Annapareddy V N
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page 15009
container_title Biomedical physics & engineering express
container_volume 11
creator Mohanty, Manas Ranjan
Mallick, Pradeep Kumar
Reddy, Annapareddy V N
description This research presents an integrated framework designed to automate the classification of pulmonary chest x-ray images. Leveraging convolutional neural networks (CNNs) with a focus on transformer architectures, the aim is to improve both the accuracy and efficiency of pulmonary chest x-ray image analysis. A central aspect of this approach involves utilizing pre-trained networks such as VGG16, ResNet50, and MobileNetV2 to create a feature ensemble. A notable innovation is the adoption of a stacked ensemble technique, which combines outputs from multiple pre-trained models to generate a comprehensive feature representation. In the feature ensemble approach, each image undergoes individual processing through the three pre-trained networks, and pooled images are extracted just before the flatten layer of each model. Consequently, three pooled images in 2D grayscale format are obtained for each original image. These pooled images serve as samples for creating 3D images resembling RGB images through stacking, intended for classifier input in subsequent analysis stages. By incorporating stacked pooling layers to facilitate feature ensemble, a broader range of features is utilized while effectively managing complexities associated with processing the augmented feature pool. Moreover, the study incorporates the Swin Transformer architecture, known for effectively capturing both local and global features. The Swin Transformer architecture is further optimized using the artificial hummingbird algorithm (AHA). By fine-tuning hyperparameters such as patch size, multi-layer perceptron (MLP) ratio, and channel numbers, the AHA optimization technique aims to maximize classification accuracy. The proposed integrated framework, featuring the AHA-optimized Swin Transformer classifier utilizing stacked features, is evaluated using three diverse chest x-ray datasets-VinDr-CXR, PediCXR, and MIMIC-CXR. The observed accuracies of 98.874%, 98.528%, and 98.958% respectively, underscore the robustness and generalizability of the developed model across various clinical scenarios and imaging conditions.
doi_str_mv 10.1088/2057-1976/ad8c46
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmed_primary_39504146</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3125484869</sourcerecordid><originalsourceid>FETCH-LOGICAL-c219t-19c39e84b4a50bdf5ea5657947cd5b8ce6ebd79230557cfa2cddb8b7eb94cf373</originalsourceid><addsrcrecordid>eNp1kEtPwzAQhC0Eoqj0zgn5yIFSJ7Gd-IgqXlKlXuBs-bEBQ-IE2xGPX09KoeLCaVermdHOh9BJRi4yUlWLnLBynomSL5StDOV76Gh32v-zT9AsxmdCSMZzzgU7RJNCMEIzyo-QX_fJte7T-UfcD03beRU-sHmCmPD7PKhxb1SMrnZGJdd5_ObSE45JmRewuAaVhgAYfIRWN4CVtzi-OY9TUD7WXWghYOcTPIZv-zE6qFUTYfYzp-jh-up-eTtfrW_ulperuckzkca3TSGgopoqRrStGSjGWSloaSzTlQEO2pYiLwhjpalVbqzVlS5BC2rqoiym6Gyb24fudRjLyNZFA02jPHRDlEWWM1rRiotRSrZSE7oYA9SyD64dKciMyA1ouSEpNyTlFvRoOf1JH3QLdmf4xToKzrcC1_XyuRuCH8v-n_cFl5GKaw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3125484869</pqid></control><display><type>article</type><title>Optimizing pulmonary chest x-ray classification with stacked feature ensemble and swin transformer integration</title><source>MEDLINE</source><source>IOP Publishing Journals</source><source>Institute of Physics (IOP) Journals - HEAL-Link</source><creator>Mohanty, Manas Ranjan ; Mallick, Pradeep Kumar ; Reddy, Annapareddy V N</creator><creatorcontrib>Mohanty, Manas Ranjan ; Mallick, Pradeep Kumar ; Reddy, Annapareddy V N</creatorcontrib><description>This research presents an integrated framework designed to automate the classification of pulmonary chest x-ray images. Leveraging convolutional neural networks (CNNs) with a focus on transformer architectures, the aim is to improve both the accuracy and efficiency of pulmonary chest x-ray image analysis. A central aspect of this approach involves utilizing pre-trained networks such as VGG16, ResNet50, and MobileNetV2 to create a feature ensemble. A notable innovation is the adoption of a stacked ensemble technique, which combines outputs from multiple pre-trained models to generate a comprehensive feature representation. In the feature ensemble approach, each image undergoes individual processing through the three pre-trained networks, and pooled images are extracted just before the flatten layer of each model. Consequently, three pooled images in 2D grayscale format are obtained for each original image. These pooled images serve as samples for creating 3D images resembling RGB images through stacking, intended for classifier input in subsequent analysis stages. By incorporating stacked pooling layers to facilitate feature ensemble, a broader range of features is utilized while effectively managing complexities associated with processing the augmented feature pool. Moreover, the study incorporates the Swin Transformer architecture, known for effectively capturing both local and global features. The Swin Transformer architecture is further optimized using the artificial hummingbird algorithm (AHA). By fine-tuning hyperparameters such as patch size, multi-layer perceptron (MLP) ratio, and channel numbers, the AHA optimization technique aims to maximize classification accuracy. The proposed integrated framework, featuring the AHA-optimized Swin Transformer classifier utilizing stacked features, is evaluated using three diverse chest x-ray datasets-VinDr-CXR, PediCXR, and MIMIC-CXR. The observed accuracies of 98.874%, 98.528%, and 98.958% respectively, underscore the robustness and generalizability of the developed model across various clinical scenarios and imaging conditions.</description><identifier>ISSN: 2057-1976</identifier><identifier>EISSN: 2057-1976</identifier><identifier>DOI: 10.1088/2057-1976/ad8c46</identifier><identifier>PMID: 39504146</identifier><language>eng</language><publisher>England: IOP Publishing</publisher><subject>Algorithms ; Databases, Factual ; feature ensemble ; Humans ; Image Processing, Computer-Assisted - methods ; Imaging, Three-Dimensional - methods ; Lung - diagnostic imaging ; MobileNetV2 ; Neural Networks, Computer ; pulmonary chest x-ray classification ; Radiography, Thoracic - methods ; ResNet50 ; self-attention transformer ; stacked feature ensemble ; VGG16</subject><ispartof>Biomedical physics &amp; engineering express, 2024-11, Vol.11 (1), p.15009</ispartof><rights>2024 IOP Publishing Ltd. All rights, including for text and data mining, AI training, and similar technologies, are reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c219t-19c39e84b4a50bdf5ea5657947cd5b8ce6ebd79230557cfa2cddb8b7eb94cf373</cites><orcidid>0000-0002-1207-0757</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://iopscience.iop.org/article/10.1088/2057-1976/ad8c46/pdf$$EPDF$$P50$$Giop$$H</linktopdf><link.rule.ids>314,776,780,27901,27902,53821,53868</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/39504146$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Mohanty, Manas Ranjan</creatorcontrib><creatorcontrib>Mallick, Pradeep Kumar</creatorcontrib><creatorcontrib>Reddy, Annapareddy V N</creatorcontrib><title>Optimizing pulmonary chest x-ray classification with stacked feature ensemble and swin transformer integration</title><title>Biomedical physics &amp; engineering express</title><addtitle>BPEX</addtitle><addtitle>Biomed. Phys. Eng. Express</addtitle><description>This research presents an integrated framework designed to automate the classification of pulmonary chest x-ray images. Leveraging convolutional neural networks (CNNs) with a focus on transformer architectures, the aim is to improve both the accuracy and efficiency of pulmonary chest x-ray image analysis. A central aspect of this approach involves utilizing pre-trained networks such as VGG16, ResNet50, and MobileNetV2 to create a feature ensemble. A notable innovation is the adoption of a stacked ensemble technique, which combines outputs from multiple pre-trained models to generate a comprehensive feature representation. In the feature ensemble approach, each image undergoes individual processing through the three pre-trained networks, and pooled images are extracted just before the flatten layer of each model. Consequently, three pooled images in 2D grayscale format are obtained for each original image. These pooled images serve as samples for creating 3D images resembling RGB images through stacking, intended for classifier input in subsequent analysis stages. By incorporating stacked pooling layers to facilitate feature ensemble, a broader range of features is utilized while effectively managing complexities associated with processing the augmented feature pool. Moreover, the study incorporates the Swin Transformer architecture, known for effectively capturing both local and global features. The Swin Transformer architecture is further optimized using the artificial hummingbird algorithm (AHA). By fine-tuning hyperparameters such as patch size, multi-layer perceptron (MLP) ratio, and channel numbers, the AHA optimization technique aims to maximize classification accuracy. The proposed integrated framework, featuring the AHA-optimized Swin Transformer classifier utilizing stacked features, is evaluated using three diverse chest x-ray datasets-VinDr-CXR, PediCXR, and MIMIC-CXR. The observed accuracies of 98.874%, 98.528%, and 98.958% respectively, underscore the robustness and generalizability of the developed model across various clinical scenarios and imaging conditions.</description><subject>Algorithms</subject><subject>Databases, Factual</subject><subject>feature ensemble</subject><subject>Humans</subject><subject>Image Processing, Computer-Assisted - methods</subject><subject>Imaging, Three-Dimensional - methods</subject><subject>Lung - diagnostic imaging</subject><subject>MobileNetV2</subject><subject>Neural Networks, Computer</subject><subject>pulmonary chest x-ray classification</subject><subject>Radiography, Thoracic - methods</subject><subject>ResNet50</subject><subject>self-attention transformer</subject><subject>stacked feature ensemble</subject><subject>VGG16</subject><issn>2057-1976</issn><issn>2057-1976</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp1kEtPwzAQhC0Eoqj0zgn5yIFSJ7Gd-IgqXlKlXuBs-bEBQ-IE2xGPX09KoeLCaVermdHOh9BJRi4yUlWLnLBynomSL5StDOV76Gh32v-zT9AsxmdCSMZzzgU7RJNCMEIzyo-QX_fJte7T-UfcD03beRU-sHmCmPD7PKhxb1SMrnZGJdd5_ObSE45JmRewuAaVhgAYfIRWN4CVtzi-OY9TUD7WXWghYOcTPIZv-zE6qFUTYfYzp-jh-up-eTtfrW_ulperuckzkca3TSGgopoqRrStGSjGWSloaSzTlQEO2pYiLwhjpalVbqzVlS5BC2rqoiym6Gyb24fudRjLyNZFA02jPHRDlEWWM1rRiotRSrZSE7oYA9SyD64dKciMyA1ouSEpNyTlFvRoOf1JH3QLdmf4xToKzrcC1_XyuRuCH8v-n_cFl5GKaw</recordid><startdate>20241106</startdate><enddate>20241106</enddate><creator>Mohanty, Manas Ranjan</creator><creator>Mallick, Pradeep Kumar</creator><creator>Reddy, Annapareddy V N</creator><general>IOP Publishing</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-1207-0757</orcidid></search><sort><creationdate>20241106</creationdate><title>Optimizing pulmonary chest x-ray classification with stacked feature ensemble and swin transformer integration</title><author>Mohanty, Manas Ranjan ; Mallick, Pradeep Kumar ; Reddy, Annapareddy V N</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c219t-19c39e84b4a50bdf5ea5657947cd5b8ce6ebd79230557cfa2cddb8b7eb94cf373</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Databases, Factual</topic><topic>feature ensemble</topic><topic>Humans</topic><topic>Image Processing, Computer-Assisted - methods</topic><topic>Imaging, Three-Dimensional - methods</topic><topic>Lung - diagnostic imaging</topic><topic>MobileNetV2</topic><topic>Neural Networks, Computer</topic><topic>pulmonary chest x-ray classification</topic><topic>Radiography, Thoracic - methods</topic><topic>ResNet50</topic><topic>self-attention transformer</topic><topic>stacked feature ensemble</topic><topic>VGG16</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mohanty, Manas Ranjan</creatorcontrib><creatorcontrib>Mallick, Pradeep Kumar</creatorcontrib><creatorcontrib>Reddy, Annapareddy V N</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Biomedical physics &amp; engineering express</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mohanty, Manas Ranjan</au><au>Mallick, Pradeep Kumar</au><au>Reddy, Annapareddy V N</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Optimizing pulmonary chest x-ray classification with stacked feature ensemble and swin transformer integration</atitle><jtitle>Biomedical physics &amp; engineering express</jtitle><stitle>BPEX</stitle><addtitle>Biomed. Phys. Eng. Express</addtitle><date>2024-11-06</date><risdate>2024</risdate><volume>11</volume><issue>1</issue><spage>15009</spage><pages>15009-</pages><issn>2057-1976</issn><eissn>2057-1976</eissn><abstract>This research presents an integrated framework designed to automate the classification of pulmonary chest x-ray images. Leveraging convolutional neural networks (CNNs) with a focus on transformer architectures, the aim is to improve both the accuracy and efficiency of pulmonary chest x-ray image analysis. A central aspect of this approach involves utilizing pre-trained networks such as VGG16, ResNet50, and MobileNetV2 to create a feature ensemble. A notable innovation is the adoption of a stacked ensemble technique, which combines outputs from multiple pre-trained models to generate a comprehensive feature representation. In the feature ensemble approach, each image undergoes individual processing through the three pre-trained networks, and pooled images are extracted just before the flatten layer of each model. Consequently, three pooled images in 2D grayscale format are obtained for each original image. These pooled images serve as samples for creating 3D images resembling RGB images through stacking, intended for classifier input in subsequent analysis stages. By incorporating stacked pooling layers to facilitate feature ensemble, a broader range of features is utilized while effectively managing complexities associated with processing the augmented feature pool. Moreover, the study incorporates the Swin Transformer architecture, known for effectively capturing both local and global features. The Swin Transformer architecture is further optimized using the artificial hummingbird algorithm (AHA). By fine-tuning hyperparameters such as patch size, multi-layer perceptron (MLP) ratio, and channel numbers, the AHA optimization technique aims to maximize classification accuracy. The proposed integrated framework, featuring the AHA-optimized Swin Transformer classifier utilizing stacked features, is evaluated using three diverse chest x-ray datasets-VinDr-CXR, PediCXR, and MIMIC-CXR. The observed accuracies of 98.874%, 98.528%, and 98.958% respectively, underscore the robustness and generalizability of the developed model across various clinical scenarios and imaging conditions.</abstract><cop>England</cop><pub>IOP Publishing</pub><pmid>39504146</pmid><doi>10.1088/2057-1976/ad8c46</doi><tpages>22</tpages><orcidid>https://orcid.org/0000-0002-1207-0757</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 2057-1976
ispartof Biomedical physics & engineering express, 2024-11, Vol.11 (1), p.15009
issn 2057-1976
2057-1976
language eng
recordid cdi_pubmed_primary_39504146
source MEDLINE; IOP Publishing Journals; Institute of Physics (IOP) Journals - HEAL-Link
subjects Algorithms
Databases, Factual
feature ensemble
Humans
Image Processing, Computer-Assisted - methods
Imaging, Three-Dimensional - methods
Lung - diagnostic imaging
MobileNetV2
Neural Networks, Computer
pulmonary chest x-ray classification
Radiography, Thoracic - methods
ResNet50
self-attention transformer
stacked feature ensemble
VGG16
title Optimizing pulmonary chest x-ray classification with stacked feature ensemble and swin transformer integration
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T16%3A33%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Optimizing%20pulmonary%20chest%20x-ray%20classification%20with%20stacked%20feature%20ensemble%20and%20swin%20transformer%20integration&rft.jtitle=Biomedical%20physics%20&%20engineering%20express&rft.au=Mohanty,%20Manas%20Ranjan&rft.date=2024-11-06&rft.volume=11&rft.issue=1&rft.spage=15009&rft.pages=15009-&rft.issn=2057-1976&rft.eissn=2057-1976&rft_id=info:doi/10.1088/2057-1976/ad8c46&rft_dat=%3Cproquest_pubme%3E3125484869%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3125484869&rft_id=info:pmid/39504146&rfr_iscdi=true