A vision transformer machine learning model for COVID-19 diagnosis using chest X-ray images

Bibliographic details
Published in:	Healthcare analytics (New York, N.Y.), 2024-06, Vol.5, p.100332, Article 100332
Main authors:	Chen, Tianyi; Philippi, Ian; Phan, Quoc Bao; Nguyen, Linh; Bui, Ngoc Thang; daCunha, Carlo; Nguyen, Tuy Tan
Format:	Article
Language:	English
Subjects:	Chest X-ray; Computer-aided diagnosis; COVID-19; Efficient neural networks; Machine learning; Vision transformer
Online access:	Full text

Description
Summary:	This study leverages machine learning to enhance the diagnostic accuracy of COVID-19 using chest X-rays. The study evaluates various architectures, including efficient neural networks (EfficientNet), multiscale vision transformers (MViT), efficient vision transformers (EfficientViT), and vision transformers (ViT), against a comprehensive open-source dataset comprising 3616 COVID-19, 6012 lung opacity, 10192 normal, and 1345 viral pneumonia images. The analysis, focusing on loss functions and evaluation metrics, demonstrates distinct performance variations among these models. Notably, multiscale models like MViT and EfficientNet tend towards overfitting. Conversely, our vision transformer model, innovatively fine-tuned (FT) on the encoder blocks, exhibits superior accuracy: 95.79% in four-class, 99.57% in three-class, and similarly high performance in binary classifications, along with a recall of 98.58%, precision of 98.87%, F1 score of 98.73%, specificity of 99.76%, and area under the receiver operating characteristic (ROC) curve (AUC) of 0.9993. The study confirms the vision transformer model’s efficacy through rigorous validation using quantitative metrics and visualization techniques and illustrates its superiority over conventional models. The innovative fine-tuning method applied to vision transformers presents a significant advancement in medical image analysis, offering a promising avenue for improving the accuracy and reliability of COVID-19 diagnosis from chest X-ray images.
• Present a comprehensive review of existing machine learning models for COVID-19 lung image classification.
• Provide an in-depth analysis of experimented model structures, focusing on the Vision Transformer.
• Perform an experimental evaluation of various model structures on the COVID-19 chest X-ray dataset to assess performance.
• Introduce an elaborately fine-tuned vision transformer model based on early experimental insights.
• Validate the performance of the proposed model using advanced evaluation metrics and visualization techniques.
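
The summary reports recall, precision, F1 score, and specificity alongside accuracy. As a reading aid, the sketch below shows how these quantities are conventionally derived from a multi-class confusion matrix in a one-vs-rest fashion. It is a minimal illustration under stated assumptions, not the authors' evaluation code: the function names `per_class_metrics` and `accuracy` are hypothetical, the record does not say how the paper averages over classes, and the example matrix is invented toy data rather than results from the study.

```python
def per_class_metrics(confusion):
    """Compute one-vs-rest metrics for each class.

    confusion[i][j] is the number of samples whose true class is i
    and whose predicted class is j (assumed layout).
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    results = []
    for k in range(n):
        tp = confusion[k][k]                                # correctly predicted as k
        fn = sum(confusion[k]) - tp                         # true k, predicted otherwise
        fp = sum(confusion[i][k] for i in range(n)) - tp    # predicted k, true otherwise
        tn = total - tp - fn - fp                           # everything else
        recall = tp / (tp + fn) if tp + fn else 0.0
        precision = tp / (tp + fp) if tp + fp else 0.0
        specificity = tn / (tn + fp) if tn + fp else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        results.append({
            "recall": recall,
            "precision": precision,
            "specificity": specificity,
            "f1": f1,
        })
    return results


def accuracy(confusion):
    """Fraction of all samples that lie on the diagonal."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[k][k] for k in range(len(confusion)))
    return correct / total if total else 0.0


if __name__ == "__main__":
    # Invented two-class toy matrix, for illustration only.
    toy = [[8, 2],
           [1, 9]]
    for label, metrics in enumerate(per_class_metrics(toy)):
        print(label, metrics)
    print("accuracy:", accuracy(toy))
```

With the class sizes listed in the summary (3616, 6012, 10192, and 1345 images), the classes are unequal in size, which is one reason per-class recall and specificity are informative in addition to overall accuracy.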
ISSN:	2772-4425
DOI:	10.1016/j.health.2024.100332