Accurate prediction of colorectal cancer diagnosis using machine learning based on immunohistochemistry pathological images

Bibliographic details
Published in: Scientific Reports 2024-12, Vol. 14 (1), p. 29882-10
Main authors: Ning, Bobin; Chi, Jimei; Meng, Qingyu; Jia, Baoqing
Format: Article
Language: English
Description
Abstract: Colorectal cancer (CRC) ranks as the third most prevalent tumor and the second leading cause of mortality. Early and accurate diagnosis is important for improving patient treatment and prognosis. Machine learning technology and bioinformatics have provided novel approaches for cancer diagnosis. This study aims to develop a CRC diagnostic model based on immunohistochemical staining image features using machine learning methods. Initially, CRC disease-specific genes were identified through bioinformatics analysis and the SVM-RFE and Random Forest algorithms, using RNA-seq data from both the GEO and TCGA databases. Subsequently, these genes were verified using proteomics data from the CPTAC and HPA databases, resulting in the identification of four target proteins (AKR1B10, CA2, DHRS9, and ZG16) for further investigation. SVM and CNN were then employed to analyze and integrate the characteristics of immunohistochemical images to construct a reliable CRC diagnostic model. During training and validation of this model, cross-validation and external validation were used to ensure accuracy and reliability. The results demonstrate that the established diagnostic model performs excellently in distinguishing between CRC and normal controls (accuracy rate: 0.999), suggesting potential for clinical application. These findings are expected to provide new perspectives and methodologies for personalized diagnosis of CRC while offering more precise references for treatment.
ISSN: 2045-2322
DOI: 10.1038/s41598-024-76083-9
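
Illustrative note: the abstract mentions that the diagnostic model was assessed with cross-validation plus external validation and reports an accuracy rate. The following is a minimal sketch of that evaluation pattern only, not the authors' pipeline. Everything in it is an assumption made for illustration: the data are synthetic (four numeric features per sample, loosely standing in for one feature per target protein), a simple nearest-centroid classifier is swapped in for the SVM and CNN models, and all function names are hypothetical.

```python
import random
from statistics import mean


def make_synthetic(n_per_class, n_features, shift, rng):
    """Toy data: label 0 drawn around 0.0, label 1 drawn around `shift`."""
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            X.append([rng.gauss(label * shift, 1.0) for _ in range(n_features)])
            y.append(label)
    return X, y


def fit_centroids(X, y):
    """Stand-in 'model': the mean feature vector of each class."""
    centroids = {}
    for label in set(y):
        rows = [x for x, t in zip(X, y) if t == label]
        centroids[label] = [mean(col) for col in zip(*rows)]
    return centroids


def predict(centroids, x):
    """Assign x to the class whose centroid is closest (squared distance)."""
    def sqdist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda label: sqdist(centroids[label]))


def accuracy(centroids, X, y):
    """Fraction of samples whose predicted label matches the true label."""
    hits = sum(predict(centroids, x) == t for x, t in zip(X, y))
    return hits / len(y)


def k_fold_indices(n, k, rng):
    """Shuffle indices 0..n-1 and deal them into k disjoint folds."""
    idx = list(range(n))
    rng.shuffle(idx)
    return [idx[i::k] for i in range(k)]


def cross_validate(X, y, k, rng):
    """Train on k-1 folds, score on the held-out fold, for each fold."""
    scores = []
    for held_out in k_fold_indices(len(y), k, rng):
        held = set(held_out)
        train_X = [X[i] for i in range(len(y)) if i not in held]
        train_y = [y[i] for i in range(len(y)) if i not in held]
        model = fit_centroids(train_X, train_y)
        test_X = [X[i] for i in held_out]
        test_y = [y[i] for i in held_out]
        scores.append(accuracy(model, test_X, test_y))
    return scores


rng = random.Random(0)  # fixed seed so the sketch is repeatable

# Development cohort (used for cross-validation) and a separate cohort
# that plays the role of the external validation set.
X_dev, y_dev = make_synthetic(50, 4, 3.0, rng)
X_ext, y_ext = make_synthetic(30, 4, 3.0, rng)

cv_scores = cross_validate(X_dev, y_dev, 5, rng)
final_model = fit_centroids(X_dev, y_dev)
ext_acc = accuracy(final_model, X_ext, y_ext)

print("per-fold accuracy:", cv_scores)
print("mean CV accuracy:", mean(cv_scores))
print("external accuracy:", ext_acc)
```

The point of keeping the external cohort out of `cross_validate` entirely is that its accuracy then reflects data the model never saw during development, which is the distinction the abstract draws between the two validation steps.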