Differential gene expression profile evaluation between uterine leiomyoma and leiomyosarcoma using a machine learning approach

The objective of this study is to differentiate between uterine leiomyomas (ULM) and uterine leiomyosarcomas (ULMS) by conducting molecular differential analysis and identifying potential prognostic biomarkers for diagnosis. The microarray datasets (GSEID: GSE64763 and GSE185543) were retrieved from...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Gynecology and obstetrics clinical medicine 2023-09, Vol.3 (3), p.154-162
Hauptverfasser: Upadhyay, Sonal, Bhushan, Ravi, Tripathi, Anima, Chaubey, Lavina, Diwakar, Amita, Dubey, Pawan K.
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The objective of this study is to differentiate between uterine leiomyomas (ULM) and uterine leiomyosarcomas (ULMS) by conducting molecular differential analysis and identifying potential prognostic biomarkers for diagnosis. The microarray datasets (GSEID: GSE64763 and GSE185543) were retrieved from the Gene Expression Omnibus database. Data preprocessing and differential gene expressions (DEGs) analysis were performed. The DEGs were further intersected to find the common DEGs in ULM and ULMS and further validation of selected DEGs were performed. Further, a machine learning classifier was also applied in the selection of biomarkers. Protein-protein interaction network based upon STRING v 10.5, was constructed. Additionally, Gene Ontology (GO) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway enrichment analyses were also performed to dissect possible functions and pathways. A total of 50 significant DEGs for ULM while 321 DEGs for ULMS have been identified with their official gene symbol. Between ULM and ULMS, a total of 14 common DEGs were identified of which 8 were up-regulated while 6 were down-regulated. The DEGs of (GSE185543) were also analyzed and the significant genes were retrieved common in both datasets for further analysis. Using a machine learning approach, 10 feature genes were identified. Using the expression profiles of these genes, a sequential minimal optimization (SMO) prediction model was built on the training set, and it accurately and reliably classified features expression in ULM and ULMS in the independent test set. Furthermore, Co- Enrichment analysis was also performed. The study identified several DEGs, including ZNF365, EPYC, COL11A1, SHOX2, MMP13, TNN, GPM6A, and GATA2, through cross-validation, machine learning classifier, and Co- Enrichment analysis. These candidate disease genes may provide valuable insight into the underlying mechanisms and could be used as potential diagnostic biomarkers for ULM and ULMS. However, further validation of these genes is necessary to better understand their roles in the pathogenesis of ULM and ULMS. •Similar pattern of pathological appearance difficult to differentiate between ULM and ULMS.•Total 176 DEGs with 101 over-expressed and 75 under expressed genes were identified.•Focal adhesion, ECM-receptor interaction, long-term depressions are major pathways for these DEGs.•Total 14 common DEGs were identified of which 8 were up-regulated while 6 were down-regulated.•Our results highlig
ISSN:2667-1646
2097-0587
2667-1646
DOI:10.1016/j.gocm.2023.08.003