Interpretable machine learning identifies paediatric Systemic Lupus Erythematosus subtypes based on gene expression data

Transcriptomic analyses are commonly used to identify differentially expressed genes between patients and controls, or within individuals across disease courses. These methods, whilst effective, cannot encompass the combinatorial effects of genes driving disease. We applied rule-based machine learni...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientific reports 2022-05, Vol.12 (1), p.7433-10, Article 7433
Hauptverfasser: Yones, Sara A., Annett, Alva, Stoll, Patricia, Diamanti, Klev, Holmfeldt, Linda, Barrenäs, Carl Fredrik, Meadows, Jennifer R. S., Komorowski, Jan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Transcriptomic analyses are commonly used to identify differentially expressed genes between patients and controls, or within individuals across disease courses. These methods, whilst effective, cannot encompass the combinatorial effects of genes driving disease. We applied rule-based machine learning (RBML) models and rule networks (RN) to an existing paediatric Systemic Lupus Erythematosus (SLE) blood expression dataset, with the goal of developing gene networks to separate low and high disease activity (DA1 and DA3). The resultant model had an 81% accuracy to distinguish between DA1 and DA3, with unsupervised hierarchical clustering revealing additional subgroups indicative of the immune axis involved or state of disease flare. These subgroups correlated with clinical variables, suggesting that the gene sets identified may further the understanding of gene networks that act in concert to drive disease progression. This included roles for genes (i) induced by interferons ( IFI35 and OTOF ), (ii) key to SLE cell types ( KLRB1 encoding CD161), or (iii) with roles in autophagy and NF-κB pathway responses ( CKAP4 ). As demonstrated here, RBML approaches have the potential to reveal novel gene patterns from within a heterogeneous disease, facilitating patient clinical and therapeutic stratification.
ISSN:2045-2322
2045-2322
DOI:10.1038/s41598-022-10853-1