Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk

(1) Background: Complex genetic relationships, including gene-gene (G × G; epistasis), gene( ), and gene-environment (G × E) interactions, explain a substantial portion of the heritability in multiple sclerosis (MS). Machine learning and data mining methods are promising approaches for uncovering hi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of environmental research and public health 2021-03, Vol.18 (5), p.2518
Hauptverfasser: Briggs, Farren B S, Sept, Corriene
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:(1) Background: Complex genetic relationships, including gene-gene (G × G; epistasis), gene( ), and gene-environment (G × E) interactions, explain a substantial portion of the heritability in multiple sclerosis (MS). Machine learning and data mining methods are promising approaches for uncovering higher order genetic relationships, but their use in MS have been limited. (2) Methods: Association rule mining (ARM), a combinatorial rule-based machine learning algorithm, was applied to genetic data for non-Latinx MS cases ( = 207) and controls ( = 179). The objective was to identify patterns (rules) amongst the known MS risk variants, including presence, absence, and 194 of the 200 common autosomal variants. Probabilistic measures (confidence and support) were used to mine rules. (3) Results: 114 rules met minimum requirements of 80% confidence and 5% support. The top ranking rule by confidence consisted of , -rs56678847 and -rs6880809; carriers of these variants had a significantly greater risk for MS (odds ratio = 20.2, 95% CI: 8.5, 37.5; = 4 × 10 ). Several variants were shared across rules, the most common was -rs78727559, which was in 32.5% of rules. (4) Conclusions: In summary, we demonstrate evidence that specific combinations of MS risk variants disproportionately confer elevated risk by applying a robust analytical framework to a modestly sized study population.
ISSN:1660-4601
1661-7827
1660-4601
DOI:10.3390/ijerph18052518