Aggregation of experts: an application in the field of "interactomics" (detection of interactions on the basis of genomic data)

Bibliographic details
Published in:	BMC bioinformatics 2018-11, Vol.19 (1), p.445-445, Article 445
Main authors:	Abo Alchamlat, Sinan, Farnir, Frédéric
Format:	Article
Language:	English
Subjects:	Agglomeration; Arthritis; Arthritis, Rheumatoid - genetics; Bioinformatics; Computational Biology - methods; Computer simulation; Control methods; Data processing; Determinism; Epistasis; Epistasis, Genetic; Gene mapping; Gene-gene interaction; Genes; Genetic diversity; Genetics & genetic processes; Genome-wide association study; Genome-Wide Association Study - methods; Genomes; Genomics - methods; Génétique & processus génétiques; Heterogeneity; Humans; K-nearest neighbors; Life sciences; Mapping; Methodology; Methods; Models, Genetic; Multi dimensional reduction; Performance enhancement; Phenotype; Polymorphism, Single Nucleotide; Rheumatoid arthritis; Sciences du vivant; Single nucleotide polymorphism; Statistical methods; Strategy
Online access:	Full text

Description
Abstract:	Despite the successful mapping of genes involved in the determinism of numerous traits, a large part of the genetic variation remains unexplained. A possible explanation is that the simple models used in many studies might not properly fit the actual underlying situations. Consequently, various methods have attempted to map several genomic regions simultaneously, assuming that these regions might interact and lead to a complex determinism for various traits. Despite some successes, no gold-standard methodology has emerged. Combining several interaction mapping methods might be a better strategy, leading to positive results over a larger set of situations. Our work is a step in that direction. We first demonstrated why aggregating results from several distinct methods might increase statistical power while controlling the type I error. We illustrated the approach using 6 existing methods (MDR, Boost, BHIT, KNN-MDR, MegaSNPHunter and AntEpiSeeker) on simulated and real data sets. We used a very simple aggregation strategy: a majority vote across the best loci combinations identified by the individual methods. To assess the performance of our aggregation approach in problems where most individual methods tend to fail, we simulated difficult situations where no marginal effects of individual genes exist and where genetic heterogeneity is present. We also demonstrated the use of the strategy on real data, using a WTCCC dataset on rheumatoid arthritis. Because we used simplistic assumptions to infer the expected power of the aggregation method, the actual power estimated from our simulations turned out to be somewhat smaller than theoretically expected. The results nevertheless showed that grouping the results of several methods is advantageous in terms of power, accuracy and type I error control. Furthermore, as more methods become available in the future, a grouping strategy should become more advantageous, since adding more methods seems to improve the performance of the aggregated method. The aggregation of methods as a tool to detect genetic interactions is a potentially useful addition to the arsenal used in complex trait analyses.
ISSN:	1471-2105
DOI:	10.1186/s12859-018-2447-0
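
The aggregation strategy named in the abstract, a majority vote across the best loci combinations identified by the individual methods, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `majority_vote`, the SNP identifiers and the per-method results are all hypothetical, and two choices are assumptions made here for illustration, namely that the order of loci within a combination is ignored and that ties are returned together rather than broken.

```python
from collections import Counter


def majority_vote(best_by_method):
    """Aggregate one best loci combination per method by counting votes.

    best_by_method maps a method name to the combination of loci it
    ranked first. Returns (winners, votes, is_strict_majority), where
    winners lists every combination that received the highest vote count.
    """
    # Assumption: (rs1, rs2) and (rs2, rs1) denote the same combination,
    # so each combination is normalised to an order-free frozenset.
    votes = Counter(frozenset(combo) for combo in best_by_method.values())
    top = max(votes.values())
    # Assumption: ties are reported together, not broken arbitrarily.
    winners = sorted(tuple(sorted(c)) for c, n in votes.items() if n == top)
    is_strict_majority = top > len(best_by_method) / 2
    return winners, top, is_strict_majority


# Hypothetical top-ranked combinations, one per method (invented data).
best_by_method = {
    "MDR": ("rs1", "rs2"),
    "Boost": ("rs2", "rs1"),
    "BHIT": ("rs1", "rs2"),
    "KNN-MDR": ("rs3", "rs4"),
    "MegaSNPHunter": ("rs1", "rs2"),
    "AntEpiSeeker": ("rs5", "rs6"),
}

winners, votes, is_strict_majority = majority_vote(best_by_method)
print(winners, votes, is_strict_majority)
```

In this invented example four of the six methods agree on the same pair of loci, so that pair wins with a strict majority. How the published method handles ties or a winner without a strict majority is not stated in the abstract.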