CAGI SickKids challenges: Assessment of phenotype and variant predictions derived from clinical and genomic data of children with undiagnosed diseases

Whole‐genome sequencing (WGS) holds great potential as a diagnostic test. However, the majority of patients currently undergoing WGS lack a molecular diagnosis, largely due to the vast number of undiscovered disease genes and our inability to assess the pathogenicity of most genomic variants. The CA...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Human mutation 2019-09, Vol.40 (9), p.1373-1391
Hauptverfasser: Kasak, Laura, Hunter, Jesse M., Udani, Rupa, Bakolitsa, Constantina, Hu, Zhiqiang, Adhikari, Aashish N., Babbi, Giulia, Casadio, Rita, Gough, Julian, Guerrero, Rafael F., Jiang, Yuxiang, Joseph, Thomas, Katsonis, Panagiotis, Kotte, Sujatha, Kundu, Kunal, Lichtarge, Olivier, Martelli, Pier Luigi, Mooney, Sean D., Moult, John, Pal, Lipika R., Poitras, Jennifer, Radivojac, Predrag, Rao, Aditya, Sivadasan, Naveen, Sunderam, Uma, Saipradeep, V. G., Yin, Yizhou, Zaucha, Jan, Brenner, Steven E., Meyn, M. Stephen
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Whole‐genome sequencing (WGS) holds great potential as a diagnostic test. However, the majority of patients currently undergoing WGS lack a molecular diagnosis, largely due to the vast number of undiscovered disease genes and our inability to assess the pathogenicity of most genomic variants. The CAGI SickKids challenges attempted to address this knowledge gap by assessing state‐of‐the‐art methods for clinical phenotype prediction from genomes. CAGI4 and CAGI5 participants were provided with WGS data and clinical descriptions of 25 and 24 undiagnosed patients from the SickKids Genome Clinic Project, respectively. Predictors were asked to identify primary and secondary causal variants. In addition, for CAGI5, groups had to match each genome to one of three disorder categories (neurologic, ophthalmologic, and connective), and separately to each patient. The performance of matching genomes to categories was no better than random but two groups performed significantly better than chance in matching genomes to patients. Two of the ten variants proposed by two groups in CAGI4 were deemed to be diagnostic, and several proposed pathogenic variants in CAGI5 are good candidates for phenotype expansion. We discuss implications for improving in silico assessment of genomic variants and identifying new disease genes.
ISSN:1059-7794
1098-1004
DOI:10.1002/humu.23874