Large scale genotype‐ and phenotype‐driven machine learning in Von Hippel‐Lindau disease

Von Hippel‐Lindau (VHL) disease is a hereditary cancer syndrome where individuals are predisposed to tumor development in the brain, adrenal gland, kidney, and other organs. It is caused by pathogenic variants in the VHL tumor suppressor gene. Standardized disease information has been difficult to c...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Human mutation 2022-09, Vol.43 (9), p.1268-1285
Hauptverfasser: Chiorean, Andreea, Farncombe, Kirsten M., Delong, Sean, Andric, Veronica, Ansar, Safa, Chan, Clarissa, Clark, Kaitlin, Danos, Arpad M., Gao, Yizhuo, Giles, Rachel H., Goldenberg, Anna, Jani, Payal, Krysiak, Kilannin, Kujan, Lynzey, Macpherson, Samantha, Maher, Eamonn R., McCoy, Liam G., Salama, Yasser, Saliba, Jason, Sheta, Lana, Griffith, Malachi, Griffith, Obi L., Erdman, Lauren, Ramani, Arun, Kim, Raymond H.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Von Hippel‐Lindau (VHL) disease is a hereditary cancer syndrome where individuals are predisposed to tumor development in the brain, adrenal gland, kidney, and other organs. It is caused by pathogenic variants in the VHL tumor suppressor gene. Standardized disease information has been difficult to collect due to the rarity and diversity of VHL patients. Over 4100 unique articles published until October 2019 were screened for germline genotype–phenotype data. Patient data were translated into standardized descriptions using Human Genome Variation Society gene variant nomenclature and Human Phenotype Ontology terms and has been manually curated into an open‐access knowledgebase called Clinical Interpretation of Variants in Cancer. In total, 634 unique VHL variants, 2882 patients, and 1991 families from 427 papers were captured. We identified relationship trends between phenotype and genotype data using classic statistical methods and spectral clustering unsupervised learning. Our analyses reveal earlier onset of pheochromocytoma/paraganglioma and retinal angiomas, phenotype co‐occurrences and genotype–phenotype correlations including hotspots. It confirms existing VHL associations and can be used to identify new patterns and associations in VHL disease. Our database serves as an aggregate knowledge translation tool to facilitate sharing information about the pathogenicity of VHL variants.
ISSN:1059-7794
1098-1004
DOI:10.1002/humu.24392