A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information

Bibliographic details
Published in: Molecular genetics & genomic medicine 2020-02, Vol. 8 (2), p. e1085-n/a, Article 1085
Main authors: Vitale, Ornella; Preste, Roberto; Palmisano, Donato; Attimonelli, Marcella
Format: Article
Language: English
Online access: Full text
Description
Abstract: Background: Human mitochondrial DNA plays an important role in cellular energy production through oxidative phosphorylation. This process may therefore both contribute to and be affected by mitochondrial DNA mutability, functional alteration, and the onset of diseases with a wide range of clinical expressions and phenotypes. Although much of the observed variation is fixed in a population and hence expected to be benign, estimating the degree of pathogenicity of any possible human mitochondrial DNA variant is clinically pivotal. Methods: Such estimation requires standard criteria based on functional studies. To this end, a “data and text mining” pipeline, developed in the R programming language, is proposed. It extracts information on mitochondrial DNA functional studies and related clinical assessments from the literature, thus improving the annotation of human mitochondrial variants reported in the HmtVar database. Results: The data mining pipeline produced a list of 1,073 PubMed IDs (PMIDs), from which the text mining pipeline retrieved information on experimental validation and clinical features for 932 human mitochondrial variants. Conclusions: The pipeline will support the interpretation of the pathogenicity of human mitochondrial variants, facilitating diagnosis for clinicians and researchers faced with this task.
ISSN:2324-9269
DOI:10.1002/mgg3.1085