A data and text mining pipeline to annotate human mitochondrial variants with functional and clinical information
Background Human mitochondrial DNA has an important role in the cellular energy production through oxidative phosphorylation. Therefore, this process may be the cause and have an effect on mitochondrial DNA mutability, functional alteration, and disease onset related to a wide range of different cli...
Gespeichert in:
Veröffentlicht in: | Molecular genetics & genomic medicine 2020-02, Vol.8 (2), p.e1085-n/a, Article 1085 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Background
Human mitochondrial DNA has an important role in the cellular energy production through oxidative phosphorylation. Therefore, this process may be the cause and have an effect on mitochondrial DNA mutability, functional alteration, and disease onset related to a wide range of different clinical expressions and phenotypes. Although a large part of the observed variations is fixed in a population and hence expected to be benign, the estimation of the degree of the pathogenicity of any possible human mitochondrial DNA variant is clinically pivotal.
Methods
In this scenario, the establishment of standard criteria based on functional studies is required. In this context, a “data and text mining” pipeline is proposed here, developed using the programming language R, capable of extracting information regarding mitochondrial DNA functional studies and related clinical assessments from the literature, thus improving the annotation of human mitochondrial variants reported in the HmtVar database.
Results
The data mining pipeline has produced a list of 1,073 Pubmed IDs (PMIDs) from which the text mining pipeline has retrieved information on 932 human mitochondrial variants regarding experimental validation and clinical features.
Conclusions
The application of the pipeline will contribute to supporting the interpretation of pathogenicity of human mitochondrial variants by facilitating diagnosis to clinicians and researchers faced with this task.
The “data and text mining” pipeline, developed using the programming language R, is capable of extracting information from the literature regarding mtDNA functional studies and related clinical assessments. The application of the pipeline will contribute in supporting the interpretation of pathogenicity of human mitochondrial variants by facilitating diagnosis to clinicians and researchers that approach to this task. |
---|---|
ISSN: | 2324-9269 2324-9269 |
DOI: | 10.1002/mgg3.1085 |