Characterizing and annotating the genome using RNA-seq data
Bioinformatics methods for various RNA-seq data analyses are in fast evolution with the improvement of sequencing technol- ogies. However, many challenges still exist in how to efficiently process the RNA-seq data to obtain accurate and comprehensive results. Here we reviewed the strategies for impr...
Gespeichert in:
Veröffentlicht in: | Science China. Life sciences 2017-02, Vol.60 (2), p.116-125 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Bioinformatics methods for various RNA-seq data analyses are in fast evolution with the improvement of sequencing technol- ogies. However, many challenges still exist in how to efficiently process the RNA-seq data to obtain accurate and comprehensive results. Here we reviewed the strategies for improving diverse transcriptomic studies and the annotation of genetic vari- ants based on RNA-seq data. Mapping RNA-seq reads to the genome and transcriptome represent two distinct methods for quantifying the expression of genes/transcripts. Besides the known genes annotated in current databases, many novel genes/transcripts (especially those long noncoding RNAs) still can be identified on the reference genome using RNA-seq. Moreover, owing to the incompleteness of current reference genomes, some novel genes are missing from them. Ge- nome-guided and de novo transcriptome reconstruction are two effective and complementary strategies for identifying those novel genes/transcripts on or beyond the reference genome. In addition, integrating the genes of distinct databases to conduct transcriptomics and genetics studies can improve the results of corresponding analyses. |
---|---|
ISSN: | 1674-7305 1869-1889 |
DOI: | 10.1007/s11427-015-0349-4 |