A protocol to evaluate RNA sequencing normalization methods

RNA sequencing technologies have allowed researchers to gain a better understanding of how the transcriptome affects disease. However, sequencing technologies often unintentionally introduce experimental error into RNA sequencing data. To counteract this, normalization methods are standardly applied...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	BMC bioinformatics 2019-12, Vol.20 (Suppl 24), p.679-679, Article 679
Hauptverfasser:	Abrams, Zachary B, Johnson, Travis S, Huang, Kun, Payne, Philip R O, Coombes, Kevin
Format:	Artikel
Sprache:	eng
Schlagworte:	Biological variability Biology Comparative analysis Datasets Evaluation Experiments Gene expression Gene sequencing Genetic research Genome Genomes Genomics Humans Methods Normalization Preservation Ribonucleic acid RNA RNA - genetics RNA sequencing RNASeq Sequence Analysis, RNA - methods Standard data Standardization Technology Transcriptome Transcriptomics
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	RNA sequencing technologies have allowed researchers to gain a better understanding of how the transcriptome affects disease. However, sequencing technologies often unintentionally introduce experimental error into RNA sequencing data. To counteract this, normalization methods are standardly applied with the intent of reducing the non-biologically derived variability inherent in transcriptomic measurements. However, the comparative efficacy of the various normalization techniques has not been tested in a standardized manner. Here we propose tests that evaluate numerous normalization techniques and applied them to a large-scale standard data set. These tests comprise a protocol that allows researchers to measure the amount of non-biological variability which is present in any data set after normalization has been performed, a crucial step to assessing the biological validity of data following normalization. In this study we present two tests to assess the validity of normalization methods applied to a large-scale data set collected for systematic evaluation purposes. We tested various RNASeq normalization procedures and concluded that transcripts per million (TPM) was the best performing normalization method based on its preservation of biological signal as compared to the other methods tested. Normalization is of vital importance to accurately interpret the results of genomic and transcriptomic experiments. More work, however, needs to be performed to optimize normalization methods for RNASeq data. The present effort helps pave the way for more systematic evaluations of normalization methods across different platforms. With our proposed schema researchers can evaluate their own or future normalization methods to further improve the field of RNASeq normalization.
ISSN:	1471-2105 1471-2105
DOI:	10.1186/s12859-019-3247-x