MOCCASIN: a method for correcting for known and unknown confounders in RNA splicing analysis

The effects of confounding factors on gene expression analysis have been extensively studied following the introduction of high-throughput microarrays and subsequently RNA sequencing. In contrast, there is a lack of equivalent analysis and tools for RNA splicing. Here we first assess the effect of c...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nature communications 2021-06, Vol.12 (1), p.3353-3353, Article 3353
Hauptverfasser: Slaff, Barry, Radens, Caleb M., Jewell, Paul, Jha, Anupama, Lahens, Nicholas F., Grant, Gregory R., Thomas-Tikhonenko, Andrei, Lynch, Kristen W., Barash, Yoseph
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The effects of confounding factors on gene expression analysis have been extensively studied following the introduction of high-throughput microarrays and subsequently RNA sequencing. In contrast, there is a lack of equivalent analysis and tools for RNA splicing. Here we first assess the effect of confounders on both expression and splicing quantifications in two large public RNA-Seq datasets (TARGET, ENCODE). We show quantification of splicing variations are affected at least as much as those of gene expression, revealing unwanted sources of variations in both datasets. Next, we develop MOCCASIN, a method to correct the effect of both known and unknown confounders on RNA splicing quantification and demonstrate MOCCASIN’s effectiveness on both synthetic and real data. Code, synthetic and corrected datasets are all made available as resources. Confounding factors on gene expression analysis can be analyzed by several existing tools. Here the authors develop an algorithm called MOCCASIN to correct the effect of known and unknown confounders on RNA splicing quantification.
ISSN:2041-1723
2041-1723
DOI:10.1038/s41467-021-23608-9