CARMO: a comprehensive annotation platform for functional exploration of rice multi‐omics data

Summary High‐throughput technology is gradually becoming a powerful tool for routine research in rice. Interpretation of biological significance from the huge amount of data is a critical but non‐trivial task, especially for rice, for which gene annotations rely heavily on sequence similarity rather...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Plant journal : for cell and molecular biology 2015-07, Vol.83 (2), p.359-374
Hauptverfasser: Wang, Jiawei, Qi, Meifang, Liu, Jian, Zhang, Yijing
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Summary High‐throughput technology is gradually becoming a powerful tool for routine research in rice. Interpretation of biological significance from the huge amount of data is a critical but non‐trivial task, especially for rice, for which gene annotations rely heavily on sequence similarity rather than direct experimental evidence. Here we describe the annotation platform for comprehensive annotation of rice multi‐omics data (CARMO), which provides multiple web‐based analysis tools for in‐depth data mining and visualization. The central idea involves systematic integration of 1819 samples from omics studies and diverse sources of functional evidence (15 401 terms), which are further organized into gene sets and higher‐level gene modules. In this way, the high‐throughput data may easily be compared across studies and platforms, and integration of multiple types of evidence allows biological interpretation from the level of gene functional modules with high confidence. In addition, the functions and pathways for thousands of genes lacking description or validation may be deduced based on concerted expression of genes within the constructed co‐expression networks or gene modules. Overall, CARMO provides comprehensive annotations for transcriptomic datasets, epi‐genomic modification sites, single nucleotide polymorphisms identified from genome re‐sequencing, and the large gene lists derived from these omics studies. Well‐organized results, as well as multiple tools for interactive visualization, are available through a user‐friendly web interface. Finally, we illustrate how CARMO enables biological insights using four examples, demonstrating that CARMO is a highly useful resource for intensive data mining and hypothesis generation based on rice multi‐omics data. CARMO is freely available online (http://bioinfo.sibs.ac.cn/carmo). Significance Statement The power of CARMO lies in the comprehensive collection and integration of information from both multi‐omics data and diverse functional evidence of rice, which was further curated into gene sets and higher level gene modules. In this way, the high‐throughput data can easily be compared across studies and platforms, and notably, integration of multiple types of evidence provides biological interpretation from the level of modules with high confidence. Examples in the manuscripts demonstrated that CARMO not only reproduced reported evidence, but also proposed useful functional insights for further experimental e
ISSN:0960-7412
1365-313X
DOI:10.1111/tpj.12894