Rapid evolution of protein diversity by de novo origination in Oryza

New protein-coding genes that arise de novo from non-coding DNA sequences contribute to protein diversity. However, de novo gene origination is challenging to study as it requires high-quality reference genomes for closely related species, evidence for ancestral non-coding sequences, and transcripti...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nature ecology & evolution 2019-04, Vol.3 (4), p.679-690
Hauptverfasser: Zhang, Li, Ren, Yan, Yang, Tao, Li, Guangwei, Chen, Jianhai, Gschwend, Andrea R., Yu, Yeisoo, Hou, Guixue, Zi, Jin, Zhou, Ruo, Wen, Bo, Zhang, Jianwei, Chougule, Kapeel, Wang, Muhua, Copetti, Dario, Peng, Zhiyu, Zhang, Chengjun, Zhang, Yong, Ouyang, Yidan, Wing, Rod A., Liu, Siqi, Long, Manyuan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:New protein-coding genes that arise de novo from non-coding DNA sequences contribute to protein diversity. However, de novo gene origination is challenging to study as it requires high-quality reference genomes for closely related species, evidence for ancestral non-coding sequences, and transcription and translation of the new genes. High-quality genomes of 13 closely related Oryza species provide unprecedented opportunities to understand de novo origination events. Here, we identify a large number of young de novo genes with discernible recent ancestral non-coding sequences and evidence of translation. Using pipelines examining the synteny relationship between genomes and reciprocal-best whole-genome alignments, we detected at least 175 de novo open reading frames in the focal species O. sativa subspecies japonica , which were all detected in RNA sequencing-based transcriptomes. Mass spectrometry-based targeted proteomics and ribosomal profiling show translational evidence for 57% of the de novo genes. In recent divergence of Oryza , an average of 51.5 de novo genes per million years were generated and retained. We observed evolutionary patterns in which excess indels and early transcription were favoured in origination with a stepwise formation of gene structure. These data reveal that de novo genes contribute to the rapid evolution of protein diversity under positive selection. Analysis of high-quality genomes and transcriptomes of 13 closely related Oryza species coupled with proteomics finds about 50 de novo genes per million years.
ISSN:2397-334X
2397-334X
DOI:10.1038/s41559-019-0822-5