Identification of unannotated coding sequences and their physiological functions

Most protein-coding sequences (CDSs) are predicted sequences based on criteria such as a size sufficient to encode a product of at least 100 amino acids and with translation starting at an AUG initiation codon. However, recent studies based on ribosome profiling and mass spectrometry have shown that...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of biochemistry (Tokyo) 2023-03, Vol.173 (4), p.237-242
Hauptverfasser: Ichihara, Kazuya, Nakayama, Keiichi I, Matsumoto, Akinobu
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Most protein-coding sequences (CDSs) are predicted sequences based on criteria such as a size sufficient to encode a product of at least 100 amino acids and with translation starting at an AUG initiation codon. However, recent studies based on ribosome profiling and mass spectrometry have shown that several RNAs annotated as long as noncoding RNAs are actually translated to generate polypeptides of fewer than 100 amino acids and that many proteins are translated from near-cognate initiation codons such as CUG and GUG. Furthermore, studies of genetically engineered mouse models have revealed that such polypeptides and proteins contribute to diverse physiological processes. In this review, we describe the latest methods for the identification of unannotated CDSs and provide examples of their physiological functions.
ISSN:0021-924X
1756-2651
DOI:10.1093/jb/mvac064