Quantitative sequence and open reading frame analysis based on codon bias

Bibliographic details
Published in: Journal of Systemics, Cybernetics and Informatics, 2006-02, Vol. 4 (1), p. 65-72
Main authors: Susan Rainey, Joe Repka
Format: Article
Language: English
Description
Summary: The frequencies with which the sixty-four codons occur in human coding DNA are known. If we assume that the codons occur randomly, subject only to these probabilities, then it is possible to predict trinucleotide frequencies in each of the five other reading frames. A model is developed for evaluating the extent to which a given sequence has trinucleotide frequencies compatible with coding DNA. This model is tested using known samples of coding DNA taken at random from GenBank, and good agreement is found. Practical and theoretical applications are discussed, including determination of coding open reading frames, evaluation of sequence data for frameshift mutations and examination of hypothetical genes.
ISSN: 1690-4524
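
The summary's central step, predicting trinucleotide frequencies in the five other reading frames from the codon frequencies alone, can be illustrated with a minimal Python sketch. It is not the authors' implementation. It relies only on the independence assumption stated in the summary: a trinucleotide in a shifted frame is the tail of one codon followed by the head of the next, so its probability is the product of two marginal sums. The function names, the frame labels ("+1", "-0", and so on) and the toy frequency table are assumptions made for illustration. The toy table is a placeholder, not human codon usage data: it gives the three stop codons probability zero and spreads the rest uniformly.

```python
from itertools import product

BASES = "ACGT"
CODONS = ["".join(t) for t in product(BASES, repeat=3)]
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def shifted_frame_frequencies(codon_freq, shift):
    """Predict trinucleotide frequencies in the same-strand frame offset by
    `shift` (1 or 2), assuming consecutive codons are independent."""
    if shift not in (1, 2):
        raise ValueError("shift must be 1 or 2")
    k = 3 - shift  # bases contributed by the first codon
    suffix = {}    # P(codon ends with a given k-mer)
    prefix = {}    # P(codon starts with a given shift-mer)
    for codon, p in codon_freq.items():
        suffix[codon[shift:]] = suffix.get(codon[shift:], 0.0) + p
        prefix[codon[:shift]] = prefix.get(codon[:shift], 0.0) + p
    return {t: suffix.get(t[:k], 0.0) * prefix.get(t[k:], 0.0) for t in CODONS}


def all_frame_frequencies(codon_freq):
    """Return predicted frequencies for the coding frame ("+0") and the five
    other frames. "-s" is the complementary-strand frame whose trinucleotides
    are the reverse complements of those in forward frame "+s"."""
    forward = {
        0: {t: codon_freq.get(t, 0.0) for t in CODONS},
        1: shifted_frame_frequencies(codon_freq, 1),
        2: shifted_frame_frequencies(codon_freq, 2),
    }
    frames = {f"+{s}": freq for s, freq in forward.items()}
    for s, freq in forward.items():
        frames[f"-{s}"] = {t: freq[reverse_complement(t)] for t in CODONS}
    return frames


# Placeholder table (NOT real data): stop codons excluded, the rest uniform.
STOPS = {"TAA", "TAG", "TGA"}
toy_freq = {c: (0.0 if c in STOPS else 1.0 / 61) for c in CODONS}

frames = all_frame_frequencies(toy_freq)
print(frames["+0"]["TAA"], frames["+1"]["TAA"])
```

Even with this placeholder table, the stop codon TAA has probability zero in the coding frame but a nonzero predicted frequency in the shifted frames. Differences of this kind are what make it possible to ask which frame of a given sequence is most compatible with coding DNA. With a real codon usage table substituted for `toy_freq`, the same functions would produce the six predicted distributions that an observed sequence could be compared against.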