Nonrandom patterns of simple and cryptic triplet repeats in coding and noncoding sequences
Triplet repeats of the sequence purine, purine, and pyrimidine [RRY(i)] are frequent and often polymorphic in humans. Some RRY(i) are composed predominantly of a continuous repeat of one sequence [simple RRY(i)], but the majority are cryptic RRY(i) that are not obvious until the bases are classified...
Gespeichert in:
Veröffentlicht in: | Genomics (San Diego, Calif.) Calif.), 1995-04, Vol.26 (3), p.510-520 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Triplet repeats of the sequence purine, purine, and pyrimidine [RRY(i)] are frequent and often polymorphic in humans. Some RRY(i) are composed predominantly of a continuous repeat of one sequence [simple RRY(i)], but the majority are cryptic RRY(i) that are not obvious until the bases are classified into R or Y before the full extent of the repeat becomes apparent. RRY(i) can be divided into 18 classes based on predominant nucleotides. These classes are highly nonrandom in abundance and in location within genes. In humans, simple or cryptic RRY(i), in which AAT or AAC triplets predominate, are preferentially located 3′ of
Alu repeats. RRY(i) with a predominance of AGC or GGC show a dramatic enrichment in coding sequence, and GGC also shows a dramatic enrichment in 5′ untranslated regions of genes. Characterization of RRY(i) present in coding regions identify 10 protein motifs (A
n
, D
n
, H
n
, P
n
, Q
n
, T
n
, G
n
S
0–3G
m
, (G/S)
n
, (S/G/N)
n
, and (L/P)
n
). Six of the protein motifs appear predominantly in DNA-binding proteins/transcription factors. Alignment of homologous protein sequences from other mammals reveals that both simple and cryptic RRY(i) are a major source of deletions or insertions in the genes that contain them. Cryptic RRY(i) may be candidates for triplet repeat genetic diseases and, when mutated in somatic cells, may contribute to carcinogenesis. |
---|---|
ISSN: | 0888-7543 1089-8646 |
DOI: | 10.1016/0888-7543(95)80169-M |