Microsatellite density landscapes illustrate short tandem repeats aggregation in the complete reference human genome

Microsatellites are increasingly realized to have biological significance in human genome and health in past decades, the assembled complete reference sequence of human genome T2T-CHM13 brought great help for a comprehensive study of short tandem repeats in the human genome. Microsatellites density...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:BMC genomics 2024-10, Vol.25 (1), p.960-16, Article 960
Hauptverfasser: Xia, Yun, Li, Douyue, Chen, Tingyi, Pan, Saichao, Huang, Hanrou, Zhang, Wenxiang, Liang, Yulin, Fu, Yongzhuo, Peng, Zhuli, Zhang, Hongxi, Zhang, Liang, Peng, Shan, Shi, Ruixue, He, Xingxin, Zhou, Siqian, Jiao, Weili, Zhao, Xiangyan, Wu, Xiaolong, Zhou, Lan, Zhou, Jingyu, Ouyang, Qingjian, Tian, You, Jiang, Xiaoping, Zhou, Yi, Tang, Shiying, Shen, Junxiong, Ohshima, Kazusato, Tan, Zhongyang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Microsatellites are increasingly realized to have biological significance in human genome and health in past decades, the assembled complete reference sequence of human genome T2T-CHM13 brought great help for a comprehensive study of short tandem repeats in the human genome. Microsatellites density landscapes of all 24 chromosomes were built here for the first complete reference sequence of human genome T2T-CHM13. These landscapes showed that short tandem repeats (STRs) are prone to aggregate characteristically to form a large number of STRs density peaks. We classified 8,823 High Microsatellites Density Peaks (HMDPs), 35,257 Middle Microsatellites Density Peaks (MMDPs) and 199, 649 Low Microsatellites Density Peaks (LMDPs) on the 24 chromosomes; and also classified the motif types of every microsatellites density peak. These STRs density aggregation peaks are mainly composing of a single motif, and AT is the most dominant motif, followed by AATGG and CCATT motifs. And 514 genomic regions were characterized by microsatellite density feature in the full T2T-CHM13 genome. These landscape maps exhibited that microsatellites aggregate in many genomic positions to form a large number of microsatellite density peaks with composing of mainly single motif type in the complete reference genome, indicating that the local microsatellites density varies enormously along the every chromosome of T2T-CHM13.
ISSN:1471-2164
1471-2164
DOI:10.1186/s12864-024-10843-9