Geometric analysis of SARS-CoV-2 variants

•Geometric space is established for the study of SARS-CoV-2 data set.•In geometric space, SARS-CoV-2 sequences from same variants cluster together.•Distances between points of geometric space reflect their biological distances.•For points closer in geometric space, their biological relationships are...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Gene 2024-05, Vol.909, p.148291-148291, Article 148291
Hauptverfasser: Guan, Mengcen, Sun, Nan, Yau, Stephen S.-T.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Geometric space is established for the study of SARS-CoV-2 data set.•In geometric space, SARS-CoV-2 sequences from same variants cluster together.•Distances between points of geometric space reflect their biological distances.•For points closer in geometric space, their biological relationships are closer. SARS-CoV-2 as a severe respiratory disease has been prevalent around the world since its first discovery in 2019.As a single-stranded RNA virus, its high mutation rate makes its variants manifold and enables some of them to have high pathogenicity, such as Omicron variant, the most prevalent virus now. Research on the relationship of these SARS-CoV-2 variants, especially exploring their difference is a hot issue. In this study, we constructed a geometric space to represent all SARS-CoV-2 sequences of different variants. An alignment-free method: natural vector method was utilized to establish genome space. The genome space of SARS-CoV-2 was constructed based on the 24-dimensional natural vector and the appropriate metric was determined through performing phylogenetic analysises. Phylogenetic trees of different lineages constructed under the selected natural vector and metric coincided with the lineage naming standards, which means lineages with same alphabetical prefix cluster in phylogenetic trees. Furthermore, the relationships between the various GISAID clades as depicted by the natural graph primarily matched the description provided in the GISAID clade naming.The validity of our geometric space was demonstrated by these phylogenetic analysis results. So in this research, we constructed a geometry space for the genomes of the novel coronavirus SARS-CoV-2, which allows us to compare the different variants. Our geometric space is valuable for resolving the issues insides the virus.
ISSN:0378-1119
1879-0038
DOI:10.1016/j.gene.2024.148291