Comparative genomic signature representations of the emerging COVID-19 coronavirus and other coronaviruses: High identity and possible recombination between Bat and Pangolin coronaviruses


Bibliographic details
Published in: Genomics (San Diego, Calif.), 2020-11, Vol. 112 (6), p. 4189-4202
Main authors: Touati, Rabeb; Haddad-Boubaker, Sondes; Ferchichi, Imen; Messaoudi, Imen; Ouesleti, Afef Elloumi; Triki, Henda; Lachiri, Zied; Kharrat, Maher
Format: Article
Language: English
Description
Abstract: Coronaviruses are responsible for respiratory diseases in animals and humans. The combination of numerical encoding techniques and digital signal processing methods is becoming increasingly important in handling large genomic data. In this paper, we propose to analyze the SARS-CoV-2 genomic signature using a combination of different nucleotide representations and signal processing tools, with the aim of identifying its genetic origin. The sequence of SARS-CoV-2 was compared with 21 relevant sequences, including Bat, Yak and Pangolin coronavirus sequences. In addition, we developed a new algorithm to locate the nucleotide modifications. The results show that the Bat and Pangolin coronaviruses were the most closely related to SARS-CoV-2, with 96% and 86% identity, respectively, across the whole genome. Within the S gene sequence, the Pangolin sequence presents the highest local nucleotide identity. These findings suggest that SARS-CoV-2 arose through evolution from Bat and Pangolin strains. This study offers new ways to automatically characterize viruses.
• We propose to analyze the SARS-CoV-2 genomic signature using a combination of different nucleotide representations, with the aim of identifying its genetic origin.
• The SARS-CoV-2 sequence was compared with 21 relevant sequences, including Bat, Yak and Pangolin coronavirus sequences.
• The Bat and Pangolin coronaviruses were the most closely related to SARS-CoV-2.
• Within the S gene sequence, the Pangolin sequence presents the highest local nucleotide identity.
• This study suggests that SARS-CoV-2 arose through evolution from Bat and Pangolin strains.
ISSN: 0888-7543, 1089-8646
DOI: 10.1016/j.ygeno.2020.07.003
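
Illustration: the abstract mentions numerical encoding of nucleotides, signal processing, percent identity, and locating nucleotide modifications, but this record does not give the authors' actual representations or algorithm. The following is only a minimal sketch of those general ideas, under stated assumptions: a binary indicator encoding (one 0/1 signal per base) is assumed as the numerical representation, a single discrete Fourier coefficient stands in for the signal processing step, and the input sequences are assumed to be already aligned to equal length. All function names are hypothetical.

```python
import cmath


def indicator_sequences(seq):
    """Assumed encoding: one binary (0/1) signal per nucleotide A, C, G, T."""
    seq = seq.upper()
    return {base: [1 if ch == base else 0 for ch in seq] for base in "ACGT"}


def power_at_frequency(signal, k):
    """Squared magnitude of the k-th discrete Fourier coefficient of a signal."""
    n = len(signal)
    coeff = sum(
        x * cmath.exp(-2j * cmath.pi * k * t / n) for t, x in enumerate(signal)
    )
    return abs(coeff) ** 2


def percent_identity(a, b):
    """Percent of matching positions between two pre-aligned sequences.

    Positions where either sequence has a gap ('-') are ignored.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(a.upper(), b.upper()) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


def mismatch_positions(a, b):
    """0-based positions where two pre-aligned sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return [i for i, (x, y) in enumerate(zip(a.upper(), b.upper())) if x != y]
```

For real genomes, the sequences would first need to be aligned with a dedicated tool, and the naive Fourier sum above would be replaced by a fast transform; neither step is shown here.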