Syncmer-based evolutionary distance estimation and phylogenetic tree construction method

The invention discloses a syncmer-based evolutionary distance estimation and phylogenetic tree construction method. The method comprises the following steps of reading a sequence file and performing primary processing on a sequence; all kmer with the length of k on the sequence are traversed, and sm...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: WANG YAOBIN, NIE JINTONG, LIU HUAN, HUANG FAJUN, LUO MINZHONG
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention discloses a syncmer-based evolutionary distance estimation and phylogenetic tree construction method. The method comprises the following steps of reading a sequence file and performing primary processing on a sequence; all kmer with the length of k on the sequence are traversed, and smer with the minimum hash value on the kmer is found and judged; according to a MinHash algorithm, n syncmers with minimum hash values are finally reserved to form a sketch; comparing the sketches in pairs to calculate an evolutionary distance matrix; and converting the distance matrix into a phylogenetic tree. The objective of the invention is to solve the problems that when a conventional phylogenetic tree construction method based on distance is used for processing a large-scale data set, resource consumption is increased, calculation efficiency is reduced, accuracy is reduced, expansibility is poor, and stability cannot be maintained when a sequence is subjected to local mutation. The invention provides the sync