Syncmer-based evolutionary distance estimation and phylogenetic tree construction method
Main Authors: | , , , , |
---|---|
Format: | Patent |
Language: | chi ; eng |
Subjects: | |
Summary: | The invention discloses a syncmer-based method for estimating evolutionary distances and constructing phylogenetic trees. The method comprises the following steps: reading a sequence file and preprocessing each sequence; traversing all k-mers of length k in the sequence and, for each k-mer, finding the s-mer with the minimum hash value and using it to judge whether the k-mer is a syncmer; retaining, according to the MinHash algorithm, the n syncmers with the smallest hash values to form a sketch; comparing the sketches pairwise to calculate an evolutionary distance matrix; and converting the distance matrix into a phylogenetic tree. The invention aims to solve the problems that conventional distance-based phylogenetic tree construction methods have when processing large-scale data sets: increased resource consumption, reduced computational efficiency, reduced accuracy, poor scalability, and loss of stability when a sequence undergoes local mutation. The invention provides the sync |
---|
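
The steps named in the abstract (syncmer selection, MinHash sketching, pairwise comparison) can be illustrated with a minimal Python sketch. It is not the patented implementation. Several details are assumptions, because the record does not state them: the closed-syncmer rule (a k-mer is kept when its minimum-hash s-mer is the first or last s-mer), BLAKE2b as the hash function, uppercasing and skipping k-mers with non-ACGT characters as the preprocessing, and the Mash formula for turning a Jaccard estimate into a distance. All function names and default parameters are hypothetical.

```python
import hashlib
import math
from itertools import combinations


def h(s: str) -> int:
    """Stable 64-bit hash of a string (BLAKE2b is an assumed choice)."""
    return int.from_bytes(hashlib.blake2b(s.encode(), digest_size=8).digest(), "big")


def closed_syncmers(seq: str, k: int, s: int):
    """Yield each k-mer whose minimum-hash s-mer is its first or last s-mer."""
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        smer_hashes = [h(kmer[j:j + s]) for j in range(k - s + 1)]
        pos = smer_hashes.index(min(smer_hashes))  # leftmost minimum on ties
        if pos == 0 or pos == k - s:
            yield kmer


def sketch(seq: str, k: int, s: int, n: int) -> list[int]:
    """Bottom-n MinHash sketch: the n smallest distinct syncmer hashes, sorted."""
    seq = seq.upper()  # assumed preprocessing step
    hashes = {h(m) for m in closed_syncmers(seq, k, s) if set(m) <= set("ACGT")}
    return sorted(hashes)[:n]


def jaccard(a: list[int], b: list[int], n: int) -> float:
    """Estimate Jaccard similarity from the n smallest hashes of the merged sketches."""
    union = sorted(set(a) | set(b))[:n]
    if not union:
        return 0.0
    sa, sb = set(a), set(b)
    return sum(1 for x in union if x in sa and x in sb) / len(union)


def mash_distance(j: float, k: int) -> float:
    """Mash-style distance -ln(2j / (1 + j)) / k, clamped to [0, 1] (assumed formula)."""
    if j <= 0.0:
        return 1.0
    return max(0.0, min(1.0, -math.log(2 * j / (1 + j)) / k))


def distance_matrix(seqs: dict[str, str], k: int = 15, s: int = 5, n: int = 1000):
    """Return (names, nested dict of pairwise distances) for named sequences."""
    names = list(seqs)
    sk = {name: sketch(seqs[name], k, s, n) for name in names}
    d = {a: {b: 0.0 for b in names} for a in names}
    for a, b in combinations(names, 2):
        d[a][b] = d[b][a] = mash_distance(jaccard(sk[a], sk[b], n), k)
    return names, d
```

The resulting matrix could then be passed to any distance-based tree builder, such as neighbor joining or UPGMA. The record does not say which algorithm the invention uses for that final step.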