Method for judging saturation of sequencing data, computer readable medium and application

The invention provides a method for judging the saturation of sequencing data, a computer readable medium and an application, and relates to the field of sequencing technology. The method comprises the steps of: (a) providing the sequencing data, the sequencing data being a data set A comprising X r...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: XIAO FANG, JIA RUIKAI, YE HUA, JIA YANKAI, LIAO GUOJUAN, GUO SEN
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention provides a method for judging the saturation of sequencing data, a computer readable medium and an application, and relates to the field of sequencing technology. The method comprises the steps of: (a) providing the sequencing data, the sequencing data being a data set A comprising X reads; (b) clustering the X reads according to a preset sequence similarity threshold to generate N Clusters; (c) obtaining Probalility; the Probalility being the probability that the number of Clusters obtained by extracting the k-1-th read is i-1, one reads is then extracted, and the number of Clusters obtained is i; wherein k is a positive integer less than or equal to X, and i is a positive integer less than or equal to N; and (d) obtaining an index Saturated that measures the degree of saturation of the data, the more the data saturation degree index Saturated approaches zero, the more the sequencing data tend to be saturated. The method can accurately reflect the saturation degree of the sequencing data by num