SAMPLE CONTAMINATION DETECTION OF CONTAMINATED FRAGMENTS FOR CANCER CLASSIFICATION
Methods and systems for detecting contaminated fragments in a biological sample for cancer classification are disclosed. The system identifies multiple SNP site contamination markers and indel site contamination markers. The multiple SNP site contamination markers include at least two SNP sites with...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Patent |
Sprache: | eng ; fre ; ger |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Methods and systems for detecting contaminated fragments in a biological sample for cancer classification are disclosed. The system identifies multiple SNP site contamination markers and indel site contamination markers. The multiple SNP site contamination markers include at least two SNP sites within a threshold distance, having population haplotype frequency within a range of threshold frequencies, excluding guanine-adenine polymorphisms and/or cytosine-thymine polymorphisms, ensuring Hardy-Weinberg equilibrium, or any combination of the parameters above. The indel site contamination markers include indel sequences that are within a threshold length, having high complexity, having population haplotype frequency within a range of threshold frequencies, ensuring Hardy-Weinberg equilibrium, or any combination of the parameters above. The system identifies contamination markers for which the sample is homozygous. The system estimates the contamination level of the sample by identifying fragments having a haplotype that is different than the homozygous haplotype of the respective contamination marker site. |
---|