COMPUTER FILES AND METHODS SUPPORTING FORENSIC ANALYSIS OF NUCLEOTIDE SEQUENCE DATA

In one illustrative embodiment, a method may comprise receiving a first text-based computer file including one or more records, each of the one or more records comprising nucleotide sequence data generated by a read of a massively parallel sequencing instrument, determining whether a portion of the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: BORNMAN, Daniel M, HEIZER, Esley M. Jr, MINARD-SMITH, Angela T, HESTER, Mark E, YANG, Boyu, YOUNG, Brian A
Format: Patent
Sprache:eng ; fre ; ger
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In one illustrative embodiment, a method may comprise receiving a first text-based computer file including one or more records, each of the one or more records comprising nucleotide sequence data generated by a read of a massively parallel sequencing instrument, determining whether a portion of the nucleotide sequence data of each record represents a short tandem repeat (STR) associated with a locus, placing each portion of the nucleotide sequence data determined to represent an STR associated with a locus into one of a number of locus-specific lists, determining a number of occurrences within each locus-specific list of identical nucleotide sequence data representing a unique STR, and generating a second text-based computer file including one or more records, each of the one or more records corresponding to a unique STR for which the number of occurrences of identical nucleotide sequence data representing the unique STR exceeded an abundance threshold.