Diagram of the SAP

Copyright information:Taken from "Draft versus finished sequence data for DNA and protein diagnostic signature development"Nucleic Acids Research 2005;33(18):5838-5850.Published online 20 Oct 2005PMCID:PMC1266063.© The Author 2005. Published by Oxford University Press. All rights reserved...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Gardner, Shea N., Lam, Marisa W., Smith, Jason R., Torres, Clinton L., Slezak, Tom R.
Format: Bild
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Copyright information:Taken from "Draft versus finished sequence data for DNA and protein diagnostic signature development"Nucleic Acids Research 2005;33(18):5838-5850.Published online 20 Oct 2005PMCID:PMC1266063.© The Author 2005. Published by Oxford University Press. All rights reserved For an SAP run, first a pool of target genome and a pool of NN genomes are collected. Then many random subsamples of target and NN genomes are selected from the pool, and each subsample is run through either the DNA signature pipeline or the protein signature pipeline, which identify regions conserved among target genomes and unique relative to non-target genomes, where unique regions are evaluated by comparing to a large sequence database of all currently available bacterial and viral complete genomes or the non-redundant protein database, excluding NNs from the NN pool that are not in that random subsample. Thus, each run of the SAP requires many runs of the DNA or protein signature pipelines with different random samples, generating a range of outcomes that are plotted on range plots.
DOI:10.6084/m9.figshare.5811