Improved analytical methods for microarray-based genome-composition analysis

Bibliographic details
Published in:	Genome Biology (Online Edition) 2002-10, Vol.3 (11), p.RESEARCH0065-RESEARCH0065, Article research0065.1
Main authors:	Kim, Charles C; Joyce, Elizabeth A; Chan, Kaman; Falkow, Stanley
Format:	Article
Language:	English
Subjects:	Algorithms; Analysis; Bacteria; Campylobacter jejuni - genetics; Computational Biology - methods; Computational Biology - standards; Computational Biology - statistics & numerical data; Databases, Genetic - statistics & numerical data; DNA sequencing; Genes; Genes, Bacterial - genetics; Genetic research; Genome, Bacterial; Genomes; Genomics; Genotype; Helicobacter pylori; Helicobacter pylori - genetics; Humans; Methods; Oligonucleotide Array Sequence Analysis - methods; Oligonucleotide Array Sequence Analysis - standards; Oligonucleotide Array Sequence Analysis - statistics & numerical data; Reference Standards; Software
Online access:	Full text

Description
Summary:	Whereas genome sequencing has given us high-resolution pictures of many different species of bacteria, microarrays provide a means of obtaining information on genome composition for many strains of a given species. Genome-composition analysis using microarrays, or 'genomotyping', can be used to categorize genes into 'present' and 'divergent' categories based on the level of hybridization signal. This typically involves selecting a signal value that is used as a cutoff to discriminate present (high signal) and divergent (low signal) genes. Current methodology uses empirical determination of cutoffs for classification into these categories, but this methodology is subject to several problems that can result in the misclassification of many genes. We describe a method that depends on the shape of the signal-ratio distribution and does not require empirical determination of a cutoff. Moreover, the cutoff is determined on an array-to-array basis, accounting for variation in strain composition and hybridization quality. The algorithm also provides an estimate of the probability that any given gene is present, which provides a measure of confidence in the categorical assignments. Many genes previously classified as present using static methods are in fact divergent on the basis of microarray signal; this is corrected by our algorithm. We have reassigned hundreds of genes from previous genomotyping studies of Helicobacter pylori and Campylobacter jejuni strains, and expect that the algorithm should be widely applicable to genomotyping data.
ISSN:	1474-760X, 1465-6906, 1465-6914
DOI:	10.1186/gb-2002-3-11-research0065
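
The summary describes choosing a present/divergent cutoff separately for each array from the shape of its signal-ratio distribution, and reporting a probability that each gene is present. The record does not give the algorithm itself, so the sketch below is only an illustration of that general idea and not the authors' published method. It assumes log-transformed signal ratios and substitutes a two-component Gaussian mixture fitted by expectation-maximization; the function names (fit_array, posterior_present), the starting values and the synthetic data are all hypothetical.

```python
import math
import random


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def posterior_present(x, model):
    """Estimated probability that a gene with log-ratio x is 'present'."""
    w, (mean_p, sd_p), (mean_d, sd_d) = model
    p = w * normal_pdf(x, mean_p, sd_p)
    d = (1.0 - w) * normal_pdf(x, mean_d, sd_d)
    if p + d == 0.0:
        # Both densities underflowed: fall back to the nearer component mean.
        return 1.0 if abs(x - mean_p) <= abs(x - mean_d) else 0.0
    return p / (p + d)


def fit_array(log_ratios, n_iter=200, min_sd=0.05):
    """Fit a present/divergent mixture to ONE array's log-ratios (assumed model).

    Returns (weight_present, (mean_present, sd_present),
             (mean_divergent, sd_divergent)).
    """
    xs = sorted(log_ratios)
    n = len(xs)
    mean_all = sum(xs) / n
    sd_all = max(math.sqrt(sum((x - mean_all) ** 2 for x in xs) / n), min_sd)
    # Assumed starting point: most genes present (centered on the median),
    # divergent genes in the low-signal tail (10th percentile).
    model = (0.8, (xs[n // 2], sd_all), (xs[n // 10], sd_all))
    for _ in range(n_iter):
        # E-step: per-gene responsibility of the 'present' component.
        resp = [posterior_present(x, model) for x in xs]
        total_p = sum(resp)
        total_d = n - total_p
        if total_p < 1e-9 or total_d < 1e-9:
            break  # one component vanished; keep the last usable model
        # M-step: responsibility-weighted means and standard deviations.
        mean_p = sum(r * x for r, x in zip(resp, xs)) / total_p
        mean_d = sum((1.0 - r) * x for r, x in zip(resp, xs)) / total_d
        var_p = sum(r * (x - mean_p) ** 2 for r, x in zip(resp, xs)) / total_p
        var_d = sum((1.0 - r) * (x - mean_d) ** 2
                    for r, x in zip(resp, xs)) / total_d
        model = (total_p / n,
                 (mean_p, max(math.sqrt(var_p), min_sd)),
                 (mean_d, max(math.sqrt(var_d), min_sd)))
    return model


if __name__ == "__main__":
    # Synthetic array: 900 'present' genes near 0, 100 'divergent' near -3.
    random.seed(0)
    ratios = ([random.gauss(0.0, 0.3) for _ in range(900)]
              + [random.gauss(-3.0, 0.6) for _ in range(100)])
    model = fit_array(ratios)
    for x in (0.0, -1.0, -3.0):
        label = "present" if posterior_present(x, model) >= 0.5 else "divergent"
        print(x, round(posterior_present(x, model), 3), label)
```

Because the fit is repeated for every array, the implied cutoff (the log-ratio at which the estimated probability crosses 0.5) moves with each array's strain composition and hybridization quality instead of being fixed in advance.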