A statistical approach for array CGH data analysis

Microarray-CGH experiments are used to detect and map chromosomal imbalances, by hybridizing targets of genomic DNA from a test and a reference sample to sequences immobilized on a slide. These probes are genomic DNA sequences (BACs) that are mapped on the genome. The signal has a spatial coherence...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	BMC bioinformatics 2005-02, Vol.6 (1), p.27-27, Article 27
Hauptverfasser:	Picard, Franck, Robin, Stephane, Lavielle, Marc, Vaisse, Christian, Daudin, Jean-Jacques
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Chromosome Aberrations Chromosome Mapping Chromosomes, Artificial, Bacterial - metabolism Computational Biology - methods Computer Graphics Computer Simulation Data Interpretation, Statistical Database Management Systems DNA - genetics Gene Dosage Gene Expression Profiling Genetic Markers Genome Genome, Human Humans Life Sciences Models, Genetic Models, Statistical Normal Distribution Nucleic Acid Conformation Nucleic Acid Hybridization Oligonucleotide Array Sequence Analysis Oligonucleotide Probes Software User-Computer Interface
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Microarray-CGH experiments are used to detect and map chromosomal imbalances, by hybridizing targets of genomic DNA from a test and a reference sample to sequences immobilized on a slide. These probes are genomic DNA sequences (BACs) that are mapped on the genome. The signal has a spatial coherence that can be handled by specific statistical tools. Segmentation methods seem to be a natural framework for this purpose. A CGH profile can be viewed as a succession of segments that represent homogeneous regions in the genome whose BACs share the same relative copy number on average. We model a CGH profile by a random Gaussian process whose distribution parameters are affected by abrupt changes at unknown coordinates. Two major problems arise: to determine which parameters are affected by the abrupt changes (the mean and the variance, or the mean only), and the selection of the number of segments in the profile. We demonstrate that existing methods for estimating the number of segments are not well adapted in the case of array CGH data, and we propose an adaptive criterion that detects previously mapped chromosomal aberrations. The performances of this method are discussed based on simulations and publicly available data sets. Then we discuss the choice of modeling for array CGH data and show that the model with a homogeneous variance is adapted to this context. Array CGH data analysis is an emerging field that needs appropriate statistical tools. Process segmentation and model selection provide a theoretical framework that allows precise biological interpretations. Adaptive methods for model selection give promising results concerning the estimation of the number of altered regions on the genome.
ISSN:	1471-2105 1471-2105
DOI:	10.1186/1471-2105-6-27