A pipeline for high throughput detection and mapping of SNPs from EST databases

Bibliographic details
Published in:	Molecular Breeding, 2010-06, Vol. 26 (1), p. 65-75
Main authors:	Anithakumari, A. M.; Tang, Jifeng; van Eck, Herman J.; Visser, Richard G. F.; Leunissen, Jack A. M.; Vosman, Ben; van der Linden, C. Gerard
Format:	Article
Language:	English
Subjects:	Biomedical and Life Sciences; Biotechnology; construction; discovery; EST database; Expressed sequence tags; Gene mapping; Genetic diversity; genome; Genotyping; haplotype; Illumina GoldenGate assay; Life Sciences; linkage maps; Loci; map-based cloning; Mapping; Markers; Molecular biology; Nucleotides; Plant biology; Plant Genetics and Genomics; Plant Pathology; Plant Physiology; Plant Sciences; Populations; potato; Potatoes; QualitySNP; Single-nucleotide polymorphism; single-nucleotide polymorphisms; Solanum tuberosum; varieties
Online access:	Full text

Description
Summary:	Single nucleotide polymorphisms (SNPs) represent the most abundant type of genetic variation that can be used as molecular markers. The SNPs that are hidden in sequence databases can be unlocked using bioinformatic tools. For efficient application of these SNPs, the sequence set should be as error-free as possible, should target single loci, and should be suitable for the SNP scoring platform of choice. We have developed a pipeline to effectively mine SNPs from public EST databases, with or without quality information, using the QualitySNP software, to select reliable SNPs, and to prepare the loci for analysis on the Illumina GoldenGate genotyping platform. The applicability of the pipeline was demonstrated using publicly available potato EST data, genotyping individuals from two diploid mapping populations and subsequently mapping the SNP markers (putative genes) in both populations. Over 7000 reliable SNPs were identified that met the criteria for genotyping on the GoldenGate platform. Of the 384 SNPs on the SNP array, approximately 12% dropped out. For the two potato mapping populations, 165 and 185 segregating SNP loci could be mapped on the respective genetic maps, illustrating the effectiveness of our pipeline for SNP selection and validation.
ISSN:	1380-3743, 1572-9788
DOI:	10.1007/s11032-009-9377-5
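
The array figures quoted in the summary can be turned into rough counts with a few lines of Python. This is only an illustrative sketch: the 12% drop-out rate is approximate in the source, so the derived counts are estimates, and the population labels and the `summarize` helper are placeholders introduced here, not names from the article. The summary also does not say which population yielded 165 and which 185 mapped loci.

```python
# Figures quoted in the summary; the 12% drop-out rate is approximate.
ARRAY_SIZE = 384
DROPOUT_RATE = 0.12
# Placeholder labels: the summary does not name the two populations.
MAPPED_LOCI = {"population_1": 165, "population_2": 185}


def summarize(array_size, dropout_rate, mapped):
    """Return estimated dropped and working SNP counts and mapped fractions."""
    dropped = round(array_size * dropout_rate)   # estimated failed assays
    working = array_size - dropped               # estimated usable assays
    fractions = {name: n / array_size for name, n in mapped.items()}
    return dropped, working, fractions


dropped, working, fractions = summarize(ARRAY_SIZE, DROPOUT_RATE, MAPPED_LOCI)
print(f"estimated dropped: {dropped}, estimated working: {working}")
for name, frac in fractions.items():
    print(f"{name}: {frac:.1%} of the array mapped")
```

Under these assumptions, roughly 46 of the 384 SNPs dropped out, and between about 43% and 48% of the array could be placed on each genetic map.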