Data from: Microhaplotypes provide increased power from short-read DNA sequences for relationship inference

The accelerating rate at which DNA sequence data is now generated by high-throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a single S...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Baetscher, Diana S., Clemento, Anthony J., Ng, Thomas C., Anderson, Eric C., Garza, John C.
Format: Dataset
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Baetscher, Diana S.
Clemento, Anthony J.
Ng, Thomas C.
Anderson, Eric C.
Garza, John C.
description The accelerating rate at which DNA sequence data is now generated by high-throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a single SNP per sequenced region ignores substantial additional information in the phased short-read sequences that are provided by high-throughput sequencing instruments. We target sequenced regions with multiple SNPs in kelp rockfish (Sebastes atrovirens) to determine “microhaplotypes” and then call these microhaplotypes as alleles at each locus. We then demonstrate how these multi-allelic marker data from 96 such loci dramatically increase power for relationship inference. The microhaplotype approach decreases false positive rates by several orders of magnitude, relative to calling bi-allelic SNPs, for two challenging analytical procedures, full sibling and single parent-offspring pair identification. The advent of phased short-read DNA sequence data, in conjunction with emerging analytical tools for their analysis, promises to improve efficiency by reducing the number of loci necessary for a particular level of statistical confidence, thereby lowering the cost of data collection and reducing the degree of physical linkage amongst markers used for relationship estimation. Such advances will facilitate collaborative research and management for migratory and other widespread species.
doi_str_mv 10.5061/dryad.5863d
format Dataset
fullrecord <record><control><sourceid>datacite_PQ8</sourceid><recordid>TN_cdi_datacite_primary_10_5061_dryad_5863d</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_5061_dryad_5863d</sourcerecordid><originalsourceid>FETCH-datacite_primary_10_5061_dryad_5863d3</originalsourceid><addsrcrecordid>eNqVjrEKwjAURbM4iDr5A2-X1pbSIm5iFRed3ENIXmiwbeJLVPr3psUfcLpw77lwGFvnWVpmVb5VNAiVlruqUHP2qEUQoMl2e7gaSbYRrrVhcOjBkX0bhWB6SSg8KnD2gzTR4BtLIYm9gvp2AI_PF_YyvrQlIGxFMLb3jXHxrpHGbclmWrQeV79csM35dD9eEhUdpAnIHZlO0MDzjI-qfFLlk2rxH_0FRW9PTA</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>Data from: Microhaplotypes provide increased power from short-read DNA sequences for relationship inference</title><source>DataCite</source><creator>Baetscher, Diana S. ; Clemento, Anthony J. ; Ng, Thomas C. ; Anderson, Eric C. ; Garza, John C.</creator><creatorcontrib>Baetscher, Diana S. ; Clemento, Anthony J. ; Ng, Thomas C. ; Anderson, Eric C. ; Garza, John C.</creatorcontrib><description>The accelerating rate at which DNA sequence data is now generated by high-throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a single SNP per sequenced region ignores substantial additional information in the phased short-read sequences that are provided by high-throughput sequencing instruments. We target sequenced regions with multiple SNPs in kelp rockfish (Sebastes atrovirens) to determine “microhaplotypes” and then call these microhaplotypes as alleles at each locus. We then demonstrate how these multi-allelic marker data from 96 such loci dramatically increase power for relationship inference. The microhaplotype approach decreases false positive rates by several orders of magnitude, relative to calling bi-allelic SNPs, for two challenging analytical procedures, full sibling and single parent-offspring pair identification. The advent of phased short-read DNA sequence data, in conjunction with emerging analytical tools for their analysis, promises to improve efficiency by reducing the number of loci necessary for a particular level of statistical confidence, thereby lowering the cost of data collection and reducing the degree of physical linkage amongst markers used for relationship estimation. Such advances will facilitate collaborative research and management for migratory and other widespread species.</description><identifier>DOI: 10.5061/dryad.5863d</identifier><language>eng</language><publisher>Dryad</publisher><subject>current ; High throughput DNA sequencing ; Microhaplotype ; Parentage ; Relationship Inference</subject><creationdate>2017</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,1894</link.rule.ids><linktorsrc>$$Uhttps://commons.datacite.org/doi.org/10.5061/dryad.5863d$$EView_record_in_DataCite.org$$FView_record_in_$$GDataCite.org$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Baetscher, Diana S.</creatorcontrib><creatorcontrib>Clemento, Anthony J.</creatorcontrib><creatorcontrib>Ng, Thomas C.</creatorcontrib><creatorcontrib>Anderson, Eric C.</creatorcontrib><creatorcontrib>Garza, John C.</creatorcontrib><title>Data from: Microhaplotypes provide increased power from short-read DNA sequences for relationship inference</title><description>The accelerating rate at which DNA sequence data is now generated by high-throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a single SNP per sequenced region ignores substantial additional information in the phased short-read sequences that are provided by high-throughput sequencing instruments. We target sequenced regions with multiple SNPs in kelp rockfish (Sebastes atrovirens) to determine “microhaplotypes” and then call these microhaplotypes as alleles at each locus. We then demonstrate how these multi-allelic marker data from 96 such loci dramatically increase power for relationship inference. The microhaplotype approach decreases false positive rates by several orders of magnitude, relative to calling bi-allelic SNPs, for two challenging analytical procedures, full sibling and single parent-offspring pair identification. The advent of phased short-read DNA sequence data, in conjunction with emerging analytical tools for their analysis, promises to improve efficiency by reducing the number of loci necessary for a particular level of statistical confidence, thereby lowering the cost of data collection and reducing the degree of physical linkage amongst markers used for relationship estimation. Such advances will facilitate collaborative research and management for migratory and other widespread species.</description><subject>current</subject><subject>High throughput DNA sequencing</subject><subject>Microhaplotype</subject><subject>Parentage</subject><subject>Relationship Inference</subject><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2017</creationdate><recordtype>dataset</recordtype><sourceid>PQ8</sourceid><recordid>eNqVjrEKwjAURbM4iDr5A2-X1pbSIm5iFRed3ENIXmiwbeJLVPr3psUfcLpw77lwGFvnWVpmVb5VNAiVlruqUHP2qEUQoMl2e7gaSbYRrrVhcOjBkX0bhWB6SSg8KnD2gzTR4BtLIYm9gvp2AI_PF_YyvrQlIGxFMLb3jXHxrpHGbclmWrQeV79csM35dD9eEhUdpAnIHZlO0MDzjI-qfFLlk2rxH_0FRW9PTA</recordid><startdate>20171114</startdate><enddate>20171114</enddate><creator>Baetscher, Diana S.</creator><creator>Clemento, Anthony J.</creator><creator>Ng, Thomas C.</creator><creator>Anderson, Eric C.</creator><creator>Garza, John C.</creator><general>Dryad</general><scope>DYCCY</scope><scope>PQ8</scope></search><sort><creationdate>20171114</creationdate><title>Data from: Microhaplotypes provide increased power from short-read DNA sequences for relationship inference</title><author>Baetscher, Diana S. ; Clemento, Anthony J. ; Ng, Thomas C. ; Anderson, Eric C. ; Garza, John C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-datacite_primary_10_5061_dryad_5863d3</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2017</creationdate><topic>current</topic><topic>High throughput DNA sequencing</topic><topic>Microhaplotype</topic><topic>Parentage</topic><topic>Relationship Inference</topic><toplevel>online_resources</toplevel><creatorcontrib>Baetscher, Diana S.</creatorcontrib><creatorcontrib>Clemento, Anthony J.</creatorcontrib><creatorcontrib>Ng, Thomas C.</creatorcontrib><creatorcontrib>Anderson, Eric C.</creatorcontrib><creatorcontrib>Garza, John C.</creatorcontrib><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Baetscher, Diana S.</au><au>Clemento, Anthony J.</au><au>Ng, Thomas C.</au><au>Anderson, Eric C.</au><au>Garza, John C.</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>Data from: Microhaplotypes provide increased power from short-read DNA sequences for relationship inference</title><date>2017-11-14</date><risdate>2017</risdate><abstract>The accelerating rate at which DNA sequence data is now generated by high-throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a single SNP per sequenced region ignores substantial additional information in the phased short-read sequences that are provided by high-throughput sequencing instruments. We target sequenced regions with multiple SNPs in kelp rockfish (Sebastes atrovirens) to determine “microhaplotypes” and then call these microhaplotypes as alleles at each locus. We then demonstrate how these multi-allelic marker data from 96 such loci dramatically increase power for relationship inference. The microhaplotype approach decreases false positive rates by several orders of magnitude, relative to calling bi-allelic SNPs, for two challenging analytical procedures, full sibling and single parent-offspring pair identification. The advent of phased short-read DNA sequence data, in conjunction with emerging analytical tools for their analysis, promises to improve efficiency by reducing the number of loci necessary for a particular level of statistical confidence, thereby lowering the cost of data collection and reducing the degree of physical linkage amongst markers used for relationship estimation. Such advances will facilitate collaborative research and management for migratory and other widespread species.</abstract><pub>Dryad</pub><doi>10.5061/dryad.5863d</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.5061/dryad.5863d
ispartof
issn
language eng
recordid cdi_datacite_primary_10_5061_dryad_5863d
source DataCite
subjects current
High throughput DNA sequencing
Microhaplotype
Parentage
Relationship Inference
title Data from: Microhaplotypes provide increased power from short-read DNA sequences for relationship inference
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T13%3A43%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Baetscher,%20Diana%20S.&rft.date=2017-11-14&rft_id=info:doi/10.5061/dryad.5863d&rft_dat=%3Cdatacite_PQ8%3E10_5061_dryad_5863d%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true