Data from: Microhaplotypes provide increased power from short-read DNA sequences for relationship inference
The accelerating rate at which DNA sequence data is now generated by high-throughput sequencing instruments provides both opportunities and challenges for population genetic and ecological investigations of animals and plants. We show here how the common practice of calling genotypes from a single S...
Main authors: | Baetscher, Diana S. ; Clemento, Anthony J. ; Ng, Thomas C. ; Anderson, Eric C. ; Garza, John C. |
---|---|
Format: | Dataset |
Language: | eng |
Subjects: | current ; High throughput DNA sequencing ; Microhaplotype ; Parentage ; Relationship Inference |
Online access: | Order full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Baetscher, Diana S. ; Clemento, Anthony J. ; Ng, Thomas C. ; Anderson, Eric C. ; Garza, John C. |
description | The accelerating rate at which DNA sequence data is now generated by
high-throughput sequencing instruments provides both opportunities and
challenges for population genetic and ecological investigations of animals
and plants. We show here how the common practice of calling genotypes from
a single SNP per sequenced region ignores substantial additional
information in the phased short-read sequences that are provided by
high-throughput sequencing instruments. We target sequenced regions with
multiple SNPs in kelp rockfish (Sebastes atrovirens) to determine
“microhaplotypes” and then call these microhaplotypes as alleles at each
locus. We then demonstrate how these multi-allelic marker data from 96
such loci dramatically increase power for relationship inference. The
microhaplotype approach decreases false positive rates by several orders
of magnitude, relative to calling bi-allelic SNPs, for two challenging
analytical procedures, full sibling and single parent-offspring pair
identification. The advent of phased short-read DNA sequence data, in
conjunction with emerging analytical tools for their analysis, promises to
improve efficiency by reducing the number of loci necessary for a
particular level of statistical confidence, thereby lowering the cost of
data collection and reducing the degree of physical linkage amongst
markers used for relationship estimation. Such advances will facilitate
collaborative research and management for migratory and other widespread
species. |
doi_str_mv | 10.5061/dryad.5863d |
format | Dataset |
publisher | Dryad |
date | 2017-11-14 |
oa | free_for_read |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.5061/dryad.5863d |
ispartof | |
issn | |
language | eng |
recordid | cdi_datacite_primary_10_5061_dryad_5863d |
source | DataCite |
subjects | current ; High throughput DNA sequencing ; Microhaplotype ; Parentage ; Relationship Inference |
title | Data from: Microhaplotypes provide increased power from short-read DNA sequences for relationship inference |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T13%3A43%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Baetscher,%20Diana%20S.&rft.date=2017-11-14&rft_id=info:doi/10.5061/dryad.5863d&rft_dat=%3Cdatacite_PQ8%3E10_5061_dryad_5863d%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |