Gasterosteus aculeatus gynogenetic reference genome and functional annotations version 1 and raw PacBio and Illumina data

Whole genome sequencing enables us to ask fundamental questions about the genetic basis of adaptation, population structure, and epigenetic mechanisms, but usually requires a suitable reference genome for making sense of the sequence data. While the availability of reference genomes has significantl...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Thorburn, Doko-Miles, Sagonas, Kostas, Binzer-Panchal, Mahesh, Chain, Frederic, Feulner, Philine, Bornberg-Bauer, Erich, Reusch, Thorsten, Samonte-Padilla, Irene, Milinski, Manfred, Lenz, Tobias, Eizaguirre, Christophe
Format: Dataset
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Thorburn, Doko-Miles
Sagonas, Kostas
Binzer-Panchal, Mahesh
Chain, Frederic
Feulner, Philine
Bornberg-Bauer, Erich
Reusch, Thorsten
Samonte-Padilla, Irene
Milinski, Manfred
Lenz, Tobias
Eizaguirre, Christophe
description Whole genome sequencing enables us to ask fundamental questions about the genetic basis of adaptation, population structure, and epigenetic mechanisms, but usually requires a suitable reference genome for making sense of the sequence data. While the availability of reference genomes has significantly improvement in both taxonomic coverage and overall quality, this poses a challenge for researchers in determining which reference genome best suits their data. Here we compare the use of two different reference genomes for the three-spined stickleback (Gasterosteus aculeatus), one novel genome from a European individual and the published reference genome of a North American individual. Specifically, we investigate the impact of using a local reference versus one generated from a differentiated population on several commonly used metrics in population genomics. Through mapping genome resequencing data of 60 sticklebacks from across Europe and North America, we confirmed genome quality is an important factor in choosing a reference genome. A local reference genome did offer increased mapping efficiency and genotyping accuracy, likely stemming from the higher similarity in genome sequence and synteny. Despite comparable distributions of the metrics generated across the genome using SNP data (i.e., π, Tajima’s D, and FST), window-based statistics using different references resulted in different outlier genes and enriched gene functions. In contrast, the marker-based analysis utilising DNA methylation distributions had a considerably higher overlap in outlier genes and functions when using different reference genomes. Overall, our results highlight how using a local reference genome can increase the resolution of genome scans when multiple similar-quality reference genomes are available. Such results have implications in the detection of signatures of selection.
doi_str_mv 10.5061/dryad.ncjsxksx8
format Dataset
fullrecord <record><control><sourceid>datacite_PQ8</sourceid><recordid>TN_cdi_datacite_primary_10_5061_dryad_ncjsxksx8</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_5061_dryad_ncjsxksx8</sourcerecordid><originalsourceid>FETCH-datacite_primary_10_5061_dryad_ncjsxksx83</originalsourceid><addsrcrecordid>eNqVjrsOwjAMRbMwIGBm9Q_QNkIgZhCvjYE9slK3CqQOSlKgf09aIXYW-x4_pCPEXBbZqljLvPQdlhnrW3jfw3szFt0RQyTvUmkDoG4tYUyp7tjVxBSNBk8VeWJNkCauIUAuoWpZR-MYbUJ2EXsI8CQfUgA5HHl8wQX11rgBz9a2jWGEEiNOxahCG2j27RORH_bX3WnRL7WJpB7eNOg7JQvVy6tBXv3kl_9_fADoElv1</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>Gasterosteus aculeatus gynogenetic reference genome and functional annotations version 1 and raw PacBio and Illumina data</title><source>DataCite</source><creator>Thorburn, Doko-Miles ; Sagonas, Kostas ; Binzer-Panchal, Mahesh ; Chain, Frederic ; Feulner, Philine ; Bornberg-Bauer, Erich ; Reusch, Thorsten ; Samonte-Padilla, Irene ; Milinski, Manfred ; Lenz, Tobias ; Eizaguirre, Christophe</creator><creatorcontrib>Thorburn, Doko-Miles ; Sagonas, Kostas ; Binzer-Panchal, Mahesh ; Chain, Frederic ; Feulner, Philine ; Bornberg-Bauer, Erich ; Reusch, Thorsten ; Samonte-Padilla, Irene ; Milinski, Manfred ; Lenz, Tobias ; Eizaguirre, Christophe</creatorcontrib><description>Whole genome sequencing enables us to ask fundamental questions about the genetic basis of adaptation, population structure, and epigenetic mechanisms, but usually requires a suitable reference genome for making sense of the sequence data. While the availability of reference genomes has significantly improvement in both taxonomic coverage and overall quality, this poses a challenge for researchers in determining which reference genome best suits their data. Here we compare the use of two different reference genomes for the three-spined stickleback (Gasterosteus aculeatus), one novel genome from a European individual and the published reference genome of a North American individual. Specifically, we investigate the impact of using a local reference versus one generated from a differentiated population on several commonly used metrics in population genomics. Through mapping genome resequencing data of 60 sticklebacks from across Europe and North America, we confirmed genome quality is an important factor in choosing a reference genome. A local reference genome did offer increased mapping efficiency and genotyping accuracy, likely stemming from the higher similarity in genome sequence and synteny. Despite comparable distributions of the metrics generated across the genome using SNP data (i.e., π, Tajima’s D, and FST), window-based statistics using different references resulted in different outlier genes and enriched gene functions. In contrast, the marker-based analysis utilising DNA methylation distributions had a considerably higher overlap in outlier genes and functions when using different reference genomes. Overall, our results highlight how using a local reference genome can increase the resolution of genome scans when multiple similar-quality reference genomes are available. Such results have implications in the detection of signatures of selection.</description><identifier>DOI: 10.5061/dryad.ncjsxksx8</identifier><language>eng</language><publisher>Dryad</publisher><subject>FOS: Biological sciences ; genome assembly ; gynogenetic ; Reference genomes ; reference mapping bias ; stickleback</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0002-0120-8829</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,1888</link.rule.ids><linktorsrc>$$Uhttps://commons.datacite.org/doi.org/10.5061/dryad.ncjsxksx8$$EView_record_in_DataCite.org$$FView_record_in_$$GDataCite.org$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Thorburn, Doko-Miles</creatorcontrib><creatorcontrib>Sagonas, Kostas</creatorcontrib><creatorcontrib>Binzer-Panchal, Mahesh</creatorcontrib><creatorcontrib>Chain, Frederic</creatorcontrib><creatorcontrib>Feulner, Philine</creatorcontrib><creatorcontrib>Bornberg-Bauer, Erich</creatorcontrib><creatorcontrib>Reusch, Thorsten</creatorcontrib><creatorcontrib>Samonte-Padilla, Irene</creatorcontrib><creatorcontrib>Milinski, Manfred</creatorcontrib><creatorcontrib>Lenz, Tobias</creatorcontrib><creatorcontrib>Eizaguirre, Christophe</creatorcontrib><title>Gasterosteus aculeatus gynogenetic reference genome and functional annotations version 1 and raw PacBio and Illumina data</title><description>Whole genome sequencing enables us to ask fundamental questions about the genetic basis of adaptation, population structure, and epigenetic mechanisms, but usually requires a suitable reference genome for making sense of the sequence data. While the availability of reference genomes has significantly improvement in both taxonomic coverage and overall quality, this poses a challenge for researchers in determining which reference genome best suits their data. Here we compare the use of two different reference genomes for the three-spined stickleback (Gasterosteus aculeatus), one novel genome from a European individual and the published reference genome of a North American individual. Specifically, we investigate the impact of using a local reference versus one generated from a differentiated population on several commonly used metrics in population genomics. Through mapping genome resequencing data of 60 sticklebacks from across Europe and North America, we confirmed genome quality is an important factor in choosing a reference genome. A local reference genome did offer increased mapping efficiency and genotyping accuracy, likely stemming from the higher similarity in genome sequence and synteny. Despite comparable distributions of the metrics generated across the genome using SNP data (i.e., π, Tajima’s D, and FST), window-based statistics using different references resulted in different outlier genes and enriched gene functions. In contrast, the marker-based analysis utilising DNA methylation distributions had a considerably higher overlap in outlier genes and functions when using different reference genomes. Overall, our results highlight how using a local reference genome can increase the resolution of genome scans when multiple similar-quality reference genomes are available. Such results have implications in the detection of signatures of selection.</description><subject>FOS: Biological sciences</subject><subject>genome assembly</subject><subject>gynogenetic</subject><subject>Reference genomes</subject><subject>reference mapping bias</subject><subject>stickleback</subject><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2023</creationdate><recordtype>dataset</recordtype><sourceid>PQ8</sourceid><recordid>eNqVjrsOwjAMRbMwIGBm9Q_QNkIgZhCvjYE9slK3CqQOSlKgf09aIXYW-x4_pCPEXBbZqljLvPQdlhnrW3jfw3szFt0RQyTvUmkDoG4tYUyp7tjVxBSNBk8VeWJNkCauIUAuoWpZR-MYbUJ2EXsI8CQfUgA5HHl8wQX11rgBz9a2jWGEEiNOxahCG2j27RORH_bX3WnRL7WJpB7eNOg7JQvVy6tBXv3kl_9_fADoElv1</recordid><startdate>20230120</startdate><enddate>20230120</enddate><creator>Thorburn, Doko-Miles</creator><creator>Sagonas, Kostas</creator><creator>Binzer-Panchal, Mahesh</creator><creator>Chain, Frederic</creator><creator>Feulner, Philine</creator><creator>Bornberg-Bauer, Erich</creator><creator>Reusch, Thorsten</creator><creator>Samonte-Padilla, Irene</creator><creator>Milinski, Manfred</creator><creator>Lenz, Tobias</creator><creator>Eizaguirre, Christophe</creator><general>Dryad</general><scope>DYCCY</scope><scope>PQ8</scope><orcidid>https://orcid.org/0000-0002-0120-8829</orcidid></search><sort><creationdate>20230120</creationdate><title>Gasterosteus aculeatus gynogenetic reference genome and functional annotations version 1 and raw PacBio and Illumina data</title><author>Thorburn, Doko-Miles ; Sagonas, Kostas ; Binzer-Panchal, Mahesh ; Chain, Frederic ; Feulner, Philine ; Bornberg-Bauer, Erich ; Reusch, Thorsten ; Samonte-Padilla, Irene ; Milinski, Manfred ; Lenz, Tobias ; Eizaguirre, Christophe</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-datacite_primary_10_5061_dryad_ncjsxksx83</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2023</creationdate><topic>FOS: Biological sciences</topic><topic>genome assembly</topic><topic>gynogenetic</topic><topic>Reference genomes</topic><topic>reference mapping bias</topic><topic>stickleback</topic><toplevel>online_resources</toplevel><creatorcontrib>Thorburn, Doko-Miles</creatorcontrib><creatorcontrib>Sagonas, Kostas</creatorcontrib><creatorcontrib>Binzer-Panchal, Mahesh</creatorcontrib><creatorcontrib>Chain, Frederic</creatorcontrib><creatorcontrib>Feulner, Philine</creatorcontrib><creatorcontrib>Bornberg-Bauer, Erich</creatorcontrib><creatorcontrib>Reusch, Thorsten</creatorcontrib><creatorcontrib>Samonte-Padilla, Irene</creatorcontrib><creatorcontrib>Milinski, Manfred</creatorcontrib><creatorcontrib>Lenz, Tobias</creatorcontrib><creatorcontrib>Eizaguirre, Christophe</creatorcontrib><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Thorburn, Doko-Miles</au><au>Sagonas, Kostas</au><au>Binzer-Panchal, Mahesh</au><au>Chain, Frederic</au><au>Feulner, Philine</au><au>Bornberg-Bauer, Erich</au><au>Reusch, Thorsten</au><au>Samonte-Padilla, Irene</au><au>Milinski, Manfred</au><au>Lenz, Tobias</au><au>Eizaguirre, Christophe</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>Gasterosteus aculeatus gynogenetic reference genome and functional annotations version 1 and raw PacBio and Illumina data</title><date>2023-01-20</date><risdate>2023</risdate><abstract>Whole genome sequencing enables us to ask fundamental questions about the genetic basis of adaptation, population structure, and epigenetic mechanisms, but usually requires a suitable reference genome for making sense of the sequence data. While the availability of reference genomes has significantly improvement in both taxonomic coverage and overall quality, this poses a challenge for researchers in determining which reference genome best suits their data. Here we compare the use of two different reference genomes for the three-spined stickleback (Gasterosteus aculeatus), one novel genome from a European individual and the published reference genome of a North American individual. Specifically, we investigate the impact of using a local reference versus one generated from a differentiated population on several commonly used metrics in population genomics. Through mapping genome resequencing data of 60 sticklebacks from across Europe and North America, we confirmed genome quality is an important factor in choosing a reference genome. A local reference genome did offer increased mapping efficiency and genotyping accuracy, likely stemming from the higher similarity in genome sequence and synteny. Despite comparable distributions of the metrics generated across the genome using SNP data (i.e., π, Tajima’s D, and FST), window-based statistics using different references resulted in different outlier genes and enriched gene functions. In contrast, the marker-based analysis utilising DNA methylation distributions had a considerably higher overlap in outlier genes and functions when using different reference genomes. Overall, our results highlight how using a local reference genome can increase the resolution of genome scans when multiple similar-quality reference genomes are available. Such results have implications in the detection of signatures of selection.</abstract><pub>Dryad</pub><doi>10.5061/dryad.ncjsxksx8</doi><orcidid>https://orcid.org/0000-0002-0120-8829</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.5061/dryad.ncjsxksx8
ispartof
issn
language eng
recordid cdi_datacite_primary_10_5061_dryad_ncjsxksx8
source DataCite
subjects FOS: Biological sciences
genome assembly
gynogenetic
Reference genomes
reference mapping bias
stickleback
title Gasterosteus aculeatus gynogenetic reference genome and functional annotations version 1 and raw PacBio and Illumina data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T12%3A47%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Thorburn,%20Doko-Miles&rft.date=2023-01-20&rft_id=info:doi/10.5061/dryad.ncjsxksx8&rft_dat=%3Cdatacite_PQ8%3E10_5061_dryad_ncjsxksx8%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true