Supporting data for "Long-read HiFi sequencing correctly assembles repetitive heavy fibroin silk genes in new moth and caddisfly genomes"
Insect silk is an incredibly versatile biomaterial. Lepidoptera and their sister lineage, Trichoptera, display some of the most diverse uses of silk with varying strength, adhesive qualities and elastic properties. It is well known that silk fibroin genes are long (> 20 kb) and have many repetiti...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , , , , , , , |
---|---|
Format: | Dataset |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Akito, Kawahara Y Caroline, Storer G Markee Amanda Jacqueline, Heckenhauer Ashlyn, Powell David, Plotkin Scott, Hotaling Timothy, Cleland P Dikow B Rebecca Torsten, Dikow Ryoichi, Kuranishi B Messcher Rebeccah Steffen, Pauls U Russell, Stewart J Koji, Tojo Paul, Frandsen B |
description | Insect silk is an incredibly versatile biomaterial. Lepidoptera and their sister lineage, Trichoptera, display some of the most diverse uses of silk with varying strength, adhesive qualities and elastic properties. It is well known that silk fibroin genes are long (> 20 kb) and have many repetitive motifs. These features make these genes challenging to sequence. Most research thus far has focused on conserved N- and C-terminal regions of fibroin genes because a full comparison of repetitive regions across taxa has not been possible. Using the PacBio Sequel II system and SMRT sequencing, we generated high fidelity (HiFi) long-read genomic and transcriptomic sequences for the Indianmeal moth (Plodia interpunctella) and genomic sequences for the caddisfly, Eubasilissa regina. Both genomes were highly contiguous (N50 = 9.7 Mbp/32.4 Mbp, L50 = 13/11) and complete (BUSCO Complete = 99.3%/95.5%), with complete and contiguous recovery of silk heavy fibroin gene sequences. This study demonstrates that HiFi long-read sequencing can significantly help our understanding of genes with highly contiguous, repetitive regions. |
doi_str_mv | 10.5524/102229 |
format | Dataset |
fullrecord | <record><control><sourceid>datacite_PQ8</sourceid><recordid>TN_cdi_datacite_primary_10_5524_102229</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_5524_102229</sourcerecordid><originalsourceid>FETCH-datacite_primary_10_5524_1022293</originalsourceid><addsrcrecordid>eNqNj70OgkAQhK-xMP48w4bCDgXUwtpoLOy0J-vdghvhDvdODY_gWwvRB7CaTObLZEapaZrM1-tstUiTLMs2Q_U-PZrGSWBbgsGAUDiB6OhsGQuhgQPvGTzdH2R1z2gnQjpULaD3VF8q8iDUUODAT4Ir4bOFgi_i2ILn6gYl2Y7pnKUX1C5cAa0BjcawL7qeLnc1-WisBgVWniY_HanZfnfeHuJ-luZAeSNco7R5muT9h_z7Yfk3-AHVpFKo</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>Supporting data for "Long-read HiFi sequencing correctly assembles repetitive heavy fibroin silk genes in new moth and caddisfly genomes"</title><source>DataCite</source><creator>Akito, Kawahara Y ; Caroline, Storer G ; Markee Amanda ; Jacqueline, Heckenhauer ; Ashlyn, Powell ; David, Plotkin ; Scott, Hotaling ; Timothy, Cleland P ; Dikow B Rebecca ; Torsten, Dikow ; Ryoichi, Kuranishi B ; Messcher Rebeccah ; Steffen, Pauls U ; Russell, Stewart J ; Koji, Tojo ; Paul, Frandsen B</creator><creatorcontrib>Akito, Kawahara Y ; Caroline, Storer G ; Markee Amanda ; Jacqueline, Heckenhauer ; Ashlyn, Powell ; David, Plotkin ; Scott, Hotaling ; Timothy, Cleland P ; Dikow B Rebecca ; Torsten, Dikow ; Ryoichi, Kuranishi B ; Messcher Rebeccah ; Steffen, Pauls U ; Russell, Stewart J ; Koji, Tojo ; Paul, Frandsen B</creatorcontrib><description>Insect silk is an incredibly versatile biomaterial. Lepidoptera and their sister lineage, Trichoptera, display some of the most diverse uses of silk with varying strength, adhesive qualities and elastic properties. It is well known that silk fibroin genes are long (> 20 kb) and have many repetitive motifs. These features make these genes challenging to sequence. Most research thus far has focused on conserved N- and C-terminal regions of fibroin genes because a full comparison of repetitive regions across taxa has not been possible. Using the PacBio Sequel II system and SMRT sequencing, we generated high fidelity (HiFi) long-read genomic and transcriptomic sequences for the Indianmeal moth (Plodia interpunctella) and genomic sequences for the caddisfly, Eubasilissa regina. Both genomes were highly contiguous (N50 = 9.7 Mbp/32.4 Mbp, L50 = 13/11) and complete (BUSCO Complete = 99.3%/95.5%), with complete and contiguous recovery of silk heavy fibroin gene sequences. This study demonstrates that HiFi long-read sequencing can significantly help our understanding of genes with highly contiguous, repetitive regions.</description><identifier>DOI: 10.5524/102229</identifier><language>eng</language><publisher>GigaScience Database</publisher><subject>caddisfly ; data description ; Genomic ; indianmeal moth ; lepidoptera ; moth ; pacbio ; Software ; transcriptome ; Transcriptomic ; trichoptera</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0002-3724-4610 ; 0000-0001-8771-9154 ; 0000-0001-9198-2828 ; 0000-0002-6451-3425 ; 0000-0002-9362-604X ; 0000-0003-4816-2909 ; 0000-0002-0349-0653 ; 0000-0002-2339-655X ; 0000-0002-6353-0450 ; 0000-0002-4801-7579 ; 0000-0002-8389-8877 ; 0000-0002-5965-0986</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,1887</link.rule.ids><linktorsrc>$$Uhttps://commons.datacite.org/doi.org/10.5524/102229$$EView_record_in_DataCite.org$$FView_record_in_$$GDataCite.org$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Akito, Kawahara Y</creatorcontrib><creatorcontrib>Caroline, Storer G</creatorcontrib><creatorcontrib>Markee Amanda</creatorcontrib><creatorcontrib>Jacqueline, Heckenhauer</creatorcontrib><creatorcontrib>Ashlyn, Powell</creatorcontrib><creatorcontrib>David, Plotkin</creatorcontrib><creatorcontrib>Scott, Hotaling</creatorcontrib><creatorcontrib>Timothy, Cleland P</creatorcontrib><creatorcontrib>Dikow B Rebecca</creatorcontrib><creatorcontrib>Torsten, Dikow</creatorcontrib><creatorcontrib>Ryoichi, Kuranishi B</creatorcontrib><creatorcontrib>Messcher Rebeccah</creatorcontrib><creatorcontrib>Steffen, Pauls U</creatorcontrib><creatorcontrib>Russell, Stewart J</creatorcontrib><creatorcontrib>Koji, Tojo</creatorcontrib><creatorcontrib>Paul, Frandsen B</creatorcontrib><title>Supporting data for "Long-read HiFi sequencing correctly assembles repetitive heavy fibroin silk genes in new moth and caddisfly genomes"</title><description>Insect silk is an incredibly versatile biomaterial. Lepidoptera and their sister lineage, Trichoptera, display some of the most diverse uses of silk with varying strength, adhesive qualities and elastic properties. It is well known that silk fibroin genes are long (> 20 kb) and have many repetitive motifs. These features make these genes challenging to sequence. Most research thus far has focused on conserved N- and C-terminal regions of fibroin genes because a full comparison of repetitive regions across taxa has not been possible. Using the PacBio Sequel II system and SMRT sequencing, we generated high fidelity (HiFi) long-read genomic and transcriptomic sequences for the Indianmeal moth (Plodia interpunctella) and genomic sequences for the caddisfly, Eubasilissa regina. Both genomes were highly contiguous (N50 = 9.7 Mbp/32.4 Mbp, L50 = 13/11) and complete (BUSCO Complete = 99.3%/95.5%), with complete and contiguous recovery of silk heavy fibroin gene sequences. This study demonstrates that HiFi long-read sequencing can significantly help our understanding of genes with highly contiguous, repetitive regions.</description><subject>caddisfly</subject><subject>data description</subject><subject>Genomic</subject><subject>indianmeal moth</subject><subject>lepidoptera</subject><subject>moth</subject><subject>pacbio</subject><subject>Software</subject><subject>transcriptome</subject><subject>Transcriptomic</subject><subject>trichoptera</subject><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2022</creationdate><recordtype>dataset</recordtype><sourceid>PQ8</sourceid><recordid>eNqNj70OgkAQhK-xMP48w4bCDgXUwtpoLOy0J-vdghvhDvdODY_gWwvRB7CaTObLZEapaZrM1-tstUiTLMs2Q_U-PZrGSWBbgsGAUDiB6OhsGQuhgQPvGTzdH2R1z2gnQjpULaD3VF8q8iDUUODAT4Ir4bOFgi_i2ILn6gYl2Y7pnKUX1C5cAa0BjcawL7qeLnc1-WisBgVWniY_HanZfnfeHuJ-luZAeSNco7R5muT9h_z7Yfk3-AHVpFKo</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Akito, Kawahara Y</creator><creator>Caroline, Storer G</creator><creator>Markee Amanda</creator><creator>Jacqueline, Heckenhauer</creator><creator>Ashlyn, Powell</creator><creator>David, Plotkin</creator><creator>Scott, Hotaling</creator><creator>Timothy, Cleland P</creator><creator>Dikow B Rebecca</creator><creator>Torsten, Dikow</creator><creator>Ryoichi, Kuranishi B</creator><creator>Messcher Rebeccah</creator><creator>Steffen, Pauls U</creator><creator>Russell, Stewart J</creator><creator>Koji, Tojo</creator><creator>Paul, Frandsen B</creator><general>GigaScience Database</general><scope>DYCCY</scope><scope>PQ8</scope><orcidid>https://orcid.org/0000-0002-3724-4610</orcidid><orcidid>https://orcid.org/0000-0001-8771-9154</orcidid><orcidid>https://orcid.org/0000-0001-9198-2828</orcidid><orcidid>https://orcid.org/0000-0002-6451-3425</orcidid><orcidid>https://orcid.org/0000-0002-9362-604X</orcidid><orcidid>https://orcid.org/0000-0003-4816-2909</orcidid><orcidid>https://orcid.org/0000-0002-0349-0653</orcidid><orcidid>https://orcid.org/0000-0002-2339-655X</orcidid><orcidid>https://orcid.org/0000-0002-6353-0450</orcidid><orcidid>https://orcid.org/0000-0002-4801-7579</orcidid><orcidid>https://orcid.org/0000-0002-8389-8877</orcidid><orcidid>https://orcid.org/0000-0002-5965-0986</orcidid></search><sort><creationdate>2022</creationdate><title>Supporting data for "Long-read HiFi sequencing correctly assembles repetitive heavy fibroin silk genes in new moth and caddisfly genomes"</title><author>Akito, Kawahara Y ; Caroline, Storer G ; Markee Amanda ; Jacqueline, Heckenhauer ; Ashlyn, Powell ; David, Plotkin ; Scott, Hotaling ; Timothy, Cleland P ; Dikow B Rebecca ; Torsten, Dikow ; Ryoichi, Kuranishi B ; Messcher Rebeccah ; Steffen, Pauls U ; Russell, Stewart J ; Koji, Tojo ; Paul, Frandsen B</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-datacite_primary_10_5524_1022293</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2022</creationdate><topic>caddisfly</topic><topic>data description</topic><topic>Genomic</topic><topic>indianmeal moth</topic><topic>lepidoptera</topic><topic>moth</topic><topic>pacbio</topic><topic>Software</topic><topic>transcriptome</topic><topic>Transcriptomic</topic><topic>trichoptera</topic><toplevel>online_resources</toplevel><creatorcontrib>Akito, Kawahara Y</creatorcontrib><creatorcontrib>Caroline, Storer G</creatorcontrib><creatorcontrib>Markee Amanda</creatorcontrib><creatorcontrib>Jacqueline, Heckenhauer</creatorcontrib><creatorcontrib>Ashlyn, Powell</creatorcontrib><creatorcontrib>David, Plotkin</creatorcontrib><creatorcontrib>Scott, Hotaling</creatorcontrib><creatorcontrib>Timothy, Cleland P</creatorcontrib><creatorcontrib>Dikow B Rebecca</creatorcontrib><creatorcontrib>Torsten, Dikow</creatorcontrib><creatorcontrib>Ryoichi, Kuranishi B</creatorcontrib><creatorcontrib>Messcher Rebeccah</creatorcontrib><creatorcontrib>Steffen, Pauls U</creatorcontrib><creatorcontrib>Russell, Stewart J</creatorcontrib><creatorcontrib>Koji, Tojo</creatorcontrib><creatorcontrib>Paul, Frandsen B</creatorcontrib><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Akito, Kawahara Y</au><au>Caroline, Storer G</au><au>Markee Amanda</au><au>Jacqueline, Heckenhauer</au><au>Ashlyn, Powell</au><au>David, Plotkin</au><au>Scott, Hotaling</au><au>Timothy, Cleland P</au><au>Dikow B Rebecca</au><au>Torsten, Dikow</au><au>Ryoichi, Kuranishi B</au><au>Messcher Rebeccah</au><au>Steffen, Pauls U</au><au>Russell, Stewart J</au><au>Koji, Tojo</au><au>Paul, Frandsen B</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>Supporting data for "Long-read HiFi sequencing correctly assembles repetitive heavy fibroin silk genes in new moth and caddisfly genomes"</title><date>2022</date><risdate>2022</risdate><abstract>Insect silk is an incredibly versatile biomaterial. Lepidoptera and their sister lineage, Trichoptera, display some of the most diverse uses of silk with varying strength, adhesive qualities and elastic properties. It is well known that silk fibroin genes are long (> 20 kb) and have many repetitive motifs. These features make these genes challenging to sequence. Most research thus far has focused on conserved N- and C-terminal regions of fibroin genes because a full comparison of repetitive regions across taxa has not been possible. Using the PacBio Sequel II system and SMRT sequencing, we generated high fidelity (HiFi) long-read genomic and transcriptomic sequences for the Indianmeal moth (Plodia interpunctella) and genomic sequences for the caddisfly, Eubasilissa regina. Both genomes were highly contiguous (N50 = 9.7 Mbp/32.4 Mbp, L50 = 13/11) and complete (BUSCO Complete = 99.3%/95.5%), with complete and contiguous recovery of silk heavy fibroin gene sequences. This study demonstrates that HiFi long-read sequencing can significantly help our understanding of genes with highly contiguous, repetitive regions.</abstract><pub>GigaScience Database</pub><doi>10.5524/102229</doi><orcidid>https://orcid.org/0000-0002-3724-4610</orcidid><orcidid>https://orcid.org/0000-0001-8771-9154</orcidid><orcidid>https://orcid.org/0000-0001-9198-2828</orcidid><orcidid>https://orcid.org/0000-0002-6451-3425</orcidid><orcidid>https://orcid.org/0000-0002-9362-604X</orcidid><orcidid>https://orcid.org/0000-0003-4816-2909</orcidid><orcidid>https://orcid.org/0000-0002-0349-0653</orcidid><orcidid>https://orcid.org/0000-0002-2339-655X</orcidid><orcidid>https://orcid.org/0000-0002-6353-0450</orcidid><orcidid>https://orcid.org/0000-0002-4801-7579</orcidid><orcidid>https://orcid.org/0000-0002-8389-8877</orcidid><orcidid>https://orcid.org/0000-0002-5965-0986</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.5524/102229 |
ispartof | |
issn | |
language | eng |
recordid | cdi_datacite_primary_10_5524_102229 |
source | DataCite |
subjects | caddisfly data description Genomic indianmeal moth lepidoptera moth pacbio Software transcriptome Transcriptomic trichoptera |
title | Supporting data for "Long-read HiFi sequencing correctly assembles repetitive heavy fibroin silk genes in new moth and caddisfly genomes" |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T00%3A02%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Akito,%20Kawahara%20Y&rft.date=2022&rft_id=info:doi/10.5524/102229&rft_dat=%3Cdatacite_PQ8%3E10_5524_102229%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |