An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals

Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we presen...

Bibliographic details
Published in: Royal Society open science 2018-02, Vol.5 (2), p.171589-171589
Main authors: Li, Yu-Long, Xue, Dong-Xiu, Zhang, Bai-Dong, Liu, Jin-Xian
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 171589
container_issue 2
container_start_page 171589
container_title Royal Society open science
container_volume 5
creator Li, Yu-Long
Xue, Dong-Xiu
Zhang, Bai-Dong
Liu, Jin-Xian
description Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we present an efficient optimized approach with a pipeline software, RADassembler, which makes full use of paired-end RAD reads with random-sheared ends from multiple individuals to assemble RAD contigs. RADassembler integrates the algorithms for choosing the optimal number of mismatches within and across individuals at the clustering stage, and then uses a two-step assembly approach at the assembly stage. RADassembler also uses data reduction and parallelization strategies to promote efficiency. Compared to other tools, both the assembly results based on simulation and real RAD datasets demonstrated that RADassembler could always assemble the appropriate number of contigs with high qualities, and more read pairs were properly mapped to the assembled contigs. This approach provides an optimal tool for dealing with the complexity in the assembly of paired-end RAD reads with random-sheared ends for non-model species in ecological, evolutionary and conservation studies. RADassembler is available at https://github.com/lyl8086/RADscripts.
doi_str_mv 10.1098/rsos.171589
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1098_rsos_171589</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_f033b580999c454ab53bcb0f78de6740</doaj_id><sourcerecordid>2012116542</sourcerecordid><originalsourceid>FETCH-LOGICAL-c572t-5915520ca465c8853b88ab2786aeaf85e8d20f8f2ae826128efc04f5085c05853</originalsourceid><addsrcrecordid>eNp9kU1r3DAQhk1paUKaU-9Fx0LZdCRbtnwpLEk_AoFA0p56ELI82miRLVeyDZtfX22dht1Dexp9PPOM0JtlbylcUKjFxxB9vKAV5aJ-kZ0y4MWKV5C_PFifZOcxbgGAcsirsnqdnbCap46Knmb9uid-GG1nH7ElahiCV_qBGB-I81o50iLp_eyJihG7xu2IN8TPGFxibb8hg7IB2xX2LblbX5GAqo3EBN-RbnKjHRwS27d2tu2kXHyTvTKp4PlTPct-fPn8_fLb6ub26_Xl-malecXGFa8p5wy0KkquheB5I4RqWCVKhcoIjqJlYIRhCgUrKRNoNBSGg-AaeOLPsuvF23q1lUOwnQo76ZWVfw582EgVRqsdSgN53nABdV3rgheqSdN0A6YSLZZVAcn1aXENU9Nhq7Efg3JH0uOb3j7IjZ8lFzlU5V7w_kkQ_K8J4yg7GzU6p3r0U5QMKKO05AVL6IcF1cHHGNA8j6Eg94HLfeByCTzR7w5f9sz-jTcBsADB79J3e21x3Mmtn0Kftv9w_vxfy9397f3MLZMgcgqcCZrLRzssCi5tjBNKdmg8sv8GXmTYMw</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2012116542</pqid></control><display><type>article</type><title>An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals</title><source>DOAJ Directory of Open Access Journals</source><source>Royal Society Open Access Journals</source><source>PubMed Central Open Access</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Li, Yu-Long ; Xue, Dong-Xiu ; Zhang, Bai-Dong ; Liu, Jin-Xian</creator><creatorcontrib>Li, Yu-Long ; Xue, Dong-Xiu ; Zhang, Bai-Dong ; Liu, Jin-Xian</creatorcontrib><description>Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. 
However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we present an efficient optimized approach with a pipeline software, RADassembler, which makes full use of paired-end RAD reads with random-sheared ends from multiple individuals to assemble RAD contigs. RADassembler integrates the algorithms for choosing the optimal number of mismatches within and across individuals at the clustering stage, and then uses a two-step assembly approach at the assembly stage. RADassembler also uses data reduction and parallelization strategies to promote efficiency. Compared to other tools, both the assembly results based on simulation and real RAD datasets demonstrated that RADassembler could always assemble the appropriate number of contigs with high qualities, and more read pairs were properly mapped to the assembled contigs. This approach provides an optimal tool for dealing with the complexity in the assembly of paired-end RAD reads with random-sheared ends for non-model species in ecological, evolutionary and conservation studies. RADassembler is available at https://github.com/lyl8086/RADscripts.</description><identifier>ISSN: 2054-5703</identifier><identifier>EISSN: 2054-5703</identifier><identifier>DOI: 10.1098/rsos.171589</identifier><identifier>PMID: 29515871</identifier><language>eng</language><publisher>England: The Royal Society Publishing</publisher><subject>Genetics ; Optimal Clustering ; Optimized Assembly ; Overlapping Paired-End Sequencing ; Pipeline Software ; Rad-Seq</subject><ispartof>Royal Society open science, 2018-02, Vol.5 (2), p.171589-171589</ispartof><rights>2018 The Authors.</rights><rights>2018 The Authors. 
2018</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c572t-5915520ca465c8853b88ab2786aeaf85e8d20f8f2ae826128efc04f5085c05853</citedby><cites>FETCH-LOGICAL-c572t-5915520ca465c8853b88ab2786aeaf85e8d20f8f2ae826128efc04f5085c05853</cites><orcidid>0000-0002-1852-9469</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5830760/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5830760/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,2100,3320,27145,27922,27923,53789,53791,55553,55563</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29515871$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Li, Yu-Long</creatorcontrib><creatorcontrib>Xue, Dong-Xiu</creatorcontrib><creatorcontrib>Zhang, Bai-Dong</creatorcontrib><creatorcontrib>Liu, Jin-Xian</creatorcontrib><title>An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals</title><title>Royal Society open science</title><addtitle>R. Soc. open sci</addtitle><addtitle>R Soc Open Sci</addtitle><description>Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we present an efficient optimized approach with a pipeline software, RADassembler, which makes full use of paired-end RAD reads with random-sheared ends from multiple individuals to assemble RAD contigs. 
RADassembler integrates the algorithms for choosing the optimal number of mismatches within and across individuals at the clustering stage, and then uses a two-step assembly approach at the assembly stage. RADassembler also uses data reduction and parallelization strategies to promote efficiency. Compared to other tools, both the assembly results based on simulation and real RAD datasets demonstrated that RADassembler could always assemble the appropriate number of contigs with high qualities, and more read pairs were properly mapped to the assembled contigs. This approach provides an optimal tool for dealing with the complexity in the assembly of paired-end RAD reads with random-sheared ends for non-model species in ecological, evolutionary and conservation studies. RADassembler is available at https://github.com/lyl8086/RADscripts.</description><subject>Genetics</subject><subject>Optimal Clustering</subject><subject>Optimized Assembly</subject><subject>Overlapping Paired-End Sequencing</subject><subject>Pipeline Software</subject><subject>Rad-Seq</subject><issn>2054-5703</issn><issn>2054-5703</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>DOA</sourceid><recordid>eNp9kU1r3DAQhk1paUKaU-9Fx0LZdCRbtnwpLEk_AoFA0p56ELI82miRLVeyDZtfX22dht1Dexp9PPOM0JtlbylcUKjFxxB9vKAV5aJ-kZ0y4MWKV5C_PFifZOcxbgGAcsirsnqdnbCap46Knmb9uid-GG1nH7ElahiCV_qBGB-I81o50iLp_eyJihG7xu2IN8TPGFxibb8hg7IB2xX2LblbX5GAqo3EBN-RbnKjHRwS27d2tu2kXHyTvTKp4PlTPct-fPn8_fLb6ub26_Xl-malecXGFa8p5wy0KkquheB5I4RqWCVKhcoIjqJlYIRhCgUrKRNoNBSGg-AaeOLPsuvF23q1lUOwnQo76ZWVfw582EgVRqsdSgN53nABdV3rgheqSdN0A6YSLZZVAcn1aXENU9Nhq7Efg3JH0uOb3j7IjZ8lFzlU5V7w_kkQ_K8J4yg7GzU6p3r0U5QMKKO05AVL6IcF1cHHGNA8j6Eg94HLfeByCTzR7w5f9sz-jTcBsADB79J3e21x3Mmtn0Kftv9w_vxfy9397f3MLZMgcgqcCZrLRzssCi5tjBNKdmg8sv8GXmTYMw</recordid><startdate>20180201</startdate><enddate>20180201</enddate><creator>Li, Yu-Long</creator><creator>Xue, 
Dong-Xiu</creator><creator>Zhang, Bai-Dong</creator><creator>Liu, Jin-Xian</creator><general>The Royal Society Publishing</general><general>The Royal Society</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-1852-9469</orcidid></search><sort><creationdate>20180201</creationdate><title>An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals</title><author>Li, Yu-Long ; Xue, Dong-Xiu ; Zhang, Bai-Dong ; Liu, Jin-Xian</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c572t-5915520ca465c8853b88ab2786aeaf85e8d20f8f2ae826128efc04f5085c05853</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Genetics</topic><topic>Optimal Clustering</topic><topic>Optimized Assembly</topic><topic>Overlapping Paired-End Sequencing</topic><topic>Pipeline Software</topic><topic>Rad-Seq</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Yu-Long</creatorcontrib><creatorcontrib>Xue, Dong-Xiu</creatorcontrib><creatorcontrib>Zhang, Bai-Dong</creatorcontrib><creatorcontrib>Liu, Jin-Xian</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>Royal Society open science</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Yu-Long</au><au>Xue, Dong-Xiu</au><au>Zhang, Bai-Dong</au><au>Liu, Jin-Xian</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple 
individuals</atitle><jtitle>Royal Society open science</jtitle><stitle>R. Soc. open sci</stitle><addtitle>R Soc Open Sci</addtitle><date>2018-02-01</date><risdate>2018</risdate><volume>5</volume><issue>2</issue><spage>171589</spage><epage>171589</epage><pages>171589-171589</pages><issn>2054-5703</issn><eissn>2054-5703</eissn><abstract>Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we present an efficient optimized approach with a pipeline software, RADassembler, which makes full use of paired-end RAD reads with random-sheared ends from multiple individuals to assemble RAD contigs. RADassembler integrates the algorithms for choosing the optimal number of mismatches within and across individuals at the clustering stage, and then uses a two-step assembly approach at the assembly stage. RADassembler also uses data reduction and parallelization strategies to promote efficiency. Compared to other tools, both the assembly results based on simulation and real RAD datasets demonstrated that RADassembler could always assemble the appropriate number of contigs with high qualities, and more read pairs were properly mapped to the assembled contigs. This approach provides an optimal tool for dealing with the complexity in the assembly of paired-end RAD reads with random-sheared ends for non-model species in ecological, evolutionary and conservation studies. RADassembler is available at https://github.com/lyl8086/RADscripts.</abstract><cop>England</cop><pub>The Royal Society Publishing</pub><pmid>29515871</pmid><doi>10.1098/rsos.171589</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0002-1852-9469</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2054-5703
ispartof Royal Society open science, 2018-02, Vol.5 (2), p.171589-171589
issn 2054-5703
2054-5703
language eng
recordid cdi_crossref_primary_10_1098_rsos_171589
source DOAJ Directory of Open Access Journals; Royal Society Open Access Journals; PubMed Central Open Access; EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects Genetics
Optimal Clustering
Optimized Assembly
Overlapping Paired-End Sequencing
Pipeline Software
Rad-Seq
title An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T03%3A47%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20optimized%20approach%20for%20local%20de%20novo%20assembly%20of%20overlapping%20paired-end%20RAD%20reads%20from%20multiple%20individuals&rft.jtitle=Royal%20Society%20open%20science&rft.au=Li,%20Yu-Long&rft.date=2018-02-01&rft.volume=5&rft.issue=2&rft.spage=171589&rft.epage=171589&rft.pages=171589-171589&rft.issn=2054-5703&rft.eissn=2054-5703&rft_id=info:doi/10.1098/rsos.171589&rft_dat=%3Cproquest_cross%3E2012116542%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2012116542&rft_id=info:pmid/29515871&rft_doaj_id=oai_doaj_org_article_f033b580999c454ab53bcb0f78de6740&rfr_iscdi=true