Combining independent de novo assemblies to optimize leaf transcriptome of Persian walnut
Transcriptome resources can facilitate to increase yield and quality of walnuts. Finding the best transcriptome assembly has not been the subject of walnuts research as yet. This research generated 240,179,782 reads from 11 walnut leaves according to cDNA libraries. The reads provided a complete de...
Gespeichert in:
Veröffentlicht in: | PloS one 2020-04, Vol.15 (4), p.e0232005-e0232005 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | e0232005 |
---|---|
container_issue | 4 |
container_start_page | e0232005 |
container_title | PloS one |
container_volume | 15 |
creator | Sadat-Hosseini, Mohammad Bakhtiarizadeh, Mohammad Reza Boroomand, Naser Tohidfar, Masoud Vahdati, Kourosh |
description | Transcriptome resources can facilitate to increase yield and quality of walnuts. Finding the best transcriptome assembly has not been the subject of walnuts research as yet. This research generated 240,179,782 reads from 11 walnut leaves according to cDNA libraries. The reads provided a complete de novo transcriptome assembly. Fifteen different transcriptome assemblies were constructed from five different well-known assemblers used in scientific literature with different k-mer lengths (Bridger, BinPacker, SOAPdenovo-Trans, Trinity and SPAdes) as well as two merging approaches (EvidentialGene and Transfuse). Based on the four quality metrics of assembly, the results indicated an efficiency in the process of merging the assemblies after being generated by de novo assemblers. Finally, EvidentialGene was recognized as the best assembler for the de novo assembly of the leaf transcriptome in walnut. Among a total number of 183,191 transcripts which were generated by EvidentialGene, there were 109,413 transcripts capable of protein potential (59.72%) and 104,926 were recognized as ORFs (57.27%). In addition, 79,185 transcripts were predicted to exist with at least one hit to the Pfam database. A number of 3,931 transcription factors were identified by BLAST searching against PlnTFDB. Furthermore, 6,591 of the predicted peptide sequences contained signaling peptides, while 92,704 contained transmembrane domains. Comparison of the assembled transcripts with transcripts of the walnut and published genome assembly for the 'Chandler' cultivar using the BLAST algorithm led to identify a total number of 27,304 and 19,178 homologue transcripts, respectively. De novo transcriptomes in walnut leaves can be developed for the future studies in functional genomics and genetic studies of walnuts. |
doi_str_mv | 10.1371/journal.pone.0232005 |
format | Article |
fullrecord | <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_2395605835</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A622217971</galeid><doaj_id>oai_doaj_org_article_2c0b83e449984da0a669d1ff11ddbe29</doaj_id><sourcerecordid>A622217971</sourcerecordid><originalsourceid>FETCH-LOGICAL-c692t-d6ecc9e8a60e2b19b38df72c381a66a32d54e874c6008b8e6ebf274602235bb03</originalsourceid><addsrcrecordid>eNqNk9tq3DAQhk1padK0b1BaQ6G0F7vVwZblm0JYelgIpPQEvRKyPN5VkCVHktPD01ebdcK65KIIJDH65p_RSJNlTzFaYlrhNxdu9Faa5eAsLBGhBKHyXnaMa0oWjCB6_2B_lD0K4SIBlDP2MDuihBa0ovQ4-7FyfaOttptc2xYGSJONeQu5dVculyFA3xgNIY8ud0PUvf4DuQHZ5dFLG5TXQ3Q95K7LP4EPWtr8pzR2jI-zB500AZ5M60n27f27r6uPi7PzD-vV6dlCsZrERctAqRq4ZAhIg-uG8rariKIcS8YkJW1ZAK8KxRDiDQcGTUeqgiFCaNk0iJ5kz_e6g3FBTFUJgtC6ZKjktEzEek-0Tl6Iwete-t_CSS2uDc5vhPRRKwOCKNRwCkVR17xoJUop1C3uOozbtgFSJ623U7Sx6aFVqVhempno_MTqrdi4K1FhzgknSeDVJODd5Qghil4HBcZIC268zptRVNGqSOiLf9C7bzdRG5kuoG3nUly1ExWnjBCCq7rCiVreQaXRQq9V-kKdTvaZw-uZQ2Ii_IobOYYg1l8-_z97_n3OvjxgtyBN3AZnxqidDXOw2IPKuxA8dLdFxkjsOuCmGmLXAWLqgOT27PCBbp1uvjz9C8DpAQg</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2395605835</pqid></control><display><type>article</type><title>Combining independent de novo assemblies to optimize leaf transcriptome of Persian walnut</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Public Library of Science (PLoS) Journals Open Access</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><creator>Sadat-Hosseini, Mohammad ; Bakhtiarizadeh, Mohammad Reza ; Boroomand, Naser ; Tohidfar, Masoud ; Vahdati, Kourosh</creator><contributor>Kumar, Shailesh</contributor><creatorcontrib>Sadat-Hosseini, Mohammad ; Bakhtiarizadeh, Mohammad Reza ; Boroomand, Naser ; Tohidfar, Masoud ; Vahdati, Kourosh ; Kumar, Shailesh</creatorcontrib><description>Transcriptome resources can facilitate to increase yield and quality of walnuts. Finding the best transcriptome assembly has not been the subject of walnuts research as yet. This research generated 240,179,782 reads from 11 walnut leaves according to cDNA libraries. The reads provided a complete de novo transcriptome assembly. Fifteen different transcriptome assemblies were constructed from five different well-known assemblers used in scientific literature with different k-mer lengths (Bridger, BinPacker, SOAPdenovo-Trans, Trinity and SPAdes) as well as two merging approaches (EvidentialGene and Transfuse). Based on the four quality metrics of assembly, the results indicated an efficiency in the process of merging the assemblies after being generated by de novo assemblers. Finally, EvidentialGene was recognized as the best assembler for the de novo assembly of the leaf transcriptome in walnut. Among a total number of 183,191 transcripts which were generated by EvidentialGene, there were 109,413 transcripts capable of protein potential (59.72%) and 104,926 were recognized as ORFs (57.27%). In addition, 79,185 transcripts were predicted to exist with at least one hit to the Pfam database. A number of 3,931 transcription factors were identified by BLAST searching against PlnTFDB. Furthermore, 6,591 of the predicted peptide sequences contained signaling peptides, while 92,704 contained transmembrane domains. Comparison of the assembled transcripts with transcripts of the walnut and published genome assembly for the 'Chandler' cultivar using the BLAST algorithm led to identify a total number of 27,304 and 19,178 homologue transcripts, respectively. De novo transcriptomes in walnut leaves can be developed for the future studies in functional genomics and genetic studies of walnuts.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0232005</identifier><identifier>PMID: 32343733</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Algorithms ; Analysis ; Arthropods ; Assemblies ; Assembly ; Biology and Life Sciences ; Computational Biology - methods ; Contig Mapping - methods ; Crop yields ; Cultivars ; DNA binding proteins ; Explosions ; Future predictions ; Gene expression ; Gene Expression Profiling - methods ; Gene Library ; Genomes ; Genomic libraries ; Genomics ; High-Throughput Nucleotide Sequencing - methods ; Homology ; Horticulture ; Juglans - genetics ; Leaves ; Molecular Sequence Annotation ; Peptides ; Plant Leaves - genetics ; Plant Proteins - genetics ; Research and Analysis Methods ; Researchers ; Science literature ; Sequence Analysis, RNA - methods ; Soil sciences ; Studies ; Transcription factors ; Transmembrane domains ; Walnuts</subject><ispartof>PloS one, 2020-04, Vol.15 (4), p.e0232005-e0232005</ispartof><rights>COPYRIGHT 2020 Public Library of Science</rights><rights>2020 Sadat-Hosseini et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2020 Sadat-Hosseini et al 2020 Sadat-Hosseini et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c692t-d6ecc9e8a60e2b19b38df72c381a66a32d54e874c6008b8e6ebf274602235bb03</citedby><cites>FETCH-LOGICAL-c692t-d6ecc9e8a60e2b19b38df72c381a66a32d54e874c6008b8e6ebf274602235bb03</cites><orcidid>0000-0001-8458-8697</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7188282/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7188282/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,2102,2928,23866,27924,27925,53791,53793,79600,79601</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32343733$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Kumar, Shailesh</contributor><creatorcontrib>Sadat-Hosseini, Mohammad</creatorcontrib><creatorcontrib>Bakhtiarizadeh, Mohammad Reza</creatorcontrib><creatorcontrib>Boroomand, Naser</creatorcontrib><creatorcontrib>Tohidfar, Masoud</creatorcontrib><creatorcontrib>Vahdati, Kourosh</creatorcontrib><title>Combining independent de novo assemblies to optimize leaf transcriptome of Persian walnut</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>Transcriptome resources can facilitate to increase yield and quality of walnuts. Finding the best transcriptome assembly has not been the subject of walnuts research as yet. This research generated 240,179,782 reads from 11 walnut leaves according to cDNA libraries. The reads provided a complete de novo transcriptome assembly. Fifteen different transcriptome assemblies were constructed from five different well-known assemblers used in scientific literature with different k-mer lengths (Bridger, BinPacker, SOAPdenovo-Trans, Trinity and SPAdes) as well as two merging approaches (EvidentialGene and Transfuse). Based on the four quality metrics of assembly, the results indicated an efficiency in the process of merging the assemblies after being generated by de novo assemblers. Finally, EvidentialGene was recognized as the best assembler for the de novo assembly of the leaf transcriptome in walnut. Among a total number of 183,191 transcripts which were generated by EvidentialGene, there were 109,413 transcripts capable of protein potential (59.72%) and 104,926 were recognized as ORFs (57.27%). In addition, 79,185 transcripts were predicted to exist with at least one hit to the Pfam database. A number of 3,931 transcription factors were identified by BLAST searching against PlnTFDB. Furthermore, 6,591 of the predicted peptide sequences contained signaling peptides, while 92,704 contained transmembrane domains. Comparison of the assembled transcripts with transcripts of the walnut and published genome assembly for the 'Chandler' cultivar using the BLAST algorithm led to identify a total number of 27,304 and 19,178 homologue transcripts, respectively. De novo transcriptomes in walnut leaves can be developed for the future studies in functional genomics and genetic studies of walnuts.</description><subject>Algorithms</subject><subject>Analysis</subject><subject>Arthropods</subject><subject>Assemblies</subject><subject>Assembly</subject><subject>Biology and Life Sciences</subject><subject>Computational Biology - methods</subject><subject>Contig Mapping - methods</subject><subject>Crop yields</subject><subject>Cultivars</subject><subject>DNA binding proteins</subject><subject>Explosions</subject><subject>Future predictions</subject><subject>Gene expression</subject><subject>Gene Expression Profiling - methods</subject><subject>Gene Library</subject><subject>Genomes</subject><subject>Genomic libraries</subject><subject>Genomics</subject><subject>High-Throughput Nucleotide Sequencing - methods</subject><subject>Homology</subject><subject>Horticulture</subject><subject>Juglans - genetics</subject><subject>Leaves</subject><subject>Molecular Sequence Annotation</subject><subject>Peptides</subject><subject>Plant Leaves - genetics</subject><subject>Plant Proteins - genetics</subject><subject>Research and Analysis Methods</subject><subject>Researchers</subject><subject>Science literature</subject><subject>Sequence Analysis, RNA - methods</subject><subject>Soil sciences</subject><subject>Studies</subject><subject>Transcription factors</subject><subject>Transmembrane domains</subject><subject>Walnuts</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>DOA</sourceid><recordid>eNqNk9tq3DAQhk1padK0b1BaQ6G0F7vVwZblm0JYelgIpPQEvRKyPN5VkCVHktPD01ebdcK65KIIJDH65p_RSJNlTzFaYlrhNxdu9Faa5eAsLBGhBKHyXnaMa0oWjCB6_2B_lD0K4SIBlDP2MDuihBa0ovQ4-7FyfaOttptc2xYGSJONeQu5dVculyFA3xgNIY8ud0PUvf4DuQHZ5dFLG5TXQ3Q95K7LP4EPWtr8pzR2jI-zB500AZ5M60n27f27r6uPi7PzD-vV6dlCsZrERctAqRq4ZAhIg-uG8rariKIcS8YkJW1ZAK8KxRDiDQcGTUeqgiFCaNk0iJ5kz_e6g3FBTFUJgtC6ZKjktEzEek-0Tl6Iwete-t_CSS2uDc5vhPRRKwOCKNRwCkVR17xoJUop1C3uOozbtgFSJ623U7Sx6aFVqVhempno_MTqrdi4K1FhzgknSeDVJODd5Qghil4HBcZIC268zptRVNGqSOiLf9C7bzdRG5kuoG3nUly1ExWnjBCCq7rCiVreQaXRQq9V-kKdTvaZw-uZQ2Ii_IobOYYg1l8-_z97_n3OvjxgtyBN3AZnxqidDXOw2IPKuxA8dLdFxkjsOuCmGmLXAWLqgOT27PCBbp1uvjz9C8DpAQg</recordid><startdate>20200428</startdate><enddate>20200428</enddate><creator>Sadat-Hosseini, Mohammad</creator><creator>Bakhtiarizadeh, Mohammad Reza</creator><creator>Boroomand, Naser</creator><creator>Tohidfar, Masoud</creator><creator>Vahdati, Kourosh</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0001-8458-8697</orcidid></search><sort><creationdate>20200428</creationdate><title>Combining independent de novo assemblies to optimize leaf transcriptome of Persian walnut</title><author>Sadat-Hosseini, Mohammad ; Bakhtiarizadeh, Mohammad Reza ; Boroomand, Naser ; Tohidfar, Masoud ; Vahdati, Kourosh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c692t-d6ecc9e8a60e2b19b38df72c381a66a32d54e874c6008b8e6ebf274602235bb03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Analysis</topic><topic>Arthropods</topic><topic>Assemblies</topic><topic>Assembly</topic><topic>Biology and Life Sciences</topic><topic>Computational Biology - methods</topic><topic>Contig Mapping - methods</topic><topic>Crop yields</topic><topic>Cultivars</topic><topic>DNA binding proteins</topic><topic>Explosions</topic><topic>Future predictions</topic><topic>Gene expression</topic><topic>Gene Expression Profiling - methods</topic><topic>Gene Library</topic><topic>Genomes</topic><topic>Genomic libraries</topic><topic>Genomics</topic><topic>High-Throughput Nucleotide Sequencing - methods</topic><topic>Homology</topic><topic>Horticulture</topic><topic>Juglans - genetics</topic><topic>Leaves</topic><topic>Molecular Sequence Annotation</topic><topic>Peptides</topic><topic>Plant Leaves - genetics</topic><topic>Plant Proteins - genetics</topic><topic>Research and Analysis Methods</topic><topic>Researchers</topic><topic>Science literature</topic><topic>Sequence Analysis, RNA - methods</topic><topic>Soil sciences</topic><topic>Studies</topic><topic>Transcription factors</topic><topic>Transmembrane domains</topic><topic>Walnuts</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sadat-Hosseini, Mohammad</creatorcontrib><creatorcontrib>Bakhtiarizadeh, Mohammad Reza</creatorcontrib><creatorcontrib>Boroomand, Naser</creatorcontrib><creatorcontrib>Tohidfar, Masoud</creatorcontrib><creatorcontrib>Vahdati, Kourosh</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Opposing Viewpoints</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Nursing & Allied Health Database</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological & Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>Agricultural & Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Nursing & Allied Health Database (Alumni Edition)</collection><collection>Meteorological & Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agricultural Science Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biological Science Database</collection><collection>Engineering Database</collection><collection>Nursing & Allied Health Premium</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials Science Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sadat-Hosseini, Mohammad</au><au>Bakhtiarizadeh, Mohammad Reza</au><au>Boroomand, Naser</au><au>Tohidfar, Masoud</au><au>Vahdati, Kourosh</au><au>Kumar, Shailesh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Combining independent de novo assemblies to optimize leaf transcriptome of Persian walnut</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2020-04-28</date><risdate>2020</risdate><volume>15</volume><issue>4</issue><spage>e0232005</spage><epage>e0232005</epage><pages>e0232005-e0232005</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>Transcriptome resources can facilitate to increase yield and quality of walnuts. Finding the best transcriptome assembly has not been the subject of walnuts research as yet. This research generated 240,179,782 reads from 11 walnut leaves according to cDNA libraries. The reads provided a complete de novo transcriptome assembly. Fifteen different transcriptome assemblies were constructed from five different well-known assemblers used in scientific literature with different k-mer lengths (Bridger, BinPacker, SOAPdenovo-Trans, Trinity and SPAdes) as well as two merging approaches (EvidentialGene and Transfuse). Based on the four quality metrics of assembly, the results indicated an efficiency in the process of merging the assemblies after being generated by de novo assemblers. Finally, EvidentialGene was recognized as the best assembler for the de novo assembly of the leaf transcriptome in walnut. Among a total number of 183,191 transcripts which were generated by EvidentialGene, there were 109,413 transcripts capable of protein potential (59.72%) and 104,926 were recognized as ORFs (57.27%). In addition, 79,185 transcripts were predicted to exist with at least one hit to the Pfam database. A number of 3,931 transcription factors were identified by BLAST searching against PlnTFDB. Furthermore, 6,591 of the predicted peptide sequences contained signaling peptides, while 92,704 contained transmembrane domains. Comparison of the assembled transcripts with transcripts of the walnut and published genome assembly for the 'Chandler' cultivar using the BLAST algorithm led to identify a total number of 27,304 and 19,178 homologue transcripts, respectively. De novo transcriptomes in walnut leaves can be developed for the future studies in functional genomics and genetic studies of walnuts.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>32343733</pmid><doi>10.1371/journal.pone.0232005</doi><orcidid>https://orcid.org/0000-0001-8458-8697</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1932-6203 |
ispartof | PloS one, 2020-04, Vol.15 (4), p.e0232005-e0232005 |
issn | 1932-6203 1932-6203 |
language | eng |
recordid | cdi_plos_journals_2395605835 |
source | MEDLINE; DOAJ Directory of Open Access Journals; Public Library of Science (PLoS) Journals Open Access; EZB-FREE-00999 freely available EZB journals; PubMed Central; Free Full-Text Journals in Chemistry |
subjects | Algorithms Analysis Arthropods Assemblies Assembly Biology and Life Sciences Computational Biology - methods Contig Mapping - methods Crop yields Cultivars DNA binding proteins Explosions Future predictions Gene expression Gene Expression Profiling - methods Gene Library Genomes Genomic libraries Genomics High-Throughput Nucleotide Sequencing - methods Homology Horticulture Juglans - genetics Leaves Molecular Sequence Annotation Peptides Plant Leaves - genetics Plant Proteins - genetics Research and Analysis Methods Researchers Science literature Sequence Analysis, RNA - methods Soil sciences Studies Transcription factors Transmembrane domains Walnuts |
title | Combining independent de novo assemblies to optimize leaf transcriptome of Persian walnut |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T04%3A17%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Combining%20independent%20de%20novo%20assemblies%20to%20optimize%20leaf%20transcriptome%20of%20Persian%20walnut&rft.jtitle=PloS%20one&rft.au=Sadat-Hosseini,%20Mohammad&rft.date=2020-04-28&rft.volume=15&rft.issue=4&rft.spage=e0232005&rft.epage=e0232005&rft.pages=e0232005-e0232005&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0232005&rft_dat=%3Cgale_plos_%3EA622217971%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2395605835&rft_id=info:pmid/32343733&rft_galeid=A622217971&rft_doaj_id=oai_doaj_org_article_2c0b83e449984da0a669d1ff11ddbe29&rfr_iscdi=true |