A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data

[Display omitted] •We corrected a software bug of PHYLIP, the evolutionary study package with the longest history.•The bug had been left for more than 25years.•This correction solves the redundancy in bootstrapping that cause incorrect inference of consensus tree.•We build an add-on program of PHYLI...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Molecular phylogenetics and evolution 2017-04, Vol.109, p.409-414
Hauptverfasser: Shimada, Makoto K., Nishida, Tsunetoshi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 414
container_issue
container_start_page 409
container_title Molecular phylogenetics and evolution
container_volume 109
creator Shimada, Makoto K.
Nishida, Tsunetoshi
description [Display omitted] •We corrected a software bug of PHYLIP, the evolutionary study package with the longest history.•The bug had been left for more than 25years.•This correction solves the redundancy in bootstrapping that cause incorrect inference of consensus tree.•We build an add-on program of PHYLIP that infers a tree from original data with the bootstrap values.•We build an bash script that enable automatic tree construction from unaligned sequence data. Felsenstein’s PHYLIP package of molecular phylogeny tools has been used globally since 1980. The programs are receiving renewed attention because of their character-based user interface, which has the advantage of being scriptable for use with large-scale data studies based on super-computers or massively parallel computing clusters. However, occasionally we found, the PHYLIP Consense program output text file displays two or more divided bootstrap values for the same cluster in its result table, and when this happens the output Newick tree file incorrectly assigns only the last value to that cluster that disturbs correct estimation of a consensus tree. We ascertained the cause of this aberrant behavior in the bootstrapping calculation. Our rewrite of the Consense program source code outputs bootstrap values, without redundancy, in its result table, and a Newick tree file with appropriate, corresponding bootstrap values. Furthermore, we developed an add-on program and shell script, add_bootstrap.pl and fasta2tre_bs.bsh, to generate a Newick tree containing the topology and branch lengths inferred from the original data along with valid bootstrap values, and to actualize the automated inference of a phylogenetic tree containing the originally inferred topology and branch lengths with bootstrap values, from multiple unaligned sequences, respectively. These programs can be downloaded at: https://github.com/ShimadaMK/PHYLIP_enhance/.
doi_str_mv 10.1016/j.ympev.2017.02.012
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1872579866</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1055790316304523</els_id><sourcerecordid>1872579866</sourcerecordid><originalsourceid>FETCH-LOGICAL-c425t-27e993f1bc8b65ee5334fb802fcd00d786c2b825eca8fc603c11dc9f6f0ffc663</originalsourceid><addsrcrecordid>eNp9kctuFDEQRS0EIiHwBUjISxbpxo-4H0gsRhGQSCORBSxYWW67PHjUthvbHSn_xEfimQlZsijVQ6duqXQRektJSwntPuzbB7_AfcsI7VvCWkLZM3ROySiaUVD-_FAL0fQj4WfoVc57QigVo3iJztjAOKPjcI7-bLCPxlmnVXEx4Ghx-QX47ubn9vYOLynukvIf8QbnOK9HwsZ0RBKYNRgVCtbzmgukAz3N4C-xCqYGdn6pLYTyJF2Hai3R14HGU4wll6SWxYUdrkBJABm7YCFVcWxT9Dgmt3NBzdiool6jF1bNGd485gv048vn79c3zfbb19vrzbbRV0yUhvUwjtzSSQ9TJwAE51d2Ggiz2hBi-qHTbBqYAK0GqzvCNaVGj7azxNa-4xfo_Um3fvR7hVykd1nDPKsAcc2SDj0T_TgcUX5CdYo5J7BySc6r9CApkQeb5F4ebZIHmyRhstpUt949HlgnD-Zp558vFfh0AqC-ee8gyawdBA3GJdBFmuj-e-AvujOo6Q</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1872579866</pqid></control><display><type>article</type><title>A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data</title><source>MEDLINE</source><source>ScienceDirect Journals (5 years ago - present)</source><creator>Shimada, Makoto K. ; Nishida, Tsunetoshi</creator><creatorcontrib>Shimada, Makoto K. ; Nishida, Tsunetoshi</creatorcontrib><description>[Display omitted] •We corrected a software bug of PHYLIP, the evolutionary study package with the longest history.•The bug had been left for more than 25years.•This correction solves the redundancy in bootstrapping that cause incorrect inference of consensus tree.•We build an add-on program of PHYLIP that infers a tree from original data with the bootstrap values.•We build an bash script that enable automatic tree construction from unaligned sequence data. Felsenstein’s PHYLIP package of molecular phylogeny tools has been used globally since 1980. The programs are receiving renewed attention because of their character-based user interface, which has the advantage of being scriptable for use with large-scale data studies based on super-computers or massively parallel computing clusters. However, occasionally we found, the PHYLIP Consense program output text file displays two or more divided bootstrap values for the same cluster in its result table, and when this happens the output Newick tree file incorrectly assigns only the last value to that cluster that disturbs correct estimation of a consensus tree. We ascertained the cause of this aberrant behavior in the bootstrapping calculation. Our rewrite of the Consense program source code outputs bootstrap values, without redundancy, in its result table, and a Newick tree file with appropriate, corresponding bootstrap values. Furthermore, we developed an add-on program and shell script, add_bootstrap.pl and fasta2tre_bs.bsh, to generate a Newick tree containing the topology and branch lengths inferred from the original data along with valid bootstrap values, and to actualize the automated inference of a phylogenetic tree containing the originally inferred topology and branch lengths with bootstrap values, from multiple unaligned sequences, respectively. These programs can be downloaded at: https://github.com/ShimadaMK/PHYLIP_enhance/.</description><identifier>ISSN: 1055-7903</identifier><identifier>EISSN: 1095-9513</identifier><identifier>DOI: 10.1016/j.ympev.2017.02.012</identifier><identifier>PMID: 28232198</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Bootstrap analysis ; Floating point numbers ; Hash table ; Originally inferred tree ; PHYLIP ; Phylogeny ; Software</subject><ispartof>Molecular phylogenetics and evolution, 2017-04, Vol.109, p.409-414</ispartof><rights>2017 Elsevier Inc.</rights><rights>Copyright © 2017 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c425t-27e993f1bc8b65ee5334fb802fcd00d786c2b825eca8fc603c11dc9f6f0ffc663</citedby><cites>FETCH-LOGICAL-c425t-27e993f1bc8b65ee5334fb802fcd00d786c2b825eca8fc603c11dc9f6f0ffc663</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S1055790316304523$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28232198$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Shimada, Makoto K.</creatorcontrib><creatorcontrib>Nishida, Tsunetoshi</creatorcontrib><title>A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data</title><title>Molecular phylogenetics and evolution</title><addtitle>Mol Phylogenet Evol</addtitle><description>[Display omitted] •We corrected a software bug of PHYLIP, the evolutionary study package with the longest history.•The bug had been left for more than 25years.•This correction solves the redundancy in bootstrapping that cause incorrect inference of consensus tree.•We build an add-on program of PHYLIP that infers a tree from original data with the bootstrap values.•We build an bash script that enable automatic tree construction from unaligned sequence data. Felsenstein’s PHYLIP package of molecular phylogeny tools has been used globally since 1980. The programs are receiving renewed attention because of their character-based user interface, which has the advantage of being scriptable for use with large-scale data studies based on super-computers or massively parallel computing clusters. However, occasionally we found, the PHYLIP Consense program output text file displays two or more divided bootstrap values for the same cluster in its result table, and when this happens the output Newick tree file incorrectly assigns only the last value to that cluster that disturbs correct estimation of a consensus tree. We ascertained the cause of this aberrant behavior in the bootstrapping calculation. Our rewrite of the Consense program source code outputs bootstrap values, without redundancy, in its result table, and a Newick tree file with appropriate, corresponding bootstrap values. Furthermore, we developed an add-on program and shell script, add_bootstrap.pl and fasta2tre_bs.bsh, to generate a Newick tree containing the topology and branch lengths inferred from the original data along with valid bootstrap values, and to actualize the automated inference of a phylogenetic tree containing the originally inferred topology and branch lengths with bootstrap values, from multiple unaligned sequences, respectively. These programs can be downloaded at: https://github.com/ShimadaMK/PHYLIP_enhance/.</description><subject>Bootstrap analysis</subject><subject>Floating point numbers</subject><subject>Hash table</subject><subject>Originally inferred tree</subject><subject>PHYLIP</subject><subject>Phylogeny</subject><subject>Software</subject><issn>1055-7903</issn><issn>1095-9513</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kctuFDEQRS0EIiHwBUjISxbpxo-4H0gsRhGQSCORBSxYWW67PHjUthvbHSn_xEfimQlZsijVQ6duqXQRektJSwntPuzbB7_AfcsI7VvCWkLZM3ROySiaUVD-_FAL0fQj4WfoVc57QigVo3iJztjAOKPjcI7-bLCPxlmnVXEx4Ghx-QX47ubn9vYOLynukvIf8QbnOK9HwsZ0RBKYNRgVCtbzmgukAz3N4C-xCqYGdn6pLYTyJF2Hai3R14HGU4wll6SWxYUdrkBJABm7YCFVcWxT9Dgmt3NBzdiool6jF1bNGd485gv048vn79c3zfbb19vrzbbRV0yUhvUwjtzSSQ9TJwAE51d2Ggiz2hBi-qHTbBqYAK0GqzvCNaVGj7azxNa-4xfo_Um3fvR7hVykd1nDPKsAcc2SDj0T_TgcUX5CdYo5J7BySc6r9CApkQeb5F4ebZIHmyRhstpUt949HlgnD-Zp558vFfh0AqC-ee8gyawdBA3GJdBFmuj-e-AvujOo6Q</recordid><startdate>201704</startdate><enddate>201704</enddate><creator>Shimada, Makoto K.</creator><creator>Nishida, Tsunetoshi</creator><general>Elsevier Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>201704</creationdate><title>A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data</title><author>Shimada, Makoto K. ; Nishida, Tsunetoshi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c425t-27e993f1bc8b65ee5334fb802fcd00d786c2b825eca8fc603c11dc9f6f0ffc663</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Bootstrap analysis</topic><topic>Floating point numbers</topic><topic>Hash table</topic><topic>Originally inferred tree</topic><topic>PHYLIP</topic><topic>Phylogeny</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shimada, Makoto K.</creatorcontrib><creatorcontrib>Nishida, Tsunetoshi</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Molecular phylogenetics and evolution</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shimada, Makoto K.</au><au>Nishida, Tsunetoshi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data</atitle><jtitle>Molecular phylogenetics and evolution</jtitle><addtitle>Mol Phylogenet Evol</addtitle><date>2017-04</date><risdate>2017</risdate><volume>109</volume><spage>409</spage><epage>414</epage><pages>409-414</pages><issn>1055-7903</issn><eissn>1095-9513</eissn><abstract>[Display omitted] •We corrected a software bug of PHYLIP, the evolutionary study package with the longest history.•The bug had been left for more than 25years.•This correction solves the redundancy in bootstrapping that cause incorrect inference of consensus tree.•We build an add-on program of PHYLIP that infers a tree from original data with the bootstrap values.•We build an bash script that enable automatic tree construction from unaligned sequence data. Felsenstein’s PHYLIP package of molecular phylogeny tools has been used globally since 1980. The programs are receiving renewed attention because of their character-based user interface, which has the advantage of being scriptable for use with large-scale data studies based on super-computers or massively parallel computing clusters. However, occasionally we found, the PHYLIP Consense program output text file displays two or more divided bootstrap values for the same cluster in its result table, and when this happens the output Newick tree file incorrectly assigns only the last value to that cluster that disturbs correct estimation of a consensus tree. We ascertained the cause of this aberrant behavior in the bootstrapping calculation. Our rewrite of the Consense program source code outputs bootstrap values, without redundancy, in its result table, and a Newick tree file with appropriate, corresponding bootstrap values. Furthermore, we developed an add-on program and shell script, add_bootstrap.pl and fasta2tre_bs.bsh, to generate a Newick tree containing the topology and branch lengths inferred from the original data along with valid bootstrap values, and to actualize the automated inference of a phylogenetic tree containing the originally inferred topology and branch lengths with bootstrap values, from multiple unaligned sequences, respectively. These programs can be downloaded at: https://github.com/ShimadaMK/PHYLIP_enhance/.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>28232198</pmid><doi>10.1016/j.ympev.2017.02.012</doi><tpages>6</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1055-7903
ispartof Molecular phylogenetics and evolution, 2017-04, Vol.109, p.409-414
issn 1055-7903
1095-9513
language eng
recordid cdi_proquest_miscellaneous_1872579866
source MEDLINE; ScienceDirect Journals (5 years ago - present)
subjects Bootstrap analysis
Floating point numbers
Hash table
Originally inferred tree
PHYLIP
Phylogeny
Software
title A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T21%3A53%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20modification%20of%20the%20PHYLIP%20program:%20A%20solution%20for%20the%20redundant%20cluster%20problem,%20and%20an%20implementation%20of%20an%20automatic%20bootstrapping%20on%20trees%20inferred%20from%20original%20data&rft.jtitle=Molecular%20phylogenetics%20and%20evolution&rft.au=Shimada,%20Makoto%20K.&rft.date=2017-04&rft.volume=109&rft.spage=409&rft.epage=414&rft.pages=409-414&rft.issn=1055-7903&rft.eissn=1095-9513&rft_id=info:doi/10.1016/j.ympev.2017.02.012&rft_dat=%3Cproquest_cross%3E1872579866%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1872579866&rft_id=info:pmid/28232198&rft_els_id=S1055790316304523&rfr_iscdi=true