A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data
[Display omitted] •We corrected a software bug of PHYLIP, the evolutionary study package with the longest history.•The bug had been left for more than 25years.•This correction solves the redundancy in bootstrapping that cause incorrect inference of consensus tree.•We build an add-on program of PHYLI...
Gespeichert in:
Veröffentlicht in: | Molecular phylogenetics and evolution 2017-04, Vol.109, p.409-414 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 414 |
---|---|
container_issue | |
container_start_page | 409 |
container_title | Molecular phylogenetics and evolution |
container_volume | 109 |
creator | Shimada, Makoto K. Nishida, Tsunetoshi |
description | [Display omitted]
•We corrected a software bug of PHYLIP, the evolutionary study package with the longest history.•The bug had been left for more than 25years.•This correction solves the redundancy in bootstrapping that cause incorrect inference of consensus tree.•We build an add-on program of PHYLIP that infers a tree from original data with the bootstrap values.•We build an bash script that enable automatic tree construction from unaligned sequence data.
Felsenstein’s PHYLIP package of molecular phylogeny tools has been used globally since 1980. The programs are receiving renewed attention because of their character-based user interface, which has the advantage of being scriptable for use with large-scale data studies based on super-computers or massively parallel computing clusters. However, occasionally we found, the PHYLIP Consense program output text file displays two or more divided bootstrap values for the same cluster in its result table, and when this happens the output Newick tree file incorrectly assigns only the last value to that cluster that disturbs correct estimation of a consensus tree. We ascertained the cause of this aberrant behavior in the bootstrapping calculation. Our rewrite of the Consense program source code outputs bootstrap values, without redundancy, in its result table, and a Newick tree file with appropriate, corresponding bootstrap values. Furthermore, we developed an add-on program and shell script, add_bootstrap.pl and fasta2tre_bs.bsh, to generate a Newick tree containing the topology and branch lengths inferred from the original data along with valid bootstrap values, and to actualize the automated inference of a phylogenetic tree containing the originally inferred topology and branch lengths with bootstrap values, from multiple unaligned sequences, respectively. These programs can be downloaded at: https://github.com/ShimadaMK/PHYLIP_enhance/. |
doi_str_mv | 10.1016/j.ympev.2017.02.012 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1872579866</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1055790316304523</els_id><sourcerecordid>1872579866</sourcerecordid><originalsourceid>FETCH-LOGICAL-c425t-27e993f1bc8b65ee5334fb802fcd00d786c2b825eca8fc603c11dc9f6f0ffc663</originalsourceid><addsrcrecordid>eNp9kctuFDEQRS0EIiHwBUjISxbpxo-4H0gsRhGQSCORBSxYWW67PHjUthvbHSn_xEfimQlZsijVQ6duqXQRektJSwntPuzbB7_AfcsI7VvCWkLZM3ROySiaUVD-_FAL0fQj4WfoVc57QigVo3iJztjAOKPjcI7-bLCPxlmnVXEx4Ghx-QX47ubn9vYOLynukvIf8QbnOK9HwsZ0RBKYNRgVCtbzmgukAz3N4C-xCqYGdn6pLYTyJF2Hai3R14HGU4wll6SWxYUdrkBJABm7YCFVcWxT9Dgmt3NBzdiool6jF1bNGd485gv048vn79c3zfbb19vrzbbRV0yUhvUwjtzSSQ9TJwAE51d2Ggiz2hBi-qHTbBqYAK0GqzvCNaVGj7azxNa-4xfo_Um3fvR7hVykd1nDPKsAcc2SDj0T_TgcUX5CdYo5J7BySc6r9CApkQeb5F4ebZIHmyRhstpUt949HlgnD-Zp558vFfh0AqC-ee8gyawdBA3GJdBFmuj-e-AvujOo6Q</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1872579866</pqid></control><display><type>article</type><title>A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data</title><source>MEDLINE</source><source>ScienceDirect Journals (5 years ago - present)</source><creator>Shimada, Makoto K. ; Nishida, Tsunetoshi</creator><creatorcontrib>Shimada, Makoto K. ; Nishida, Tsunetoshi</creatorcontrib><description>[Display omitted]
•We corrected a software bug of PHYLIP, the evolutionary study package with the longest history.•The bug had been left for more than 25years.•This correction solves the redundancy in bootstrapping that cause incorrect inference of consensus tree.•We build an add-on program of PHYLIP that infers a tree from original data with the bootstrap values.•We build an bash script that enable automatic tree construction from unaligned sequence data.
Felsenstein’s PHYLIP package of molecular phylogeny tools has been used globally since 1980. The programs are receiving renewed attention because of their character-based user interface, which has the advantage of being scriptable for use with large-scale data studies based on super-computers or massively parallel computing clusters. However, occasionally we found, the PHYLIP Consense program output text file displays two or more divided bootstrap values for the same cluster in its result table, and when this happens the output Newick tree file incorrectly assigns only the last value to that cluster that disturbs correct estimation of a consensus tree. We ascertained the cause of this aberrant behavior in the bootstrapping calculation. Our rewrite of the Consense program source code outputs bootstrap values, without redundancy, in its result table, and a Newick tree file with appropriate, corresponding bootstrap values. Furthermore, we developed an add-on program and shell script, add_bootstrap.pl and fasta2tre_bs.bsh, to generate a Newick tree containing the topology and branch lengths inferred from the original data along with valid bootstrap values, and to actualize the automated inference of a phylogenetic tree containing the originally inferred topology and branch lengths with bootstrap values, from multiple unaligned sequences, respectively. These programs can be downloaded at: https://github.com/ShimadaMK/PHYLIP_enhance/.</description><identifier>ISSN: 1055-7903</identifier><identifier>EISSN: 1095-9513</identifier><identifier>DOI: 10.1016/j.ympev.2017.02.012</identifier><identifier>PMID: 28232198</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Bootstrap analysis ; Floating point numbers ; Hash table ; Originally inferred tree ; PHYLIP ; Phylogeny ; Software</subject><ispartof>Molecular phylogenetics and evolution, 2017-04, Vol.109, p.409-414</ispartof><rights>2017 Elsevier Inc.</rights><rights>Copyright © 2017 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c425t-27e993f1bc8b65ee5334fb802fcd00d786c2b825eca8fc603c11dc9f6f0ffc663</citedby><cites>FETCH-LOGICAL-c425t-27e993f1bc8b65ee5334fb802fcd00d786c2b825eca8fc603c11dc9f6f0ffc663</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S1055790316304523$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28232198$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Shimada, Makoto K.</creatorcontrib><creatorcontrib>Nishida, Tsunetoshi</creatorcontrib><title>A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data</title><title>Molecular phylogenetics and evolution</title><addtitle>Mol Phylogenet Evol</addtitle><description>[Display omitted]
•We corrected a software bug of PHYLIP, the evolutionary study package with the longest history.•The bug had been left for more than 25years.•This correction solves the redundancy in bootstrapping that cause incorrect inference of consensus tree.•We build an add-on program of PHYLIP that infers a tree from original data with the bootstrap values.•We build an bash script that enable automatic tree construction from unaligned sequence data.
Felsenstein’s PHYLIP package of molecular phylogeny tools has been used globally since 1980. The programs are receiving renewed attention because of their character-based user interface, which has the advantage of being scriptable for use with large-scale data studies based on super-computers or massively parallel computing clusters. However, occasionally we found, the PHYLIP Consense program output text file displays two or more divided bootstrap values for the same cluster in its result table, and when this happens the output Newick tree file incorrectly assigns only the last value to that cluster that disturbs correct estimation of a consensus tree. We ascertained the cause of this aberrant behavior in the bootstrapping calculation. Our rewrite of the Consense program source code outputs bootstrap values, without redundancy, in its result table, and a Newick tree file with appropriate, corresponding bootstrap values. Furthermore, we developed an add-on program and shell script, add_bootstrap.pl and fasta2tre_bs.bsh, to generate a Newick tree containing the topology and branch lengths inferred from the original data along with valid bootstrap values, and to actualize the automated inference of a phylogenetic tree containing the originally inferred topology and branch lengths with bootstrap values, from multiple unaligned sequences, respectively. These programs can be downloaded at: https://github.com/ShimadaMK/PHYLIP_enhance/.</description><subject>Bootstrap analysis</subject><subject>Floating point numbers</subject><subject>Hash table</subject><subject>Originally inferred tree</subject><subject>PHYLIP</subject><subject>Phylogeny</subject><subject>Software</subject><issn>1055-7903</issn><issn>1095-9513</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kctuFDEQRS0EIiHwBUjISxbpxo-4H0gsRhGQSCORBSxYWW67PHjUthvbHSn_xEfimQlZsijVQ6duqXQRektJSwntPuzbB7_AfcsI7VvCWkLZM3ROySiaUVD-_FAL0fQj4WfoVc57QigVo3iJztjAOKPjcI7-bLCPxlmnVXEx4Ghx-QX47ubn9vYOLynukvIf8QbnOK9HwsZ0RBKYNRgVCtbzmgukAz3N4C-xCqYGdn6pLYTyJF2Hai3R14HGU4wll6SWxYUdrkBJABm7YCFVcWxT9Dgmt3NBzdiool6jF1bNGd485gv048vn79c3zfbb19vrzbbRV0yUhvUwjtzSSQ9TJwAE51d2Ggiz2hBi-qHTbBqYAK0GqzvCNaVGj7azxNa-4xfo_Um3fvR7hVykd1nDPKsAcc2SDj0T_TgcUX5CdYo5J7BySc6r9CApkQeb5F4ebZIHmyRhstpUt949HlgnD-Zp558vFfh0AqC-ee8gyawdBA3GJdBFmuj-e-AvujOo6Q</recordid><startdate>201704</startdate><enddate>201704</enddate><creator>Shimada, Makoto K.</creator><creator>Nishida, Tsunetoshi</creator><general>Elsevier Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>201704</creationdate><title>A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data</title><author>Shimada, Makoto K. ; Nishida, Tsunetoshi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c425t-27e993f1bc8b65ee5334fb802fcd00d786c2b825eca8fc603c11dc9f6f0ffc663</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Bootstrap analysis</topic><topic>Floating point numbers</topic><topic>Hash table</topic><topic>Originally inferred tree</topic><topic>PHYLIP</topic><topic>Phylogeny</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shimada, Makoto K.</creatorcontrib><creatorcontrib>Nishida, Tsunetoshi</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Molecular phylogenetics and evolution</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shimada, Makoto K.</au><au>Nishida, Tsunetoshi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data</atitle><jtitle>Molecular phylogenetics and evolution</jtitle><addtitle>Mol Phylogenet Evol</addtitle><date>2017-04</date><risdate>2017</risdate><volume>109</volume><spage>409</spage><epage>414</epage><pages>409-414</pages><issn>1055-7903</issn><eissn>1095-9513</eissn><abstract>[Display omitted]
•We corrected a software bug of PHYLIP, the evolutionary study package with the longest history.•The bug had been left for more than 25years.•This correction solves the redundancy in bootstrapping that cause incorrect inference of consensus tree.•We build an add-on program of PHYLIP that infers a tree from original data with the bootstrap values.•We build an bash script that enable automatic tree construction from unaligned sequence data.
Felsenstein’s PHYLIP package of molecular phylogeny tools has been used globally since 1980. The programs are receiving renewed attention because of their character-based user interface, which has the advantage of being scriptable for use with large-scale data studies based on super-computers or massively parallel computing clusters. However, occasionally we found, the PHYLIP Consense program output text file displays two or more divided bootstrap values for the same cluster in its result table, and when this happens the output Newick tree file incorrectly assigns only the last value to that cluster that disturbs correct estimation of a consensus tree. We ascertained the cause of this aberrant behavior in the bootstrapping calculation. Our rewrite of the Consense program source code outputs bootstrap values, without redundancy, in its result table, and a Newick tree file with appropriate, corresponding bootstrap values. Furthermore, we developed an add-on program and shell script, add_bootstrap.pl and fasta2tre_bs.bsh, to generate a Newick tree containing the topology and branch lengths inferred from the original data along with valid bootstrap values, and to actualize the automated inference of a phylogenetic tree containing the originally inferred topology and branch lengths with bootstrap values, from multiple unaligned sequences, respectively. These programs can be downloaded at: https://github.com/ShimadaMK/PHYLIP_enhance/.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>28232198</pmid><doi>10.1016/j.ympev.2017.02.012</doi><tpages>6</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1055-7903 |
ispartof | Molecular phylogenetics and evolution, 2017-04, Vol.109, p.409-414 |
issn | 1055-7903 1095-9513 |
language | eng |
recordid | cdi_proquest_miscellaneous_1872579866 |
source | MEDLINE; ScienceDirect Journals (5 years ago - present) |
subjects | Bootstrap analysis Floating point numbers Hash table Originally inferred tree PHYLIP Phylogeny Software |
title | A modification of the PHYLIP program: A solution for the redundant cluster problem, and an implementation of an automatic bootstrapping on trees inferred from original data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T21%3A53%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20modification%20of%20the%20PHYLIP%20program:%20A%20solution%20for%20the%20redundant%20cluster%20problem,%20and%20an%20implementation%20of%20an%20automatic%20bootstrapping%20on%20trees%20inferred%20from%20original%20data&rft.jtitle=Molecular%20phylogenetics%20and%20evolution&rft.au=Shimada,%20Makoto%20K.&rft.date=2017-04&rft.volume=109&rft.spage=409&rft.epage=414&rft.pages=409-414&rft.issn=1055-7903&rft.eissn=1095-9513&rft_id=info:doi/10.1016/j.ympev.2017.02.012&rft_dat=%3Cproquest_cross%3E1872579866%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1872579866&rft_id=info:pmid/28232198&rft_els_id=S1055790316304523&rfr_iscdi=true |