High performance phylogenetic inference

Phylogenetic analysis is an integral part of many biological research programs. In essence, it is the study of gene genealogy. It is the study of gene mutation and the generational relationships. Phylogenetic analysis is being used in many diverse areas such as human epidemiology, viral transmission...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Clement, M., Snell, Q., Judd, G., Whiting, M.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 336
container_issue
container_start_page 335
container_title
container_volume
creator Clement, M.
Snell, Q.
Judd, G.
Whiting, M.
description Phylogenetic analysis is an integral part of many biological research programs. In essence, it is the study of gene genealogy. It is the study of gene mutation and the generational relationships. Phylogenetic analysis is being used in many diverse areas such as human epidemiology, viral transmission, biogeography, and systematics. Researchers are now commonly generating many DNA sequences from many individuals, thus creating very large data sets. However, our ability to analyze the data has not kept pace with data generation, and phylogenetics has now reached a crossroads where we cannot effectively analyze the data we generate. The chief challenge of phylogenetic systematics in the next century will be to develop algorithms and search strategies to effectively analyze large data sets. The crux of the computational problem is that the actual landscape of possible topologies can be extraordinarily difficult to evaluate with large data sets. The parsimony ratchet is actually a family of iterative tree search methods that use a statistical approach to sampling tree islands and ultimately finding the most parsimonious trees for a data set. Each iteration of the parsimony ratchet may occur in parallel as there is no direct dependency between iterations. The authors' implementation of the parallel ratchet is masterworker based. A master process is launched in the DOGMA system which then launches worker tasks on available nodes. Each worker task is simply wrapper code that is used to interact with the newest release version of NONA.
doi_str_mv 10.1109/HPDC.1999.805315
format Conference Proceeding
fullrecord <record><control><sourceid>proquest_6IE</sourceid><recordid>TN_cdi_ieee_primary_805315</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>805315</ieee_id><sourcerecordid>26967063</sourcerecordid><originalsourceid>FETCH-LOGICAL-i118t-1201e709f830f680f92e76f6c7a4fb9ed0dbfa87ed70e98567c3620c1e4ae2c33</originalsourceid><addsrcrecordid>eNotj0tLw0AURgdUsK3uxVVWukq8d6aZx1LiI0JBF7oO08mddiQvJ-mi_95KXH1wOBz4GLtByBDBPJQfT0WGxphMQy4wP2NLUBpELjXCOVsgaJ5qA-qSLcfxG4ADcrVg92XY7ZOBou9jaztHybA_Nv2OOpqCS0LnKdIJX7ELb5uRrv93xb5enj-LMt28v74Vj5s0IOopxVOWFBivBXipwRtOSnrplF37raEa6q23WlGtgIzOpXJCcnBIa0vcCbFid3N3iP3PgcapasPoqGlsR_1hrLg0UoH8E29nMRBRNcTQ2nis5vPiF2mPTQE</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>26967063</pqid></control><display><type>conference_proceeding</type><title>High performance phylogenetic inference</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Clement, M. ; Snell, Q. ; Judd, G. ; Whiting, M.</creator><creatorcontrib>Clement, M. ; Snell, Q. ; Judd, G. ; Whiting, M.</creatorcontrib><description>Phylogenetic analysis is an integral part of many biological research programs. In essence, it is the study of gene genealogy. It is the study of gene mutation and the generational relationships. Phylogenetic analysis is being used in many diverse areas such as human epidemiology, viral transmission, biogeography, and systematics. Researchers are now commonly generating many DNA sequences from many individuals, thus creating very large data sets. However, our ability to analyze the data has not kept pace with data generation, and phylogenetics has now reached a crossroads where we cannot effectively analyze the data we generate. The chief challenge of phylogenetic systematics in the next century will be to develop algorithms and search strategies to effectively analyze large data sets. The crux of the computational problem is that the actual landscape of possible topologies can be extraordinarily difficult to evaluate with large data sets. The parsimony ratchet is actually a family of iterative tree search methods that use a statistical approach to sampling tree islands and ultimately finding the most parsimonious trees for a data set. Each iteration of the parsimony ratchet may occur in parallel as there is no direct dependency between iterations. The authors' implementation of the parallel ratchet is masterworker based. A master process is launched in the DOGMA system which then launches worker tasks on available nodes. Each worker task is simply wrapper code that is used to interact with the newest release version of NONA.</description><identifier>ISSN: 1082-8907</identifier><identifier>ISBN: 0780356810</identifier><identifier>ISBN: 9780780356818</identifier><identifier>DOI: 10.1109/HPDC.1999.805315</identifier><language>eng</language><publisher>IEEE</publisher><subject>Biogeography ; Biological information theory ; Data analysis ; DNA ; Genetic mutations ; Humans ; Iterative algorithms ; Phylogeny ; Sequences ; Systematics</subject><ispartof>Proceedings of the IEEE International Symposium on High Performance Distributed Computing, 1999, p.335-336</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/805315$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,4036,4037,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/805315$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Clement, M.</creatorcontrib><creatorcontrib>Snell, Q.</creatorcontrib><creatorcontrib>Judd, G.</creatorcontrib><creatorcontrib>Whiting, M.</creatorcontrib><title>High performance phylogenetic inference</title><title>Proceedings of the IEEE International Symposium on High Performance Distributed Computing</title><addtitle>HPDC</addtitle><description>Phylogenetic analysis is an integral part of many biological research programs. In essence, it is the study of gene genealogy. It is the study of gene mutation and the generational relationships. Phylogenetic analysis is being used in many diverse areas such as human epidemiology, viral transmission, biogeography, and systematics. Researchers are now commonly generating many DNA sequences from many individuals, thus creating very large data sets. However, our ability to analyze the data has not kept pace with data generation, and phylogenetics has now reached a crossroads where we cannot effectively analyze the data we generate. The chief challenge of phylogenetic systematics in the next century will be to develop algorithms and search strategies to effectively analyze large data sets. The crux of the computational problem is that the actual landscape of possible topologies can be extraordinarily difficult to evaluate with large data sets. The parsimony ratchet is actually a family of iterative tree search methods that use a statistical approach to sampling tree islands and ultimately finding the most parsimonious trees for a data set. Each iteration of the parsimony ratchet may occur in parallel as there is no direct dependency between iterations. The authors' implementation of the parallel ratchet is masterworker based. A master process is launched in the DOGMA system which then launches worker tasks on available nodes. Each worker task is simply wrapper code that is used to interact with the newest release version of NONA.</description><subject>Biogeography</subject><subject>Biological information theory</subject><subject>Data analysis</subject><subject>DNA</subject><subject>Genetic mutations</subject><subject>Humans</subject><subject>Iterative algorithms</subject><subject>Phylogeny</subject><subject>Sequences</subject><subject>Systematics</subject><issn>1082-8907</issn><isbn>0780356810</isbn><isbn>9780780356818</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1999</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotj0tLw0AURgdUsK3uxVVWukq8d6aZx1LiI0JBF7oO08mddiQvJ-mi_95KXH1wOBz4GLtByBDBPJQfT0WGxphMQy4wP2NLUBpELjXCOVsgaJ5qA-qSLcfxG4ADcrVg92XY7ZOBou9jaztHybA_Nv2OOpqCS0LnKdIJX7ELb5uRrv93xb5enj-LMt28v74Vj5s0IOopxVOWFBivBXipwRtOSnrplF37raEa6q23WlGtgIzOpXJCcnBIa0vcCbFid3N3iP3PgcapasPoqGlsR_1hrLg0UoH8E29nMRBRNcTQ2nis5vPiF2mPTQE</recordid><startdate>1999</startdate><enddate>1999</enddate><creator>Clement, M.</creator><creator>Snell, Q.</creator><creator>Judd, G.</creator><creator>Whiting, M.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>1999</creationdate><title>High performance phylogenetic inference</title><author>Clement, M. ; Snell, Q. ; Judd, G. ; Whiting, M.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i118t-1201e709f830f680f92e76f6c7a4fb9ed0dbfa87ed70e98567c3620c1e4ae2c33</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1999</creationdate><topic>Biogeography</topic><topic>Biological information theory</topic><topic>Data analysis</topic><topic>DNA</topic><topic>Genetic mutations</topic><topic>Humans</topic><topic>Iterative algorithms</topic><topic>Phylogeny</topic><topic>Sequences</topic><topic>Systematics</topic><toplevel>online_resources</toplevel><creatorcontrib>Clement, M.</creatorcontrib><creatorcontrib>Snell, Q.</creatorcontrib><creatorcontrib>Judd, G.</creatorcontrib><creatorcontrib>Whiting, M.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Clement, M.</au><au>Snell, Q.</au><au>Judd, G.</au><au>Whiting, M.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>High performance phylogenetic inference</atitle><btitle>Proceedings of the IEEE International Symposium on High Performance Distributed Computing</btitle><stitle>HPDC</stitle><date>1999</date><risdate>1999</risdate><spage>335</spage><epage>336</epage><pages>335-336</pages><issn>1082-8907</issn><isbn>0780356810</isbn><isbn>9780780356818</isbn><abstract>Phylogenetic analysis is an integral part of many biological research programs. In essence, it is the study of gene genealogy. It is the study of gene mutation and the generational relationships. Phylogenetic analysis is being used in many diverse areas such as human epidemiology, viral transmission, biogeography, and systematics. Researchers are now commonly generating many DNA sequences from many individuals, thus creating very large data sets. However, our ability to analyze the data has not kept pace with data generation, and phylogenetics has now reached a crossroads where we cannot effectively analyze the data we generate. The chief challenge of phylogenetic systematics in the next century will be to develop algorithms and search strategies to effectively analyze large data sets. The crux of the computational problem is that the actual landscape of possible topologies can be extraordinarily difficult to evaluate with large data sets. The parsimony ratchet is actually a family of iterative tree search methods that use a statistical approach to sampling tree islands and ultimately finding the most parsimonious trees for a data set. Each iteration of the parsimony ratchet may occur in parallel as there is no direct dependency between iterations. The authors' implementation of the parallel ratchet is masterworker based. A master process is launched in the DOGMA system which then launches worker tasks on available nodes. Each worker task is simply wrapper code that is used to interact with the newest release version of NONA.</abstract><pub>IEEE</pub><doi>10.1109/HPDC.1999.805315</doi><tpages>2</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1082-8907
ispartof Proceedings of the IEEE International Symposium on High Performance Distributed Computing, 1999, p.335-336
issn 1082-8907
language eng
recordid cdi_ieee_primary_805315
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Biogeography
Biological information theory
Data analysis
DNA
Genetic mutations
Humans
Iterative algorithms
Phylogeny
Sequences
Systematics
title High performance phylogenetic inference
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T23%3A51%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=High%20performance%20phylogenetic%20inference&rft.btitle=Proceedings%20of%20the%20IEEE%20International%20Symposium%20on%20High%20Performance%20Distributed%20Computing&rft.au=Clement,%20M.&rft.date=1999&rft.spage=335&rft.epage=336&rft.pages=335-336&rft.issn=1082-8907&rft.isbn=0780356810&rft.isbn_list=9780780356818&rft_id=info:doi/10.1109/HPDC.1999.805315&rft_dat=%3Cproquest_6IE%3E26967063%3C/proquest_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=26967063&rft_id=info:pmid/&rft_ieee_id=805315&rfr_iscdi=true