Biopipe: a flexible framework for protocol-based bioinformatics analysis

We identify several challenges facing bioinformatics analysis today. Firstly, to fulfill the promise of comparative studies, bioinformatics analysis will need to accommodate different sources of data residing in a federation of databases that, in turn, come in different formats and modes of accessib...

Bibliographic details
Published in: Genome research 2003-08, Vol.13 (8), p.1904-1915
Main authors: Hoon, Shawn; Ratnapu, Kiran Kumar; Chia, Jer-Ming; Kumarasamy, Balamurugan; Juguang, Xiao; Clamp, Michele; Stabenau, Arne; Potter, Simon; Clarke, Laura; Stupka, Elia
Format: Article
Language: eng
Online access: Full text
container_end_page 1915
container_issue 8
container_start_page 1904
container_title Genome research
container_volume 13
creator Hoon, Shawn
Ratnapu, Kiran Kumar
Chia, Jer-Ming
Kumarasamy, Balamurugan
Juguang, Xiao
Clamp, Michele
Stabenau, Arne
Potter, Simon
Clarke, Laura
Stupka, Elia
description We identify several challenges facing bioinformatics analysis today. Firstly, to fulfill the promise of comparative studies, bioinformatics analysis will need to accommodate different sources of data residing in a federation of databases that, in turn, come in different formats and modes of accessibility. Secondly, the tsunami of data to be handled will require robust systems that enable bioinformatics analysis to be carried out in a parallel fashion. Thirdly, the ever-evolving state of bioinformatics presents new algorithms and paradigms in conducting analysis. This means that any bioinformatics framework must be flexible and generic enough to accommodate such changes. In addition, we identify the need for introducing an explicit protocol-based approach to bioinformatics analysis that will lend rigorousness to the analysis. This makes it easier for experimentation and replication of results by external parties. Biopipe is designed in an effort to meet these goals. It aims to allow researchers to focus on protocol design. At the same time, it is designed to work over a compute farm and thus provides high-throughput performance. A common exchange format that encapsulates the entire protocol in terms of the analysis modules, parameters, and data versions has been developed to provide a powerful way in which to distribute and reproduce results. This will enable researchers to discuss and interpret the data better as the once implicit assumptions are now explicitly defined within the Biopipe framework.
doi_str_mv 10.1101/gr.1363103
format Article
pmid 12869579
publisher United States: Cold Spring Harbor Laboratory Press
rights Copyright © 2003, Cold Spring Harbor Laboratory Press
fulltext fulltext
identifier ISSN: 1088-9051
ispartof Genome research, 2003-08, Vol.13 (8), p.1904-1915
issn 1088-9051
1054-9803
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_403782
source MEDLINE; PubMed Central; Alma/SFX Local Collection
subjects Amino Acid Sequence
Animals
Computational Biology - methods
Databases, Protein
Drosophila Proteins - genetics
Humans
Methods
Phylogeny
Proteins - genetics
Research Design - trends
Software
Software Design
Takifugu - genetics
title Biopipe: a flexible framework for protocol-based bioinformatics analysis
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T11%3A01%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Biopipe:%20a%20flexible%20framework%20for%20protocol-based%20bioinformatics%20analysis&rft.jtitle=Genome%20research&rft.au=Hoon,%20Shawn&rft.date=2003-08-01&rft.volume=13&rft.issue=8&rft.spage=1904&rft.epage=1915&rft.pages=1904-1915&rft.issn=1088-9051&rft_id=info:doi/10.1101/gr.1363103&rft_dat=%3Cproquest_pubme%3E18829995%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=18829995&rft_id=info:pmid/12869579&rfr_iscdi=true