Bioinformatics algorithm development for Grid environments

A Grid environment can be viewed as a virtual computing architecture that provides the ability to perform higher throughput computing by taking advantage of many computers geographically dispersed and connected by a network. Bioinformatics applications stand to gain in such a distributed environment...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of systems and software 2010-07, Vol.83 (7), p.1249-1257
Hauptverfasser: Psomopoulos, Fotis E., Mitkas, Pericles A.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1257
container_issue 7
container_start_page 1249
container_title The Journal of systems and software
container_volume 83
creator Psomopoulos, Fotis E.
Mitkas, Pericles A.
description A Grid environment can be viewed as a virtual computing architecture that provides the ability to perform higher throughput computing by taking advantage of many computers geographically dispersed and connected by a network. Bioinformatics applications stand to gain in such a distributed environment in terms of increased availability, reliability and efficiency of computational resources. There is already considerable research in progress toward applying parallel computing techniques on bioinformatics methods, such as multiple sequence alignment, gene expression analysis and phylogenetic studies. In order to cope with the dimensionality issue, most machine learning methods either focus on specific groups of proteins or reduce the size of the original data set and/or the number of attributes involved. Grid computing could potentially provide an alternative solution to this problem, by combining multiple approaches in a seamless way. In this paper we introduce a unifying methodology coupling the strengths of the Grid with the specific needs and constraints of the major bioinformatics approaches. We also present a tool that implements this process and allows researchers to assess the computational needs for a specific task and optimize the allocation of available resources for its efficient completion.
doi_str_mv 10.1016/j.jss.2010.01.051
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_896232269</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0164121210000373</els_id><sourcerecordid>2051496441</sourcerecordid><originalsourceid>FETCH-LOGICAL-c356t-3814deee1cbbfef1b34710c45dc13cde1e4288a7b67b42635321591985afa7213</originalsourceid><addsrcrecordid>eNp9kD1PwzAQhi0EEqXwA9giFqYEnx07CUxQQUGqxAKz5TgXcJTExU4r8e9xVCYGpvt639PdQ8gl0AwoyJsu60LIGI01hYwKOCILKAueAmPlMVlETR5zYKfkLISOUlowyhbk9sE6O7bOD3qyJiS6_3DeTp9D0uAee7cdcJySOE_W3jYJjnvr3Tg3wzk5aXUf8OI3Lsn70-Pb6jndvK5fVveb1HAhp5SXkDeICKauW2yh5nkB1OSiMcBNg4A5K0td1LKocya54AxEBVUpdKsLBnxJrg97t9597TBMarDBYN_rEd0uqLKSjDMmq6i8-qPs3M6P8TjFpchFQem8Dg4i410IHlu19XbQ_lsBVTNL1anIUs0sFQUVWUbP3cGD8c-9Ra-CsTgabKxHM6nG2X_cP6lVe8o</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>365457001</pqid></control><display><type>article</type><title>Bioinformatics algorithm development for Grid environments</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Psomopoulos, Fotis E. ; Mitkas, Pericles A.</creator><creatorcontrib>Psomopoulos, Fotis E. ; Mitkas, Pericles A.</creatorcontrib><description>A Grid environment can be viewed as a virtual computing architecture that provides the ability to perform higher throughput computing by taking advantage of many computers geographically dispersed and connected by a network. Bioinformatics applications stand to gain in such a distributed environment in terms of increased availability, reliability and efficiency of computational resources. There is already considerable research in progress toward applying parallel computing techniques on bioinformatics methods, such as multiple sequence alignment, gene expression analysis and phylogenetic studies. In order to cope with the dimensionality issue, most machine learning methods either focus on specific groups of proteins or reduce the size of the original data set and/or the number of attributes involved. Grid computing could potentially provide an alternative solution to this problem, by combining multiple approaches in a seamless way. In this paper we introduce a unifying methodology coupling the strengths of the Grid with the specific needs and constraints of the major bioinformatics approaches. We also present a tool that implements this process and allows researchers to assess the computational needs for a specific task and optimize the allocation of available resources for its efficient completion.</description><identifier>ISSN: 0164-1212</identifier><identifier>EISSN: 1873-1228</identifier><identifier>DOI: 10.1016/j.jss.2010.01.051</identifier><identifier>CODEN: JSSODM</identifier><language>eng</language><publisher>New York: Elsevier Inc</publisher><subject>Algorithms ; Bioinformatics ; Data analysis ; Distributed processing ; Gene expression ; Grid computing ; Protein classification ; Proteins ; Semi-automated tool ; Studies ; Systems development ; Workflow design</subject><ispartof>The Journal of systems and software, 2010-07, Vol.83 (7), p.1249-1257</ispartof><rights>2010 Elsevier Inc.</rights><rights>Copyright Elsevier Sequoia S.A. Jul 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c356t-3814deee1cbbfef1b34710c45dc13cde1e4288a7b67b42635321591985afa7213</citedby><cites>FETCH-LOGICAL-c356t-3814deee1cbbfef1b34710c45dc13cde1e4288a7b67b42635321591985afa7213</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jss.2010.01.051$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids></links><search><creatorcontrib>Psomopoulos, Fotis E.</creatorcontrib><creatorcontrib>Mitkas, Pericles A.</creatorcontrib><title>Bioinformatics algorithm development for Grid environments</title><title>The Journal of systems and software</title><description>A Grid environment can be viewed as a virtual computing architecture that provides the ability to perform higher throughput computing by taking advantage of many computers geographically dispersed and connected by a network. Bioinformatics applications stand to gain in such a distributed environment in terms of increased availability, reliability and efficiency of computational resources. There is already considerable research in progress toward applying parallel computing techniques on bioinformatics methods, such as multiple sequence alignment, gene expression analysis and phylogenetic studies. In order to cope with the dimensionality issue, most machine learning methods either focus on specific groups of proteins or reduce the size of the original data set and/or the number of attributes involved. Grid computing could potentially provide an alternative solution to this problem, by combining multiple approaches in a seamless way. In this paper we introduce a unifying methodology coupling the strengths of the Grid with the specific needs and constraints of the major bioinformatics approaches. We also present a tool that implements this process and allows researchers to assess the computational needs for a specific task and optimize the allocation of available resources for its efficient completion.</description><subject>Algorithms</subject><subject>Bioinformatics</subject><subject>Data analysis</subject><subject>Distributed processing</subject><subject>Gene expression</subject><subject>Grid computing</subject><subject>Protein classification</subject><subject>Proteins</subject><subject>Semi-automated tool</subject><subject>Studies</subject><subject>Systems development</subject><subject>Workflow design</subject><issn>0164-1212</issn><issn>1873-1228</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNp9kD1PwzAQhi0EEqXwA9giFqYEnx07CUxQQUGqxAKz5TgXcJTExU4r8e9xVCYGpvt639PdQ8gl0AwoyJsu60LIGI01hYwKOCILKAueAmPlMVlETR5zYKfkLISOUlowyhbk9sE6O7bOD3qyJiS6_3DeTp9D0uAee7cdcJySOE_W3jYJjnvr3Tg3wzk5aXUf8OI3Lsn70-Pb6jndvK5fVveb1HAhp5SXkDeICKauW2yh5nkB1OSiMcBNg4A5K0td1LKocya54AxEBVUpdKsLBnxJrg97t9597TBMarDBYN_rEd0uqLKSjDMmq6i8-qPs3M6P8TjFpchFQem8Dg4i410IHlu19XbQ_lsBVTNL1anIUs0sFQUVWUbP3cGD8c-9Ra-CsTgabKxHM6nG2X_cP6lVe8o</recordid><startdate>20100701</startdate><enddate>20100701</enddate><creator>Psomopoulos, Fotis E.</creator><creator>Mitkas, Pericles A.</creator><general>Elsevier Inc</general><general>Elsevier Sequoia S.A</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7QO</scope><scope>FR3</scope><scope>P64</scope></search><sort><creationdate>20100701</creationdate><title>Bioinformatics algorithm development for Grid environments</title><author>Psomopoulos, Fotis E. ; Mitkas, Pericles A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c356t-3814deee1cbbfef1b34710c45dc13cde1e4288a7b67b42635321591985afa7213</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Algorithms</topic><topic>Bioinformatics</topic><topic>Data analysis</topic><topic>Distributed processing</topic><topic>Gene expression</topic><topic>Grid computing</topic><topic>Protein classification</topic><topic>Proteins</topic><topic>Semi-automated tool</topic><topic>Studies</topic><topic>Systems development</topic><topic>Workflow design</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Psomopoulos, Fotis E.</creatorcontrib><creatorcontrib>Mitkas, Pericles A.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology Research Abstracts</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><jtitle>The Journal of systems and software</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Psomopoulos, Fotis E.</au><au>Mitkas, Pericles A.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Bioinformatics algorithm development for Grid environments</atitle><jtitle>The Journal of systems and software</jtitle><date>2010-07-01</date><risdate>2010</risdate><volume>83</volume><issue>7</issue><spage>1249</spage><epage>1257</epage><pages>1249-1257</pages><issn>0164-1212</issn><eissn>1873-1228</eissn><coden>JSSODM</coden><abstract>A Grid environment can be viewed as a virtual computing architecture that provides the ability to perform higher throughput computing by taking advantage of many computers geographically dispersed and connected by a network. Bioinformatics applications stand to gain in such a distributed environment in terms of increased availability, reliability and efficiency of computational resources. There is already considerable research in progress toward applying parallel computing techniques on bioinformatics methods, such as multiple sequence alignment, gene expression analysis and phylogenetic studies. In order to cope with the dimensionality issue, most machine learning methods either focus on specific groups of proteins or reduce the size of the original data set and/or the number of attributes involved. Grid computing could potentially provide an alternative solution to this problem, by combining multiple approaches in a seamless way. In this paper we introduce a unifying methodology coupling the strengths of the Grid with the specific needs and constraints of the major bioinformatics approaches. We also present a tool that implements this process and allows researchers to assess the computational needs for a specific task and optimize the allocation of available resources for its efficient completion.</abstract><cop>New York</cop><pub>Elsevier Inc</pub><doi>10.1016/j.jss.2010.01.051</doi><tpages>9</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0164-1212
ispartof The Journal of systems and software, 2010-07, Vol.83 (7), p.1249-1257
issn 0164-1212
1873-1228
language eng
recordid cdi_proquest_miscellaneous_896232269
source Elsevier ScienceDirect Journals Complete
subjects Algorithms
Bioinformatics
Data analysis
Distributed processing
Gene expression
Grid computing
Protein classification
Proteins
Semi-automated tool
Studies
Systems development
Workflow design
title Bioinformatics algorithm development for Grid environments
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T04%3A40%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Bioinformatics%20algorithm%20development%20for%20Grid%20environments&rft.jtitle=The%20Journal%20of%20systems%20and%20software&rft.au=Psomopoulos,%20Fotis%20E.&rft.date=2010-07-01&rft.volume=83&rft.issue=7&rft.spage=1249&rft.epage=1257&rft.pages=1249-1257&rft.issn=0164-1212&rft.eissn=1873-1228&rft.coden=JSSODM&rft_id=info:doi/10.1016/j.jss.2010.01.051&rft_dat=%3Cproquest_cross%3E2051496441%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=365457001&rft_id=info:pmid/&rft_els_id=S0164121210000373&rfr_iscdi=true