Bioinformatics algorithm development for Grid environments

A Grid environment can be viewed as a virtual computing architecture that provides the ability to perform higher throughput computing by taking advantage of many computers geographically dispersed and connected by a network. Bioinformatics applications stand to gain in such a distributed environment...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	The Journal of systems and software 2010-07, Vol.83 (7), p.1249-1257
Hauptverfasser:	Psomopoulos, Fotis E., Mitkas, Pericles A.
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Bioinformatics Data analysis Distributed processing Gene expression Grid computing Protein classification Proteins Semi-automated tool Studies Systems development Workflow design
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1257
container_issue	7
container_start_page	1249
container_title	The Journal of systems and software
container_volume	83
creator	Psomopoulos, Fotis E. Mitkas, Pericles A.
description	A Grid environment can be viewed as a virtual computing architecture that provides the ability to perform higher throughput computing by taking advantage of many computers geographically dispersed and connected by a network. Bioinformatics applications stand to gain in such a distributed environment in terms of increased availability, reliability and efficiency of computational resources. There is already considerable research in progress toward applying parallel computing techniques on bioinformatics methods, such as multiple sequence alignment, gene expression analysis and phylogenetic studies. In order to cope with the dimensionality issue, most machine learning methods either focus on specific groups of proteins or reduce the size of the original data set and/or the number of attributes involved. Grid computing could potentially provide an alternative solution to this problem, by combining multiple approaches in a seamless way. In this paper we introduce a unifying methodology coupling the strengths of the Grid with the specific needs and constraints of the major bioinformatics approaches. We also present a tool that implements this process and allows researchers to assess the computational needs for a specific task and optimize the allocation of available resources for its efficient completion.
doi_str_mv	10.1016/j.jss.2010.01.051
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_896232269</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0164121210000373</els_id><sourcerecordid>2051496441</sourcerecordid><originalsourceid>FETCH-LOGICAL-c356t-3814deee1cbbfef1b34710c45dc13cde1e4288a7b67b42635321591985afa7213</originalsourceid><addsrcrecordid>eNp9kD1PwzAQhi0EEqXwA9giFqYEnx07CUxQQUGqxAKz5TgXcJTExU4r8e9xVCYGpvt639PdQ8gl0AwoyJsu60LIGI01hYwKOCILKAueAmPlMVlETR5zYKfkLISOUlowyhbk9sE6O7bOD3qyJiS6_3DeTp9D0uAee7cdcJySOE_W3jYJjnvr3Tg3wzk5aXUf8OI3Lsn70-Pb6jndvK5fVveb1HAhp5SXkDeICKauW2yh5nkB1OSiMcBNg4A5K0td1LKocya54AxEBVUpdKsLBnxJrg97t9597TBMarDBYN_rEd0uqLKSjDMmq6i8-qPs3M6P8TjFpchFQem8Dg4i410IHlu19XbQ_lsBVTNL1anIUs0sFQUVWUbP3cGD8c-9Ra-CsTgabKxHM6nG2X_cP6lVe8o</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>365457001</pqid></control><display><type>article</type><title>Bioinformatics algorithm development for Grid environments</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Psomopoulos, Fotis E. ; Mitkas, Pericles A.</creator><creatorcontrib>Psomopoulos, Fotis E. ; Mitkas, Pericles A.</creatorcontrib><description>A Grid environment can be viewed as a virtual computing architecture that provides the ability to perform higher throughput computing by taking advantage of many computers geographically dispersed and connected by a network. Bioinformatics applications stand to gain in such a distributed environment in terms of increased availability, reliability and efficiency of computational resources. There is already considerable research in progress toward applying parallel computing techniques on bioinformatics methods, such as multiple sequence alignment, gene expression analysis and phylogenetic studies. In order to cope with the dimensionality issue, most machine learning methods either focus on specific groups of proteins or reduce the size of the original data set and/or the number of attributes involved. Grid computing could potentially provide an alternative solution to this problem, by combining multiple approaches in a seamless way. In this paper we introduce a unifying methodology coupling the strengths of the Grid with the specific needs and constraints of the major bioinformatics approaches. We also present a tool that implements this process and allows researchers to assess the computational needs for a specific task and optimize the allocation of available resources for its efficient completion.</description><identifier>ISSN: 0164-1212</identifier><identifier>EISSN: 1873-1228</identifier><identifier>DOI: 10.1016/j.jss.2010.01.051</identifier><identifier>CODEN: JSSODM</identifier><language>eng</language><publisher>New York: Elsevier Inc</publisher><subject>Algorithms ; Bioinformatics ; Data analysis ; Distributed processing ; Gene expression ; Grid computing ; Protein classification ; Proteins ; Semi-automated tool ; Studies ; Systems development ; Workflow design</subject><ispartof>The Journal of systems and software, 2010-07, Vol.83 (7), p.1249-1257</ispartof><rights>2010 Elsevier Inc.</rights><rights>Copyright Elsevier Sequoia S.A. Jul 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c356t-3814deee1cbbfef1b34710c45dc13cde1e4288a7b67b42635321591985afa7213</citedby><cites>FETCH-LOGICAL-c356t-3814deee1cbbfef1b34710c45dc13cde1e4288a7b67b42635321591985afa7213</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jss.2010.01.051$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids></links><search><creatorcontrib>Psomopoulos, Fotis E.</creatorcontrib><creatorcontrib>Mitkas, Pericles A.</creatorcontrib><title>Bioinformatics algorithm development for Grid environments</title><title>The Journal of systems and software</title><description>A Grid environment can be viewed as a virtual computing architecture that provides the ability to perform higher throughput computing by taking advantage of many computers geographically dispersed and connected by a network. Bioinformatics applications stand to gain in such a distributed environment in terms of increased availability, reliability and efficiency of computational resources. There is already considerable research in progress toward applying parallel computing techniques on bioinformatics methods, such as multiple sequence alignment, gene expression analysis and phylogenetic studies. In order to cope with the dimensionality issue, most machine learning methods either focus on specific groups of proteins or reduce the size of the original data set and/or the number of attributes involved. Grid computing could potentially provide an alternative solution to this problem, by combining multiple approaches in a seamless way. In this paper we introduce a unifying methodology coupling the strengths of the Grid with the specific needs and constraints of the major bioinformatics approaches. We also present a tool that implements this process and allows researchers to assess the computational needs for a specific task and optimize the allocation of available resources for its efficient completion.</description><subject>Algorithms</subject><subject>Bioinformatics</subject><subject>Data analysis</subject><subject>Distributed processing</subject><subject>Gene expression</subject><subject>Grid computing</subject><subject>Protein classification</subject><subject>Proteins</subject><subject>Semi-automated tool</subject><subject>Studies</subject><subject>Systems development</subject><subject>Workflow design</subject><issn>0164-1212</issn><issn>1873-1228</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNp9kD1PwzAQhi0EEqXwA9giFqYEnx07CUxQQUGqxAKz5TgXcJTExU4r8e9xVCYGpvt639PdQ8gl0AwoyJsu60LIGI01hYwKOCILKAueAmPlMVlETR5zYKfkLISOUlowyhbk9sE6O7bOD3qyJiS6_3DeTp9D0uAee7cdcJySOE_W3jYJjnvr3Tg3wzk5aXUf8OI3Lsn70-Pb6jndvK5fVveb1HAhp5SXkDeICKauW2yh5nkB1OSiMcBNg4A5K0td1LKocya54AxEBVUpdKsLBnxJrg97t9597TBMarDBYN_rEd0uqLKSjDMmq6i8-qPs3M6P8TjFpchFQem8Dg4i410IHlu19XbQ_lsBVTNL1anIUs0sFQUVWUbP3cGD8c-9Ra-CsTgabKxHM6nG2X_cP6lVe8o</recordid><startdate>20100701</startdate><enddate>20100701</enddate><creator>Psomopoulos, Fotis E.</creator><creator>Mitkas, Pericles A.</creator><general>Elsevier Inc</general><general>Elsevier Sequoia S.A</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7QO</scope><scope>FR3</scope><scope>P64</scope></search><sort><creationdate>20100701</creationdate><title>Bioinformatics algorithm development for Grid environments</title><author>Psomopoulos, Fotis E. ; Mitkas, Pericles A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c356t-3814deee1cbbfef1b34710c45dc13cde1e4288a7b67b42635321591985afa7213</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Algorithms</topic><topic>Bioinformatics</topic><topic>Data analysis</topic><topic>Distributed processing</topic><topic>Gene expression</topic><topic>Grid computing</topic><topic>Protein classification</topic><topic>Proteins</topic><topic>Semi-automated tool</topic><topic>Studies</topic><topic>Systems development</topic><topic>Workflow design</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Psomopoulos, Fotis E.</creatorcontrib><creatorcontrib>Mitkas, Pericles A.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology Research Abstracts</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><jtitle>The Journal of systems and software</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Psomopoulos, Fotis E.</au><au>Mitkas, Pericles A.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Bioinformatics algorithm development for Grid environments</atitle><jtitle>The Journal of systems and software</jtitle><date>2010-07-01</date><risdate>2010</risdate><volume>83</volume><issue>7</issue><spage>1249</spage><epage>1257</epage><pages>1249-1257</pages><issn>0164-1212</issn><eissn>1873-1228</eissn><coden>JSSODM</coden><abstract>A Grid environment can be viewed as a virtual computing architecture that provides the ability to perform higher throughput computing by taking advantage of many computers geographically dispersed and connected by a network. Bioinformatics applications stand to gain in such a distributed environment in terms of increased availability, reliability and efficiency of computational resources. There is already considerable research in progress toward applying parallel computing techniques on bioinformatics methods, such as multiple sequence alignment, gene expression analysis and phylogenetic studies. In order to cope with the dimensionality issue, most machine learning methods either focus on specific groups of proteins or reduce the size of the original data set and/or the number of attributes involved. Grid computing could potentially provide an alternative solution to this problem, by combining multiple approaches in a seamless way. In this paper we introduce a unifying methodology coupling the strengths of the Grid with the specific needs and constraints of the major bioinformatics approaches. We also present a tool that implements this process and allows researchers to assess the computational needs for a specific task and optimize the allocation of available resources for its efficient completion.</abstract><cop>New York</cop><pub>Elsevier Inc</pub><doi>10.1016/j.jss.2010.01.051</doi><tpages>9</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 0164-1212
ispartof	The Journal of systems and software, 2010-07, Vol.83 (7), p.1249-1257
issn	0164-1212 1873-1228
language	eng
recordid	cdi_proquest_miscellaneous_896232269
source	Elsevier ScienceDirect Journals Complete
subjects	Algorithms Bioinformatics Data analysis Distributed processing Gene expression Grid computing Protein classification Proteins Semi-automated tool Studies Systems development Workflow design
title	Bioinformatics algorithm development for Grid environments
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T04%3A40%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Bioinformatics%20algorithm%20development%20for%20Grid%20environments&rft.jtitle=The%20Journal%20of%20systems%20and%20software&rft.au=Psomopoulos,%20Fotis%20E.&rft.date=2010-07-01&rft.volume=83&rft.issue=7&rft.spage=1249&rft.epage=1257&rft.pages=1249-1257&rft.issn=0164-1212&rft.eissn=1873-1228&rft.coden=JSSODM&rft_id=info:doi/10.1016/j.jss.2010.01.051&rft_dat=%3Cproquest_cross%3E2051496441%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=365457001&rft_id=info:pmid/&rft_els_id=S0164121210000373&rfr_iscdi=true