Accelerating Multiple Sequence Alignment with the Cell BE Processor

The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorizatio...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Computer journal 2010-07, Vol.53 (6), p.814-826
Hauptverfasser:	Vandierendonck, Hans, Rul, Sean, De Bosschere, Koen
Format:	Artikel
Sprache:	eng
Schlagworte:	Alignment Architecture Array processors Blades Boundary element method Broadband Complexity Computation Computer architecture Engines Mailboxes Mathematical analysis Microprocessors Parallel processing Programming Running Vector processing (computers) Workload
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	826
container_issue	6
container_start_page	814
container_title	Computer journal
container_volume	53
creator	Vandierendonck, Hans Rul, Sean De Bosschere, Koen
description	The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT]
doi_str_mv	10.1093/comjnl/bxp086
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_753678845</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2097760681</sourcerecordid><originalsourceid>FETCH-LOGICAL-c296t-2052227b775f5d3c0fc17ec9ea3cfbee198bd78b01683d11d51f7c496f46fcd43</originalsourceid><addsrcrecordid>eNpdkDtPwzAURi0EEqUwslssTKHXdmI7Y4nKQyoCCZijxLluUzkP7ETAvycoTEzfcvTp6BByyeCGQSpWpmsOrVuVXz1oeUQWLJYQcZDqmCwAGESx5HBKzkI4AACHVC5ItjYGHfpiqNsdfRrdUPcO6St-jNgapGtX79oG24F-1sOeDnukGTpHbzf0xXcGQ-j8OTmxhQt48bdL8n63ecseou3z_WO23kaGp3KYTBLOuSqVSmxSCQPWMIUmxUIYWyKyVJeV0iUwqUXFWJUwq0ycShtLa6pYLMn1_Nv7btILQ97UYbJ3RYvdGHKVCKm0jpOJvPpHHrrRt5NcrkScaMW4nqBohozvQvBo897XTeG_cwb5b9B8DprPQcUP-uVqpA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>734587128</pqid></control><display><type>article</type><title>Accelerating Multiple Sequence Alignment with the Cell BE Processor</title><source>Oxford University Press Journals All Titles (1996-Current)</source><creator>Vandierendonck, Hans ; Rul, Sean ; De Bosschere, Koen</creator><creatorcontrib>Vandierendonck, Hans ; Rul, Sean ; De Bosschere, Koen</creatorcontrib><description>The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT]</description><identifier>ISSN: 0010-4620</identifier><identifier>EISSN: 1460-2067</identifier><identifier>DOI: 10.1093/comjnl/bxp086</identifier><identifier>CODEN: CMPJAG</identifier><language>eng</language><publisher>Oxford: Oxford Publishing Limited (England)</publisher><subject>Alignment ; Architecture ; Array processors ; Blades ; Boundary element method ; Broadband ; Complexity ; Computation ; Computer architecture ; Engines ; Mailboxes ; Mathematical analysis ; Microprocessors ; Parallel processing ; Programming ; Running ; Vector processing (computers) ; Workload</subject><ispartof>Computer journal, 2010-07, Vol.53 (6), p.814-826</ispartof><rights>Copyright Oxford Publishing Limited(England) Jul 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c296t-2052227b775f5d3c0fc17ec9ea3cfbee198bd78b01683d11d51f7c496f46fcd43</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>315,781,785,27926,27927</link.rule.ids></links><search><creatorcontrib>Vandierendonck, Hans</creatorcontrib><creatorcontrib>Rul, Sean</creatorcontrib><creatorcontrib>De Bosschere, Koen</creatorcontrib><title>Accelerating Multiple Sequence Alignment with the Cell BE Processor</title><title>Computer journal</title><description>The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT]</description><subject>Alignment</subject><subject>Architecture</subject><subject>Array processors</subject><subject>Blades</subject><subject>Boundary element method</subject><subject>Broadband</subject><subject>Complexity</subject><subject>Computation</subject><subject>Computer architecture</subject><subject>Engines</subject><subject>Mailboxes</subject><subject>Mathematical analysis</subject><subject>Microprocessors</subject><subject>Parallel processing</subject><subject>Programming</subject><subject>Running</subject><subject>Vector processing (computers)</subject><subject>Workload</subject><issn>0010-4620</issn><issn>1460-2067</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNpdkDtPwzAURi0EEqUwslssTKHXdmI7Y4nKQyoCCZijxLluUzkP7ETAvycoTEzfcvTp6BByyeCGQSpWpmsOrVuVXz1oeUQWLJYQcZDqmCwAGESx5HBKzkI4AACHVC5ItjYGHfpiqNsdfRrdUPcO6St-jNgapGtX79oG24F-1sOeDnukGTpHbzf0xXcGQ-j8OTmxhQt48bdL8n63ecseou3z_WO23kaGp3KYTBLOuSqVSmxSCQPWMIUmxUIYWyKyVJeV0iUwqUXFWJUwq0ycShtLa6pYLMn1_Nv7btILQ97UYbJ3RYvdGHKVCKm0jpOJvPpHHrrRt5NcrkScaMW4nqBohozvQvBo897XTeG_cwb5b9B8DprPQcUP-uVqpA</recordid><startdate>20100701</startdate><enddate>20100701</enddate><creator>Vandierendonck, Hans</creator><creator>Rul, Sean</creator><creator>De Bosschere, Koen</creator><general>Oxford Publishing Limited (England)</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20100701</creationdate><title>Accelerating Multiple Sequence Alignment with the Cell BE Processor</title><author>Vandierendonck, Hans ; Rul, Sean ; De Bosschere, Koen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c296t-2052227b775f5d3c0fc17ec9ea3cfbee198bd78b01683d11d51f7c496f46fcd43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Alignment</topic><topic>Architecture</topic><topic>Array processors</topic><topic>Blades</topic><topic>Boundary element method</topic><topic>Broadband</topic><topic>Complexity</topic><topic>Computation</topic><topic>Computer architecture</topic><topic>Engines</topic><topic>Mailboxes</topic><topic>Mathematical analysis</topic><topic>Microprocessors</topic><topic>Parallel processing</topic><topic>Programming</topic><topic>Running</topic><topic>Vector processing (computers)</topic><topic>Workload</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Vandierendonck, Hans</creatorcontrib><creatorcontrib>Rul, Sean</creatorcontrib><creatorcontrib>De Bosschere, Koen</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Computer journal</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Vandierendonck, Hans</au><au>Rul, Sean</au><au>De Bosschere, Koen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Accelerating Multiple Sequence Alignment with the Cell BE Processor</atitle><jtitle>Computer journal</jtitle><date>2010-07-01</date><risdate>2010</risdate><volume>53</volume><issue>6</issue><spage>814</spage><epage>826</epage><pages>814-826</pages><issn>0010-4620</issn><eissn>1460-2067</eissn><coden>CMPJAG</coden><abstract>The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT]</abstract><cop>Oxford</cop><pub>Oxford Publishing Limited (England)</pub><doi>10.1093/comjnl/bxp086</doi><tpages>13</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 0010-4620
ispartof	Computer journal, 2010-07, Vol.53 (6), p.814-826
issn	0010-4620 1460-2067
language	eng
recordid	cdi_proquest_miscellaneous_753678845
source	Oxford University Press Journals All Titles (1996-Current)
subjects	Alignment Architecture Array processors Blades Boundary element method Broadband Complexity Computation Computer architecture Engines Mailboxes Mathematical analysis Microprocessors Parallel processing Programming Running Vector processing (computers) Workload
title	Accelerating Multiple Sequence Alignment with the Cell BE Processor
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T11%3A02%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Accelerating%20Multiple%20Sequence%20Alignment%20with%20the%20Cell%20BE%20Processor&rft.jtitle=Computer%20journal&rft.au=Vandierendonck,%20Hans&rft.date=2010-07-01&rft.volume=53&rft.issue=6&rft.spage=814&rft.epage=826&rft.pages=814-826&rft.issn=0010-4620&rft.eissn=1460-2067&rft.coden=CMPJAG&rft_id=info:doi/10.1093/comjnl/bxp086&rft_dat=%3Cproquest_cross%3E2097760681%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=734587128&rft_id=info:pmid/&rfr_iscdi=true