Accelerating Multiple Sequence Alignment with the Cell BE Processor

The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorizatio...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computer journal 2010-07, Vol.53 (6), p.814-826
Hauptverfasser: Vandierendonck, Hans, Rul, Sean, De Bosschere, Koen
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 826
container_issue 6
container_start_page 814
container_title Computer journal
container_volume 53
creator Vandierendonck, Hans
Rul, Sean
De Bosschere, Koen
description The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT]
doi_str_mv 10.1093/comjnl/bxp086
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_753678845</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2097760681</sourcerecordid><originalsourceid>FETCH-LOGICAL-c296t-2052227b775f5d3c0fc17ec9ea3cfbee198bd78b01683d11d51f7c496f46fcd43</originalsourceid><addsrcrecordid>eNpdkDtPwzAURi0EEqUwslssTKHXdmI7Y4nKQyoCCZijxLluUzkP7ETAvycoTEzfcvTp6BByyeCGQSpWpmsOrVuVXz1oeUQWLJYQcZDqmCwAGESx5HBKzkI4AACHVC5ItjYGHfpiqNsdfRrdUPcO6St-jNgapGtX79oG24F-1sOeDnukGTpHbzf0xXcGQ-j8OTmxhQt48bdL8n63ecseou3z_WO23kaGp3KYTBLOuSqVSmxSCQPWMIUmxUIYWyKyVJeV0iUwqUXFWJUwq0ycShtLa6pYLMn1_Nv7btILQ97UYbJ3RYvdGHKVCKm0jpOJvPpHHrrRt5NcrkScaMW4nqBohozvQvBo897XTeG_cwb5b9B8DprPQcUP-uVqpA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>734587128</pqid></control><display><type>article</type><title>Accelerating Multiple Sequence Alignment with the Cell BE Processor</title><source>Oxford University Press Journals All Titles (1996-Current)</source><creator>Vandierendonck, Hans ; Rul, Sean ; De Bosschere, Koen</creator><creatorcontrib>Vandierendonck, Hans ; Rul, Sean ; De Bosschere, Koen</creatorcontrib><description>The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT]</description><identifier>ISSN: 0010-4620</identifier><identifier>EISSN: 1460-2067</identifier><identifier>DOI: 10.1093/comjnl/bxp086</identifier><identifier>CODEN: CMPJAG</identifier><language>eng</language><publisher>Oxford: Oxford Publishing Limited (England)</publisher><subject>Alignment ; Architecture ; Array processors ; Blades ; Boundary element method ; Broadband ; Complexity ; Computation ; Computer architecture ; Engines ; Mailboxes ; Mathematical analysis ; Microprocessors ; Parallel processing ; Programming ; Running ; Vector processing (computers) ; Workload</subject><ispartof>Computer journal, 2010-07, Vol.53 (6), p.814-826</ispartof><rights>Copyright Oxford Publishing Limited(England) Jul 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c296t-2052227b775f5d3c0fc17ec9ea3cfbee198bd78b01683d11d51f7c496f46fcd43</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>315,781,785,27926,27927</link.rule.ids></links><search><creatorcontrib>Vandierendonck, Hans</creatorcontrib><creatorcontrib>Rul, Sean</creatorcontrib><creatorcontrib>De Bosschere, Koen</creatorcontrib><title>Accelerating Multiple Sequence Alignment with the Cell BE Processor</title><title>Computer journal</title><description>The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT]</description><subject>Alignment</subject><subject>Architecture</subject><subject>Array processors</subject><subject>Blades</subject><subject>Boundary element method</subject><subject>Broadband</subject><subject>Complexity</subject><subject>Computation</subject><subject>Computer architecture</subject><subject>Engines</subject><subject>Mailboxes</subject><subject>Mathematical analysis</subject><subject>Microprocessors</subject><subject>Parallel processing</subject><subject>Programming</subject><subject>Running</subject><subject>Vector processing (computers)</subject><subject>Workload</subject><issn>0010-4620</issn><issn>1460-2067</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNpdkDtPwzAURi0EEqUwslssTKHXdmI7Y4nKQyoCCZijxLluUzkP7ETAvycoTEzfcvTp6BByyeCGQSpWpmsOrVuVXz1oeUQWLJYQcZDqmCwAGESx5HBKzkI4AACHVC5ItjYGHfpiqNsdfRrdUPcO6St-jNgapGtX79oG24F-1sOeDnukGTpHbzf0xXcGQ-j8OTmxhQt48bdL8n63ecseou3z_WO23kaGp3KYTBLOuSqVSmxSCQPWMIUmxUIYWyKyVJeV0iUwqUXFWJUwq0ycShtLa6pYLMn1_Nv7btILQ97UYbJ3RYvdGHKVCKm0jpOJvPpHHrrRt5NcrkScaMW4nqBohozvQvBo897XTeG_cwb5b9B8DprPQcUP-uVqpA</recordid><startdate>20100701</startdate><enddate>20100701</enddate><creator>Vandierendonck, Hans</creator><creator>Rul, Sean</creator><creator>De Bosschere, Koen</creator><general>Oxford Publishing Limited (England)</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20100701</creationdate><title>Accelerating Multiple Sequence Alignment with the Cell BE Processor</title><author>Vandierendonck, Hans ; Rul, Sean ; De Bosschere, Koen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c296t-2052227b775f5d3c0fc17ec9ea3cfbee198bd78b01683d11d51f7c496f46fcd43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Alignment</topic><topic>Architecture</topic><topic>Array processors</topic><topic>Blades</topic><topic>Boundary element method</topic><topic>Broadband</topic><topic>Complexity</topic><topic>Computation</topic><topic>Computer architecture</topic><topic>Engines</topic><topic>Mailboxes</topic><topic>Mathematical analysis</topic><topic>Microprocessors</topic><topic>Parallel processing</topic><topic>Programming</topic><topic>Running</topic><topic>Vector processing (computers)</topic><topic>Workload</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Vandierendonck, Hans</creatorcontrib><creatorcontrib>Rul, Sean</creatorcontrib><creatorcontrib>De Bosschere, Koen</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Computer journal</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Vandierendonck, Hans</au><au>Rul, Sean</au><au>De Bosschere, Koen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Accelerating Multiple Sequence Alignment with the Cell BE Processor</atitle><jtitle>Computer journal</jtitle><date>2010-07-01</date><risdate>2010</risdate><volume>53</volume><issue>6</issue><spage>814</spage><epage>826</epage><pages>814-826</pages><issn>0010-4620</issn><eissn>1460-2067</eissn><coden>CMPJAG</coden><abstract>The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT]</abstract><cop>Oxford</cop><pub>Oxford Publishing Limited (England)</pub><doi>10.1093/comjnl/bxp086</doi><tpages>13</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0010-4620
ispartof Computer journal, 2010-07, Vol.53 (6), p.814-826
issn 0010-4620
1460-2067
language eng
recordid cdi_proquest_miscellaneous_753678845
source Oxford University Press Journals All Titles (1996-Current)
subjects Alignment
Architecture
Array processors
Blades
Boundary element method
Broadband
Complexity
Computation
Computer architecture
Engines
Mailboxes
Mathematical analysis
Microprocessors
Parallel processing
Programming
Running
Vector processing (computers)
Workload
title Accelerating Multiple Sequence Alignment with the Cell BE Processor
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T11%3A02%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Accelerating%20Multiple%20Sequence%20Alignment%20with%20the%20Cell%20BE%20Processor&rft.jtitle=Computer%20journal&rft.au=Vandierendonck,%20Hans&rft.date=2010-07-01&rft.volume=53&rft.issue=6&rft.spage=814&rft.epage=826&rft.pages=814-826&rft.issn=0010-4620&rft.eissn=1460-2067&rft.coden=CMPJAG&rft_id=info:doi/10.1093/comjnl/bxp086&rft_dat=%3Cproquest_cross%3E2097760681%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=734587128&rft_id=info:pmid/&rfr_iscdi=true