Accelerating Multiple Sequence Alignment with the Cell BE Processor
The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorizatio...
Gespeichert in:
Veröffentlicht in: | Computer journal 2010-07, Vol.53 (6), p.814-826 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 826 |
---|---|
container_issue | 6 |
container_start_page | 814 |
container_title | Computer journal |
container_volume | 53 |
creator | Vandierendonck, Hans Rul, Sean De Bosschere, Koen |
description | The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT] |
doi_str_mv | 10.1093/comjnl/bxp086 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_753678845</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2097760681</sourcerecordid><originalsourceid>FETCH-LOGICAL-c296t-2052227b775f5d3c0fc17ec9ea3cfbee198bd78b01683d11d51f7c496f46fcd43</originalsourceid><addsrcrecordid>eNpdkDtPwzAURi0EEqUwslssTKHXdmI7Y4nKQyoCCZijxLluUzkP7ETAvycoTEzfcvTp6BByyeCGQSpWpmsOrVuVXz1oeUQWLJYQcZDqmCwAGESx5HBKzkI4AACHVC5ItjYGHfpiqNsdfRrdUPcO6St-jNgapGtX79oG24F-1sOeDnukGTpHbzf0xXcGQ-j8OTmxhQt48bdL8n63ecseou3z_WO23kaGp3KYTBLOuSqVSmxSCQPWMIUmxUIYWyKyVJeV0iUwqUXFWJUwq0ycShtLa6pYLMn1_Nv7btILQ97UYbJ3RYvdGHKVCKm0jpOJvPpHHrrRt5NcrkScaMW4nqBohozvQvBo897XTeG_cwb5b9B8DprPQcUP-uVqpA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>734587128</pqid></control><display><type>article</type><title>Accelerating Multiple Sequence Alignment with the Cell BE Processor</title><source>Oxford University Press Journals All Titles (1996-Current)</source><creator>Vandierendonck, Hans ; Rul, Sean ; De Bosschere, Koen</creator><creatorcontrib>Vandierendonck, Hans ; Rul, Sean ; De Bosschere, Koen</creatorcontrib><description>The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT]</description><identifier>ISSN: 0010-4620</identifier><identifier>EISSN: 1460-2067</identifier><identifier>DOI: 10.1093/comjnl/bxp086</identifier><identifier>CODEN: CMPJAG</identifier><language>eng</language><publisher>Oxford: Oxford Publishing Limited (England)</publisher><subject>Alignment ; Architecture ; Array processors ; Blades ; Boundary element method ; Broadband ; Complexity ; Computation ; Computer architecture ; Engines ; Mailboxes ; Mathematical analysis ; Microprocessors ; Parallel processing ; Programming ; Running ; Vector processing (computers) ; Workload</subject><ispartof>Computer journal, 2010-07, Vol.53 (6), p.814-826</ispartof><rights>Copyright Oxford Publishing Limited(England) Jul 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c296t-2052227b775f5d3c0fc17ec9ea3cfbee198bd78b01683d11d51f7c496f46fcd43</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>315,781,785,27926,27927</link.rule.ids></links><search><creatorcontrib>Vandierendonck, Hans</creatorcontrib><creatorcontrib>Rul, Sean</creatorcontrib><creatorcontrib>De Bosschere, Koen</creatorcontrib><title>Accelerating Multiple Sequence Alignment with the Cell BE Processor</title><title>Computer journal</title><description>The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT]</description><subject>Alignment</subject><subject>Architecture</subject><subject>Array processors</subject><subject>Blades</subject><subject>Boundary element method</subject><subject>Broadband</subject><subject>Complexity</subject><subject>Computation</subject><subject>Computer architecture</subject><subject>Engines</subject><subject>Mailboxes</subject><subject>Mathematical analysis</subject><subject>Microprocessors</subject><subject>Parallel processing</subject><subject>Programming</subject><subject>Running</subject><subject>Vector processing (computers)</subject><subject>Workload</subject><issn>0010-4620</issn><issn>1460-2067</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNpdkDtPwzAURi0EEqUwslssTKHXdmI7Y4nKQyoCCZijxLluUzkP7ETAvycoTEzfcvTp6BByyeCGQSpWpmsOrVuVXz1oeUQWLJYQcZDqmCwAGESx5HBKzkI4AACHVC5ItjYGHfpiqNsdfRrdUPcO6St-jNgapGtX79oG24F-1sOeDnukGTpHbzf0xXcGQ-j8OTmxhQt48bdL8n63ecseou3z_WO23kaGp3KYTBLOuSqVSmxSCQPWMIUmxUIYWyKyVJeV0iUwqUXFWJUwq0ycShtLa6pYLMn1_Nv7btILQ97UYbJ3RYvdGHKVCKm0jpOJvPpHHrrRt5NcrkScaMW4nqBohozvQvBo897XTeG_cwb5b9B8DprPQcUP-uVqpA</recordid><startdate>20100701</startdate><enddate>20100701</enddate><creator>Vandierendonck, Hans</creator><creator>Rul, Sean</creator><creator>De Bosschere, Koen</creator><general>Oxford Publishing Limited (England)</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20100701</creationdate><title>Accelerating Multiple Sequence Alignment with the Cell BE Processor</title><author>Vandierendonck, Hans ; Rul, Sean ; De Bosschere, Koen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c296t-2052227b775f5d3c0fc17ec9ea3cfbee198bd78b01683d11d51f7c496f46fcd43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Alignment</topic><topic>Architecture</topic><topic>Array processors</topic><topic>Blades</topic><topic>Boundary element method</topic><topic>Broadband</topic><topic>Complexity</topic><topic>Computation</topic><topic>Computer architecture</topic><topic>Engines</topic><topic>Mailboxes</topic><topic>Mathematical analysis</topic><topic>Microprocessors</topic><topic>Parallel processing</topic><topic>Programming</topic><topic>Running</topic><topic>Vector processing (computers)</topic><topic>Workload</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Vandierendonck, Hans</creatorcontrib><creatorcontrib>Rul, Sean</creatorcontrib><creatorcontrib>De Bosschere, Koen</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology & Engineering</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Computer journal</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Vandierendonck, Hans</au><au>Rul, Sean</au><au>De Bosschere, Koen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Accelerating Multiple Sequence Alignment with the Cell BE Processor</atitle><jtitle>Computer journal</jtitle><date>2010-07-01</date><risdate>2010</risdate><volume>53</volume><issue>6</issue><spage>814</spage><epage>826</epage><pages>814-826</pages><issn>0010-4620</issn><eissn>1460-2067</eissn><coden>CMPJAG</coden><abstract>The Cell Broadband Engine (BE) Architecture is a new heterogeneous multi-core architecture targeted at compute-intensive workloads. The architecture of the Cell BE has several features that are unique in high-performance general-purpose processors, most notably the extensive support for vectorization, scratch pad memories and explicit programming of direct memory accesses (DMAs) and mailbox communication. While these features strongly increase programming complexity, it is generally claimed that significant speedups can be obtained by using Cell BE processors. This paper presents our experiences with using the Cell BE architecture to accelerate Clustal W, a bio-informatics program for multiple sequence alignment. We report on how we apply the unique features of the Cell BE to Clustal W and how important each is in obtaining high performance. By making extensive use of vectorization and by parallelizing the application across all cores, we demonstrate a speedup of 24.4 times when using 16 synergistic processor units on a QS21 Cell Blade compared to single-thread execution on the power processing unit. As the Cell BE exploits a large number of slim cores, our highly optimized implementation is just 3.8 times faster than a 3-thread version running on an Intel Core2 Duo, as the latter processor exploits a small number of fat cores. [PUBLICATION ABSTRACT]</abstract><cop>Oxford</cop><pub>Oxford Publishing Limited (England)</pub><doi>10.1093/comjnl/bxp086</doi><tpages>13</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0010-4620 |
ispartof | Computer journal, 2010-07, Vol.53 (6), p.814-826 |
issn | 0010-4620 1460-2067 |
language | eng |
recordid | cdi_proquest_miscellaneous_753678845 |
source | Oxford University Press Journals All Titles (1996-Current) |
subjects | Alignment Architecture Array processors Blades Boundary element method Broadband Complexity Computation Computer architecture Engines Mailboxes Mathematical analysis Microprocessors Parallel processing Programming Running Vector processing (computers) Workload |
title | Accelerating Multiple Sequence Alignment with the Cell BE Processor |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T11%3A02%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Accelerating%20Multiple%20Sequence%20Alignment%20with%20the%20Cell%20BE%20Processor&rft.jtitle=Computer%20journal&rft.au=Vandierendonck,%20Hans&rft.date=2010-07-01&rft.volume=53&rft.issue=6&rft.spage=814&rft.epage=826&rft.pages=814-826&rft.issn=0010-4620&rft.eissn=1460-2067&rft.coden=CMPJAG&rft_id=info:doi/10.1093/comjnl/bxp086&rft_dat=%3Cproquest_cross%3E2097760681%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=734587128&rft_id=info:pmid/&rfr_iscdi=true |