Parallel solver for shifted systems in a hybrid CPU-GPU framework
This paper proposes a combination of a hybrid CPU--GPU and a pure GPU software implementation of a direct algorithm for solving shifted linear systems $(A - \sigma I)X = B$ with a large number of complex shifts $\sigma$ and multiple right-hand sides. Such problems often appear, e.g., in control theory w...
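To make the problem statement concrete, the following is a minimal NumPy sketch of solving $(A - \sigma I)X = B$ for a batch of complex shifts at once. It is only a CPU baseline under stated assumptions, not the implementation described in the paper: the function name `solve_shifted_batch` is hypothetical, and each shifted matrix is factorized independently by a generic dense solver.

```python
import numpy as np


def solve_shifted_batch(A, B, shifts):
    """Solve (A - sigma*I) X = B for every sigma in `shifts`.

    Hypothetical helper for illustration only: every shifted matrix is
    formed explicitly and handed to a generic dense solver.
    Returns an array of shape (len(shifts), n, m).
    """
    A = np.asarray(A, dtype=complex)
    B = np.asarray(B, dtype=complex)
    shifts = np.asarray(shifts, dtype=complex)
    n = A.shape[0]
    # Stack of shifted matrices, shape (k, n, n).
    shifted = A[None, :, :] - shifts[:, None, None] * np.eye(n)
    # Repeat the right-hand side block for every shift, shape (k, n, m).
    rhs = np.broadcast_to(B, (shifts.size,) + B.shape)
    # np.linalg.solve treats the leading axis as a batch dimension.
    return np.linalg.solve(shifted, rhs)
```

Each shift here costs a full dense factorization, which is the cost that a one-time joint reduction of $A$ and $B$ is meant to avoid when the number of shifts is large.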
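Main authors: | Bosner, Nela; Bujanović, Zvonimir; Drmač, Zlatko |
---|---|
Format: | Article |
Language: | eng |
Subjects: | Computer Science - Mathematical Software; Computer Science - Numerical Analysis |
Online access: | Order full text |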
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Bosner, Nela; Bujanović, Zvonimir; Drmač, Zlatko |
description | This paper proposes a combination of a hybrid CPU--GPU and a pure GPU
software implementation of a direct algorithm for solving shifted linear
systems $(A - \sigma I)X = B$ with a large number of complex shifts $\sigma$ and
multiple right-hand sides. Such problems often appear, e.g., in control theory
when evaluating the transfer function, or as part of an algorithm performing
interpolatory model reduction, as well as when computing pseudospectra and
structured pseudospectra, or solving large linear systems of ordinary
differential equations. The proposed algorithm first jointly reduces the
general full $n\times n$ matrix $A$ and the $n\times m$ full right-hand side
matrix $B$ to the controller Hessenberg canonical form, which facilitates
efficient solution: $A$ is transformed to a so-called $m$-Hessenberg form and
$B$ is made upper triangular. This is implemented as a blocked, highly parallel
CPU--GPU hybrid algorithm; individual blocks are reduced by the CPU, and the
necessary updates of the rest of the matrix are split among the cores of the
CPU and the GPU. To enhance parallelization, the reduction and the updates are
overlapped. In the next phase, the reduced $m$-Hessenberg--triangular systems
are solved entirely on the GPU, with shifts divided into batches. The benefits
of such load distribution are demonstrated by numerical experiments. In
particular, we show that our proposed implementation provides an excellent
basis for efficient implementations of computational methods in systems and
control theory, from evaluation of transfer functions to interpolatory model
reduction. |
doi_str_mv | 10.48550/arxiv.1708.06290 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1708_06290</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1708_06290</sourcerecordid><originalsourceid>FETCH-LOGICAL-a670-71e27909a2586661e3b7ed5284841f90f3a9e5857e41300d64a102ccc58f7eba3</originalsourceid><addsrcrecordid>eNotz71OwzAUQGEvDKjwAEz1CyS9duK_sYpoQapEhnaObpJr1cIhyK4KeXtEYTrbkT7GngSUtVUKNpi-w7UUBmwJWjq4Z9sWE8ZIkec5XilxPyeez8FfaOR5yReaMg8fHPl56VMYedOein174j7hRF9zen9gdx5jpsf_rthx93xsXorD2_612R4K1AYKI0gaBw6lslprQVVvaFTS1rYW3oGv0JGyylAtKoBR1yhADsOgrDfUY7Vi67_tzdB9pjBhWrpfS3ezVD_EgULK</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Parallel solver for shifted systems in a hybrid CPU-GPU framework</title><source>arXiv.org</source><creator>Bosner, Nela ; Bujanović, Zvonimir ; Drmač, Zlatko</creator><creatorcontrib>Bosner, Nela ; Bujanović, Zvonimir ; Drmač, Zlatko</creatorcontrib><identifier>DOI: 10.48550/arxiv.1708.06290</identifier><language>eng</language><subject>Computer Science - Mathematical Software ; Computer Science - Numerical Analysis</subject><creationdate>2017-08</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1708.06290$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1708.06290$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Bosner, Nela</creatorcontrib><creatorcontrib>Bujanović, Zvonimir</creatorcontrib><creatorcontrib>Drmač, Zlatko</creatorcontrib><title>Parallel solver for shifted systems in a hybrid CPU-GPU framework</title><subject>Computer Science - Mathematical Software</subject><subject>Computer Science - Numerical Analysis</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz71OwzAUQGEvDKjwAEz1CyS9duK_sYpoQapEhnaObpJr1cIhyK4KeXtEYTrbkT7GngSUtVUKNpi-w7UUBmwJWjq4Z9sWE8ZIkec5XilxPyeez8FfaOR5yReaMg8fHPl56VMYedOein174j7hRF9zen9gdx5jpsf_rthx93xsXorD2_612R4K1AYKI0gaBw6lslprQVVvaFTS1rYW3oGv0JGyylAtKoBR1yhADsOgrDfUY7Vi67_tzdB9pjBhWrpfS3ezVD_EgULK</recordid><startdate>20170821</startdate><enddate>20170821</enddate><creator>Bosner, Nela</creator><creator>Bujanović, Zvonimir</creator><creator>Drmač, Zlatko</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20170821</creationdate><title>Parallel solver for shifted systems in a hybrid CPU-GPU framework</title><author>Bosner, Nela ; Bujanović, Zvonimir ; Drmač, Zlatko</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a670-71e27909a2586661e3b7ed5284841f90f3a9e5857e41300d64a102ccc58f7eba3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Computer Science - Mathematical Software</topic><topic>Computer Science - Numerical Analysis</topic><toplevel>online_resources</toplevel><creatorcontrib>Bosner, Nela</creatorcontrib><creatorcontrib>Bujanović, Zvonimir</creatorcontrib><creatorcontrib>Drmač, Zlatko</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bosner, Nela</au><au>Bujanović, Zvonimir</au><au>Drmač, Zlatko</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Parallel solver for shifted systems in a hybrid CPU-GPU framework</atitle><date>2017-08-21</date><risdate>2017</risdate><doi>10.48550/arxiv.1708.06290</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.1708.06290 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_1708_06290 |
source | arXiv.org |
subjects | Computer Science - Mathematical Software; Computer Science - Numerical Analysis |
title | Parallel solver for shifted systems in a hybrid CPU-GPU framework |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T16%3A27%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Parallel%20solver%20for%20shifted%20systems%20in%20a%20hybrid%20CPU-GPU%20framework&rft.au=Bosner,%20Nela&rft.date=2017-08-21&rft_id=info:doi/10.48550/arxiv.1708.06290&rft_dat=%3Carxiv_GOX%3E1708_06290%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |
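The reduce-once, solve-per-shift idea from the abstract can be sketched on the CPU with SciPy. This is a simplified illustration and not the authors' algorithm: it uses the standard Hessenberg form from `scipy.linalg.hessenberg` instead of the $m$-Hessenberg controller form, it leaves $B$ as a full block, and it runs sequentially rather than in GPU batches. The function names `solve_hessenberg` and `solve_shifted_via_hessenberg` are hypothetical. The sketch relies on the fact that if $A = Q H Q^*$ with $H$ upper Hessenberg, then $A - \sigma I = Q (H - \sigma I) Q^*$ and $H - \sigma I$ is still upper Hessenberg for every shift.

```python
import numpy as np
from scipy.linalg import hessenberg, solve_triangular


def solve_hessenberg(H, C):
    """Solve H Y = C for upper Hessenberg H in O(n^2) per right-hand side.

    Hypothetical helper: Gaussian elimination with partial pivoting, where
    only the single subdiagonal entry in each column has to be eliminated.
    """
    H = np.array(H, dtype=complex)  # working copies
    C = np.array(C, dtype=complex)
    n = H.shape[0]
    for k in range(n - 1):
        if H[k + 1, k] == 0:
            continue  # nothing to eliminate in this column
        if abs(H[k + 1, k]) > abs(H[k, k]):
            # Swap rows k and k+1 so the larger entry becomes the pivot.
            H[[k, k + 1], k:] = H[[k + 1, k], k:]
            C[[k, k + 1]] = C[[k + 1, k]]
        f = H[k + 1, k] / H[k, k]
        H[k + 1, k:] -= f * H[k, k:]
        C[k + 1] -= f * C[k]
    # Only the upper triangle is read by the triangular solver.
    return solve_triangular(H, C)


def solve_shifted_via_hessenberg(A, B, shifts):
    """Reduce A once, then solve (A - sigma*I) X = B for each shift."""
    H, Q = hessenberg(A, calc_q=True)  # A = Q H Q^*
    C = Q.conj().T @ B                 # transformed right-hand sides
    eye = np.eye(A.shape[0])
    # Each shift only needs a cheap Hessenberg solve and a product with Q.
    return np.stack([Q @ solve_hessenberg(H - s * eye, C) for s in shifts])
```

The $O(n^3)$ reduction is paid once, and every additional shift then costs $O(n^2)$ per right-hand side in the Hessenberg solve, which is why the approach pays off when the number of shifts is large.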