Parallel solver for shifted systems in a hybrid CPU-GPU framework

This paper proposes a combination of a hybrid CPU--GPU and a pure GPU software implementation of a direct algorithm for solving shifted linear systems $(A - \sigma I)X = B$ with large number of complex shifts $\sigma$ and multiple right-hand sides. Such problems often appear e.g. in control theory w...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Bosner, Nela, Bujanović, Zvonimir, Drmač, Zlatko
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Bosner, Nela
Bujanović, Zvonimir
Drmač, Zlatko
description This paper proposes a combination of a hybrid CPU--GPU and a pure GPU software implementation of a direct algorithm for solving shifted linear systems $(A - \sigma I)X = B$ with large number of complex shifts $\sigma$ and multiple right-hand sides. Such problems often appear e.g. in control theory when evaluating the transfer function, or as a part of an algorithm performing interpolatory model reduction, as well as when computing pseudospectra and structured pseudospectra, or solving large linear systems of ordinary differential equations. The proposed algorithm first jointly reduces the general full $n\times n$ matrix $A$ and the $n\times m$ full right-hand side matrix $B$ to the controller Hessenberg canonical form that facilitates efficient solution: $A$ is transformed to a so-called $m$-Hessenberg form and $B$ is made upper-triangular. This is implemented as blocked highly parallel CPU--GPU hybrid algorithm; individual blocks are reduced by the CPU, and the necessary updates of the rest of the matrix are split among the cores of the CPU and the GPU. To enhance parallelization, the reduction and the updates are overlapped. In the next phase, the reduced $m$-Hessenberg--triangular systems are solved entirely on the GPU, with shifts divided into batches. The benefits of such load distribution are demonstrated by numerical experiments. In particular, we show that our proposed implementation provides an excellent basis for efficient implementations of computational methods in systems and control theory, from evaluation of transfer function to the interpolatory model reduction.
doi_str_mv 10.48550/arxiv.1708.06290
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1708_06290</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1708_06290</sourcerecordid><originalsourceid>FETCH-LOGICAL-a670-71e27909a2586661e3b7ed5284841f90f3a9e5857e41300d64a102ccc58f7eba3</originalsourceid><addsrcrecordid>eNotz71OwzAUQGEvDKjwAEz1CyS9duK_sYpoQapEhnaObpJr1cIhyK4KeXtEYTrbkT7GngSUtVUKNpi-w7UUBmwJWjq4Z9sWE8ZIkec5XilxPyeez8FfaOR5yReaMg8fHPl56VMYedOein174j7hRF9zen9gdx5jpsf_rthx93xsXorD2_612R4K1AYKI0gaBw6lslprQVVvaFTS1rYW3oGv0JGyylAtKoBR1yhADsOgrDfUY7Vi67_tzdB9pjBhWrpfS3ezVD_EgULK</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Parallel solver for shifted systems in a hybrid CPU-GPU framework</title><source>arXiv.org</source><creator>Bosner, Nela ; Bujanović, Zvonimir ; Drmač, Zlatko</creator><creatorcontrib>Bosner, Nela ; Bujanović, Zvonimir ; Drmač, Zlatko</creatorcontrib><description>This paper proposes a combination of a hybrid CPU--GPU and a pure GPU software implementation of a direct algorithm for solving shifted linear systems $(A - \sigma I)X = B$ with large number of complex shifts $\sigma$ and multiple right-hand sides. Such problems often appear e.g. in control theory when evaluating the transfer function, or as a part of an algorithm performing interpolatory model reduction, as well as when computing pseudospectra and structured pseudospectra, or solving large linear systems of ordinary differential equations. The proposed algorithm first jointly reduces the general full $n\times n$ matrix $A$ and the $n\times m$ full right-hand side matrix $B$ to the controller Hessenberg canonical form that facilitates efficient solution: $A$ is transformed to a so-called $m$-Hessenberg form and $B$ is made upper-triangular. This is implemented as blocked highly parallel CPU--GPU hybrid algorithm; individual blocks are reduced by the CPU, and the necessary updates of the rest of the matrix are split among the cores of the CPU and the GPU. To enhance parallelization, the reduction and the updates are overlapped. In the next phase, the reduced $m$-Hessenberg--triangular systems are solved entirely on the GPU, with shifts divided into batches. The benefits of such load distribution are demonstrated by numerical experiments. In particular, we show that our proposed implementation provides an excellent basis for efficient implementations of computational methods in systems and control theory, from evaluation of transfer function to the interpolatory model reduction.</description><identifier>DOI: 10.48550/arxiv.1708.06290</identifier><language>eng</language><subject>Computer Science - Mathematical Software ; Computer Science - Numerical Analysis</subject><creationdate>2017-08</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1708.06290$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1708.06290$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Bosner, Nela</creatorcontrib><creatorcontrib>Bujanović, Zvonimir</creatorcontrib><creatorcontrib>Drmač, Zlatko</creatorcontrib><title>Parallel solver for shifted systems in a hybrid CPU-GPU framework</title><description>This paper proposes a combination of a hybrid CPU--GPU and a pure GPU software implementation of a direct algorithm for solving shifted linear systems $(A - \sigma I)X = B$ with large number of complex shifts $\sigma$ and multiple right-hand sides. Such problems often appear e.g. in control theory when evaluating the transfer function, or as a part of an algorithm performing interpolatory model reduction, as well as when computing pseudospectra and structured pseudospectra, or solving large linear systems of ordinary differential equations. The proposed algorithm first jointly reduces the general full $n\times n$ matrix $A$ and the $n\times m$ full right-hand side matrix $B$ to the controller Hessenberg canonical form that facilitates efficient solution: $A$ is transformed to a so-called $m$-Hessenberg form and $B$ is made upper-triangular. This is implemented as blocked highly parallel CPU--GPU hybrid algorithm; individual blocks are reduced by the CPU, and the necessary updates of the rest of the matrix are split among the cores of the CPU and the GPU. To enhance parallelization, the reduction and the updates are overlapped. In the next phase, the reduced $m$-Hessenberg--triangular systems are solved entirely on the GPU, with shifts divided into batches. The benefits of such load distribution are demonstrated by numerical experiments. In particular, we show that our proposed implementation provides an excellent basis for efficient implementations of computational methods in systems and control theory, from evaluation of transfer function to the interpolatory model reduction.</description><subject>Computer Science - Mathematical Software</subject><subject>Computer Science - Numerical Analysis</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz71OwzAUQGEvDKjwAEz1CyS9duK_sYpoQapEhnaObpJr1cIhyK4KeXtEYTrbkT7GngSUtVUKNpi-w7UUBmwJWjq4Z9sWE8ZIkec5XilxPyeez8FfaOR5yReaMg8fHPl56VMYedOein174j7hRF9zen9gdx5jpsf_rthx93xsXorD2_612R4K1AYKI0gaBw6lslprQVVvaFTS1rYW3oGv0JGyylAtKoBR1yhADsOgrDfUY7Vi67_tzdB9pjBhWrpfS3ezVD_EgULK</recordid><startdate>20170821</startdate><enddate>20170821</enddate><creator>Bosner, Nela</creator><creator>Bujanović, Zvonimir</creator><creator>Drmač, Zlatko</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20170821</creationdate><title>Parallel solver for shifted systems in a hybrid CPU-GPU framework</title><author>Bosner, Nela ; Bujanović, Zvonimir ; Drmač, Zlatko</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a670-71e27909a2586661e3b7ed5284841f90f3a9e5857e41300d64a102ccc58f7eba3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Computer Science - Mathematical Software</topic><topic>Computer Science - Numerical Analysis</topic><toplevel>online_resources</toplevel><creatorcontrib>Bosner, Nela</creatorcontrib><creatorcontrib>Bujanović, Zvonimir</creatorcontrib><creatorcontrib>Drmač, Zlatko</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bosner, Nela</au><au>Bujanović, Zvonimir</au><au>Drmač, Zlatko</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Parallel solver for shifted systems in a hybrid CPU-GPU framework</atitle><date>2017-08-21</date><risdate>2017</risdate><abstract>This paper proposes a combination of a hybrid CPU--GPU and a pure GPU software implementation of a direct algorithm for solving shifted linear systems $(A - \sigma I)X = B$ with large number of complex shifts $\sigma$ and multiple right-hand sides. Such problems often appear e.g. in control theory when evaluating the transfer function, or as a part of an algorithm performing interpolatory model reduction, as well as when computing pseudospectra and structured pseudospectra, or solving large linear systems of ordinary differential equations. The proposed algorithm first jointly reduces the general full $n\times n$ matrix $A$ and the $n\times m$ full right-hand side matrix $B$ to the controller Hessenberg canonical form that facilitates efficient solution: $A$ is transformed to a so-called $m$-Hessenberg form and $B$ is made upper-triangular. This is implemented as blocked highly parallel CPU--GPU hybrid algorithm; individual blocks are reduced by the CPU, and the necessary updates of the rest of the matrix are split among the cores of the CPU and the GPU. To enhance parallelization, the reduction and the updates are overlapped. In the next phase, the reduced $m$-Hessenberg--triangular systems are solved entirely on the GPU, with shifts divided into batches. The benefits of such load distribution are demonstrated by numerical experiments. In particular, we show that our proposed implementation provides an excellent basis for efficient implementations of computational methods in systems and control theory, from evaluation of transfer function to the interpolatory model reduction.</abstract><doi>10.48550/arxiv.1708.06290</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1708.06290
ispartof
issn
language eng
recordid cdi_arxiv_primary_1708_06290
source arXiv.org
subjects Computer Science - Mathematical Software
Computer Science - Numerical Analysis
title Parallel solver for shifted systems in a hybrid CPU-GPU framework
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T16%3A27%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Parallel%20solver%20for%20shifted%20systems%20in%20a%20hybrid%20CPU-GPU%20framework&rft.au=Bosner,%20Nela&rft.date=2017-08-21&rft_id=info:doi/10.48550/arxiv.1708.06290&rft_dat=%3Carxiv_GOX%3E1708_06290%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true