Parallel variable-band Choleski solvers for computational structural analysis applications on vector multiprocessor supercomputers
A Choleski method used to solve linear systems of equations that arise in large scale structural analyses is described. The method uses a novel variable-band storage scheme and is structured to exploit fast local memory caches while minimizing data access delays between main memory and vector regist...
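As a rough illustration of what a variable-band (profile) Choleski factorization involves, the following Python sketch stores each row of the lower triangle only from its first nonzero column to the diagonal. It is a minimal sketch under stated assumptions, not the authors' CRAY implementation: the function name `variable_band_cholesky`, the `rows`/`first` layout, and the small test matrix are hypothetical, and no vectorization, cache blocking, or parallel tasking is attempted.

```python
import math


def variable_band_cholesky(rows, first):
    """Factor a symmetric positive definite matrix A = L * L^T.

    Hypothetical storage layout (an assumption for this sketch):
    rows[i] holds A[i][first[i]] .. A[i][i], i.e. the part of row i of
    the lower triangle that lies inside that row's band.
    Returns L in the same variable-band layout.
    """
    n = len(rows)
    L = [list(r) for r in rows]  # work on a copy, leave the input intact
    for i in range(n):
        for j in range(first[i], i + 1):
            # Only columns inside both row bands contribute to the dot product.
            lo = max(first[i], first[j])
            s = L[i][j - first[i]]
            for k in range(lo, j):
                s -= L[i][k - first[i]] * L[j][k - first[j]]
            if j == i:
                if s <= 0.0:
                    raise ValueError("matrix is not positive definite")
                L[i][j - first[i]] = math.sqrt(s)
            else:
                # Divide by the diagonal entry of row j (last stored entry).
                L[i][j - first[i]] = s / L[j][j - first[j]]
    return L


# Example: A = [[4, 2, 0], [2, 5, 4], [0, 4, 13]]; row 2 starts at column 1.
factor = variable_band_cholesky([[4.0], [2.0, 5.0], [4.0, 13.0]], [0, 0, 1])
print(factor)  # [[2.0], [1.0, 2.0], [2.0, 3.0]]
```

Because zeros outside each row's band are never stored or touched, the work per row scales with that row's bandwidth rather than with the full matrix dimension, which is the property a variable-band scheme relies on.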
Published in: | Computing systems in engineering 1991, Vol.2 (2), p.183-196 |
---|---|
Main authors: | Poole, E.L.; Overman, A.L. |
Format: | Article |
Language: | eng |
Subjects: | Computer Programming And Software |
Online access: | Full text |
container_end_page | 196 |
---|---|
container_issue | 2 |
container_start_page | 183 |
container_title | Computing systems in engineering |
container_volume | 2 |
creator | Poole, E.L.; Overman, A.L. |
description | A Choleski method used to solve linear systems of equations that arise in large scale structural analyses is described. The method uses a novel variable-band storage scheme and is structured to exploit fast local memory caches while minimizing data access delays between main memory and vector registers. Several parallel implementations of this method are described for the CRAY-2 and CRAY Y-MP computers demonstrating the use of microtasking and autotasking directives. A portable parallel language, FORCE, is also used for two different parallel implementations, demonstrating the use of CRAY macrotasking. Results are presented comparing the matrix factorization times for three representative structural analysis problems from runs made in both dedicated and multi-user modes on both the CRAY-2 and CRAY Y-MP computers. CPU and wall clock timings are given for the various parallel methods and are compared to single processor timings of the same algorithm. Computation rates over 1 GIGAFLOP (1 billion floating point operations per second) on a four processor CRAY-2 and over 2 GIGAFLOPS on an eight processor CRAY Y-MP are demonstrated as measured by wall clock time in a dedicated environment. Reduced wall clock times for the parallel methods relative to the single processor implementation of the same Choleski algorithm are also demonstrated for runs made in multi-user mode. |
doi_str_mv | 10.1016/0956-0521(91)90019-2 |
format | Article |
publisher | Legacy CDMS: Elsevier B.V |
tpages | 14 |
fulltext | fulltext |
identifier | ISSN: 0956-0521 |
ispartof | Computing systems in engineering, 1991, Vol.2 (2), p.183-196 |
issn | 0956-0521; 1873-6211 |
language | eng |
recordid | cdi_proquest_miscellaneous_25384414 |
source | NASA Technical Reports Server; Alma/SFX Local Collection |
subjects | Computer Programming And Software |
title | Parallel variable-band Choleski solvers for computational structural analysis applications on vector multiprocessor supercomputers |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T02%3A01%3A08IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Parallel%20variable-band%20Choleski%20solvers%20for%20computational%20structural%20analysis%20applications%20on%20vector%20multiprocessor%20supercomputers&rft.jtitle=Computing%20systems%20in%20engineering&rft.au=Poole,%20E.L.&rft.date=1991&rft.volume=2&rft.issue=2&rft.spage=183&rft.epage=196&rft.pages=183-196&rft.issn=0956-0521&rft.eissn=1873-6211&rft_id=info:doi/10.1016/0956-0521(91)90019-2&rft_dat=%3Cproquest_cross%3E25384414%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=25384414&rft_id=info:pmid/&rft_els_id=0956052191900192&rfr_iscdi=true |