BenchFriend: Correlating the performance of GPU benchmarks

Graphics processing units (GPUs) have become an important platform for general-purpose computing, thanks to their high parallel throughput and high memory bandwidth. GPUs present significantly different architectures from CPUs and require specific mappings and optimizations to achieve high performan...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The international journal of high performance computing applications 2014-05, Vol.28 (2), p.238-250
Hauptverfasser: Che, Shuai, Skadron, Kevin
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 250
container_issue 2
container_start_page 238
container_title The international journal of high performance computing applications
container_volume 28
creator Che, Shuai
Skadron, Kevin
description Graphics processing units (GPUs) have become an important platform for general-purpose computing, thanks to their high parallel throughput and high memory bandwidth. GPUs present significantly different architectures from CPUs and require specific mappings and optimizations to achieve high performance. This makes GPU workloads demonstrate application characteristics different from those of CPU workloads. It is critical for researchers to understand the first-order metrics that most influence GPU performance and scalability. Furthermore, methodologies and associated tools are needed to analyze and predict the performance of GPU applications and help guide users’ purchasing decisions. In this work, we study the approach of predicting the performance of GPU applications by correlating them to existing workloads. One tenet of benchmark design, also a motivation of this paper, is that users should be given the capability to leverage standard workloads to infer the performance of applications of their interest. We first identify a set of important GPU application characteristics and then use them to predict performance of an arbitrary application by determining its most similar proxy benchmarks. We demonstrate the prediction methodology and conduct predictions with benchmarks from different suites to achieve better workload coverage. The experimental results show that we are able to achieve satisfactory performance predictions, although errors are higher for outlier applications. Finally, we discuss several considerations for systematically constructing future benchmark suites.
doi_str_mv 10.1177/1094342013507960
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1541439329</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sage_id>10.1177_1094342013507960</sage_id><sourcerecordid>1541439329</sourcerecordid><originalsourceid>FETCH-LOGICAL-c325t-cc73a474eb6d6930cfbe26102548ae238ee3e0aae0f57f1e5396ec68ddceadcb3</originalsourceid><addsrcrecordid>eNp1kM1Lw0AQxRdRsFbv3kXwEp3Z2Y_kqMWqUPCi57DZTDQlTepue_C_d0uKSMHTDLzfezM8IS4RbhGtvUMoFCkJSBpsYeBITNAqzGSuzHHak5zt9FNxFuMSAIwiPRHTB-795zy03Nfn4qRxXeSL_ZyK9_nj2-w5W7w-vczuF5knqTeZ95acsoorU5uCwDcVS4MgtcodS8qZicE5hkbbBllTYdibvK49u9pXNBU3Y-46DF9bjpty1UbPXed6HraxRK1QUUGySOjVAboctqFP3yVK2kShkomCkfJhiDFwU65Du3Lhu0Qod-2Uh-0ky_U-2EXvuia43rfx1ydzDUbmlLhs5KL74D_H_8v9AYBNbfY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1527439142</pqid></control><display><type>article</type><title>BenchFriend: Correlating the performance of GPU benchmarks</title><source>SAGE Journals</source><source>Alma/SFX Local Collection</source><creator>Che, Shuai ; Skadron, Kevin</creator><creatorcontrib>Che, Shuai ; Skadron, Kevin</creatorcontrib><description>Graphics processing units (GPUs) have become an important platform for general-purpose computing, thanks to their high parallel throughput and high memory bandwidth. GPUs present significantly different architectures from CPUs and require specific mappings and optimizations to achieve high performance. This makes GPU workloads demonstrate application characteristics different from those of CPU workloads. It is critical for researchers to understand the first-order metrics that most influence GPU performance and scalability. Furthermore, methodologies and associated tools are needed to analyze and predict the performance of GPU applications and help guide users’ purchasing decisions. In this work, we study the approach of predicting the performance of GPU applications by correlating them to existing workloads. One tenet of benchmark design, also a motivation of this paper, is that users should be given the capability to leverage standard workloads to infer the performance of applications of their interest. We first identify a set of important GPU application characteristics and then use them to predict performance of an arbitrary application by determining its most similar proxy benchmarks. We demonstrate the prediction methodology and conduct predictions with benchmarks from different suites to achieve better workload coverage. The experimental results show that we are able to achieve satisfactory performance predictions, although errors are higher for outlier applications. Finally, we discuss several considerations for systematically constructing future benchmark suites.</description><identifier>ISSN: 1094-3420</identifier><identifier>EISSN: 1741-2846</identifier><identifier>DOI: 10.1177/1094342013507960</identifier><language>eng</language><publisher>London, England: SAGE Publications</publisher><subject>Applied sciences ; Architecture ; Benchmarks ; Central processing units ; Computation ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Correlation ; Correlation analysis ; Exact sciences and technology ; Graphics processing units ; High performance computing ; Language processing and microprogramming ; Parallel processing ; Predictions ; Purchasing ; Scalability ; Software ; Studies ; Workload ; Workloads</subject><ispartof>The international journal of high performance computing applications, 2014-05, Vol.28 (2), p.238-250</ispartof><rights>The Author(s) 2013</rights><rights>2015 INIST-CNRS</rights><rights>Copyright SAGE PUBLICATIONS, INC. May 2014</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c325t-cc73a474eb6d6930cfbe26102548ae238ee3e0aae0f57f1e5396ec68ddceadcb3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://journals.sagepub.com/doi/pdf/10.1177/1094342013507960$$EPDF$$P50$$Gsage$$H</linktopdf><linktohtml>$$Uhttps://journals.sagepub.com/doi/10.1177/1094342013507960$$EHTML$$P50$$Gsage$$H</linktohtml><link.rule.ids>314,776,780,21798,27901,27902,43597,43598</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=28506283$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Che, Shuai</creatorcontrib><creatorcontrib>Skadron, Kevin</creatorcontrib><title>BenchFriend: Correlating the performance of GPU benchmarks</title><title>The international journal of high performance computing applications</title><description>Graphics processing units (GPUs) have become an important platform for general-purpose computing, thanks to their high parallel throughput and high memory bandwidth. GPUs present significantly different architectures from CPUs and require specific mappings and optimizations to achieve high performance. This makes GPU workloads demonstrate application characteristics different from those of CPU workloads. It is critical for researchers to understand the first-order metrics that most influence GPU performance and scalability. Furthermore, methodologies and associated tools are needed to analyze and predict the performance of GPU applications and help guide users’ purchasing decisions. In this work, we study the approach of predicting the performance of GPU applications by correlating them to existing workloads. One tenet of benchmark design, also a motivation of this paper, is that users should be given the capability to leverage standard workloads to infer the performance of applications of their interest. We first identify a set of important GPU application characteristics and then use them to predict performance of an arbitrary application by determining its most similar proxy benchmarks. We demonstrate the prediction methodology and conduct predictions with benchmarks from different suites to achieve better workload coverage. The experimental results show that we are able to achieve satisfactory performance predictions, although errors are higher for outlier applications. Finally, we discuss several considerations for systematically constructing future benchmark suites.</description><subject>Applied sciences</subject><subject>Architecture</subject><subject>Benchmarks</subject><subject>Central processing units</subject><subject>Computation</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. User interface</subject><subject>Correlation</subject><subject>Correlation analysis</subject><subject>Exact sciences and technology</subject><subject>Graphics processing units</subject><subject>High performance computing</subject><subject>Language processing and microprogramming</subject><subject>Parallel processing</subject><subject>Predictions</subject><subject>Purchasing</subject><subject>Scalability</subject><subject>Software</subject><subject>Studies</subject><subject>Workload</subject><subject>Workloads</subject><issn>1094-3420</issn><issn>1741-2846</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><recordid>eNp1kM1Lw0AQxRdRsFbv3kXwEp3Z2Y_kqMWqUPCi57DZTDQlTepue_C_d0uKSMHTDLzfezM8IS4RbhGtvUMoFCkJSBpsYeBITNAqzGSuzHHak5zt9FNxFuMSAIwiPRHTB-795zy03Nfn4qRxXeSL_ZyK9_nj2-w5W7w-vczuF5knqTeZ95acsoorU5uCwDcVS4MgtcodS8qZicE5hkbbBllTYdibvK49u9pXNBU3Y-46DF9bjpty1UbPXed6HraxRK1QUUGySOjVAboctqFP3yVK2kShkomCkfJhiDFwU65Du3Lhu0Qod-2Uh-0ky_U-2EXvuia43rfx1ydzDUbmlLhs5KL74D_H_8v9AYBNbfY</recordid><startdate>20140501</startdate><enddate>20140501</enddate><creator>Che, Shuai</creator><creator>Skadron, Kevin</creator><general>SAGE Publications</general><general>Sage Publications</general><general>SAGE PUBLICATIONS, INC</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20140501</creationdate><title>BenchFriend</title><author>Che, Shuai ; Skadron, Kevin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c325t-cc73a474eb6d6930cfbe26102548ae238ee3e0aae0f57f1e5396ec68ddceadcb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Applied sciences</topic><topic>Architecture</topic><topic>Benchmarks</topic><topic>Central processing units</topic><topic>Computation</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. User interface</topic><topic>Correlation</topic><topic>Correlation analysis</topic><topic>Exact sciences and technology</topic><topic>Graphics processing units</topic><topic>High performance computing</topic><topic>Language processing and microprogramming</topic><topic>Parallel processing</topic><topic>Predictions</topic><topic>Purchasing</topic><topic>Scalability</topic><topic>Software</topic><topic>Studies</topic><topic>Workload</topic><topic>Workloads</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Che, Shuai</creatorcontrib><creatorcontrib>Skadron, Kevin</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>The international journal of high performance computing applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Che, Shuai</au><au>Skadron, Kevin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>BenchFriend: Correlating the performance of GPU benchmarks</atitle><jtitle>The international journal of high performance computing applications</jtitle><date>2014-05-01</date><risdate>2014</risdate><volume>28</volume><issue>2</issue><spage>238</spage><epage>250</epage><pages>238-250</pages><issn>1094-3420</issn><eissn>1741-2846</eissn><abstract>Graphics processing units (GPUs) have become an important platform for general-purpose computing, thanks to their high parallel throughput and high memory bandwidth. GPUs present significantly different architectures from CPUs and require specific mappings and optimizations to achieve high performance. This makes GPU workloads demonstrate application characteristics different from those of CPU workloads. It is critical for researchers to understand the first-order metrics that most influence GPU performance and scalability. Furthermore, methodologies and associated tools are needed to analyze and predict the performance of GPU applications and help guide users’ purchasing decisions. In this work, we study the approach of predicting the performance of GPU applications by correlating them to existing workloads. One tenet of benchmark design, also a motivation of this paper, is that users should be given the capability to leverage standard workloads to infer the performance of applications of their interest. We first identify a set of important GPU application characteristics and then use them to predict performance of an arbitrary application by determining its most similar proxy benchmarks. We demonstrate the prediction methodology and conduct predictions with benchmarks from different suites to achieve better workload coverage. The experimental results show that we are able to achieve satisfactory performance predictions, although errors are higher for outlier applications. Finally, we discuss several considerations for systematically constructing future benchmark suites.</abstract><cop>London, England</cop><pub>SAGE Publications</pub><doi>10.1177/1094342013507960</doi><tpages>13</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1094-3420
ispartof The international journal of high performance computing applications, 2014-05, Vol.28 (2), p.238-250
issn 1094-3420
1741-2846
language eng
recordid cdi_proquest_miscellaneous_1541439329
source SAGE Journals; Alma/SFX Local Collection
subjects Applied sciences
Architecture
Benchmarks
Central processing units
Computation
Computer science
control theory
systems
Computer systems and distributed systems. User interface
Correlation
Correlation analysis
Exact sciences and technology
Graphics processing units
High performance computing
Language processing and microprogramming
Parallel processing
Predictions
Purchasing
Scalability
Software
Studies
Workload
Workloads
title BenchFriend: Correlating the performance of GPU benchmarks
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T15%3A25%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=BenchFriend:%20Correlating%20the%20performance%20of%20GPU%20benchmarks&rft.jtitle=The%20international%20journal%20of%20high%20performance%20computing%20applications&rft.au=Che,%20Shuai&rft.date=2014-05-01&rft.volume=28&rft.issue=2&rft.spage=238&rft.epage=250&rft.pages=238-250&rft.issn=1094-3420&rft.eissn=1741-2846&rft_id=info:doi/10.1177/1094342013507960&rft_dat=%3Cproquest_cross%3E1541439329%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1527439142&rft_id=info:pmid/&rft_sage_id=10.1177_1094342013507960&rfr_iscdi=true