BenchFriend: Correlating the performance of GPU benchmarks
Graphics processing units (GPUs) have become an important platform for general-purpose computing, thanks to their high parallel throughput and high memory bandwidth. GPUs present significantly different architectures from CPUs and require specific mappings and optimizations to achieve high performan...
Gespeichert in:
Veröffentlicht in: | The international journal of high performance computing applications 2014-05, Vol.28 (2), p.238-250 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 250 |
---|---|
container_issue | 2 |
container_start_page | 238 |
container_title | The international journal of high performance computing applications |
container_volume | 28 |
creator | Che, Shuai Skadron, Kevin |
description | Graphics processing units (GPUs) have become an important platform for general-purpose computing, thanks to their high parallel throughput and high memory bandwidth. GPUs present significantly different architectures from CPUs and require specific mappings and optimizations to achieve high performance. This makes GPU workloads demonstrate application characteristics different from those of CPU workloads. It is critical for researchers to understand the first-order metrics that most influence GPU performance and scalability. Furthermore, methodologies and associated tools are needed to analyze and predict the performance of GPU applications and help guide users’ purchasing decisions.
In this work, we study the approach of predicting the performance of GPU applications by correlating them to existing workloads. One tenet of benchmark design, also a motivation of this paper, is that users should be given the capability to leverage standard workloads to infer the performance of applications of their interest. We first identify a set of important GPU application characteristics and then use them to predict performance of an arbitrary application by determining its most similar proxy benchmarks. We demonstrate the prediction methodology and conduct predictions with benchmarks from different suites to achieve better workload coverage. The experimental results show that we are able to achieve satisfactory performance predictions, although errors are higher for outlier applications. Finally, we discuss several considerations for systematically constructing future benchmark suites. |
doi_str_mv | 10.1177/1094342013507960 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1541439329</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sage_id>10.1177_1094342013507960</sage_id><sourcerecordid>1541439329</sourcerecordid><originalsourceid>FETCH-LOGICAL-c325t-cc73a474eb6d6930cfbe26102548ae238ee3e0aae0f57f1e5396ec68ddceadcb3</originalsourceid><addsrcrecordid>eNp1kM1Lw0AQxRdRsFbv3kXwEp3Z2Y_kqMWqUPCi57DZTDQlTepue_C_d0uKSMHTDLzfezM8IS4RbhGtvUMoFCkJSBpsYeBITNAqzGSuzHHak5zt9FNxFuMSAIwiPRHTB-795zy03Nfn4qRxXeSL_ZyK9_nj2-w5W7w-vczuF5knqTeZ95acsoorU5uCwDcVS4MgtcodS8qZicE5hkbbBllTYdibvK49u9pXNBU3Y-46DF9bjpty1UbPXed6HraxRK1QUUGySOjVAboctqFP3yVK2kShkomCkfJhiDFwU65Du3Lhu0Qod-2Uh-0ky_U-2EXvuia43rfx1ydzDUbmlLhs5KL74D_H_8v9AYBNbfY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1527439142</pqid></control><display><type>article</type><title>BenchFriend: Correlating the performance of GPU benchmarks</title><source>SAGE Journals</source><source>Alma/SFX Local Collection</source><creator>Che, Shuai ; Skadron, Kevin</creator><creatorcontrib>Che, Shuai ; Skadron, Kevin</creatorcontrib><description>Graphics processing units (GPUs) have become an important platform for general-purpose computing, thanks to their high parallel throughput and high memory bandwidth. GPUs present significantly different architectures from CPUs and require specific mappings and optimizations to achieve high performance. This makes GPU workloads demonstrate application characteristics different from those of CPU workloads. It is critical for researchers to understand the first-order metrics that most influence GPU performance and scalability. Furthermore, methodologies and associated tools are needed to analyze and predict the performance of GPU applications and help guide users’ purchasing decisions.
In this work, we study the approach of predicting the performance of GPU applications by correlating them to existing workloads. One tenet of benchmark design, also a motivation of this paper, is that users should be given the capability to leverage standard workloads to infer the performance of applications of their interest. We first identify a set of important GPU application characteristics and then use them to predict performance of an arbitrary application by determining its most similar proxy benchmarks. We demonstrate the prediction methodology and conduct predictions with benchmarks from different suites to achieve better workload coverage. The experimental results show that we are able to achieve satisfactory performance predictions, although errors are higher for outlier applications. Finally, we discuss several considerations for systematically constructing future benchmark suites.</description><identifier>ISSN: 1094-3420</identifier><identifier>EISSN: 1741-2846</identifier><identifier>DOI: 10.1177/1094342013507960</identifier><language>eng</language><publisher>London, England: SAGE Publications</publisher><subject>Applied sciences ; Architecture ; Benchmarks ; Central processing units ; Computation ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Correlation ; Correlation analysis ; Exact sciences and technology ; Graphics processing units ; High performance computing ; Language processing and microprogramming ; Parallel processing ; Predictions ; Purchasing ; Scalability ; Software ; Studies ; Workload ; Workloads</subject><ispartof>The international journal of high performance computing applications, 2014-05, Vol.28 (2), p.238-250</ispartof><rights>The Author(s) 2013</rights><rights>2015 INIST-CNRS</rights><rights>Copyright SAGE PUBLICATIONS, INC. May 2014</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c325t-cc73a474eb6d6930cfbe26102548ae238ee3e0aae0f57f1e5396ec68ddceadcb3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://journals.sagepub.com/doi/pdf/10.1177/1094342013507960$$EPDF$$P50$$Gsage$$H</linktopdf><linktohtml>$$Uhttps://journals.sagepub.com/doi/10.1177/1094342013507960$$EHTML$$P50$$Gsage$$H</linktohtml><link.rule.ids>314,776,780,21798,27901,27902,43597,43598</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=28506283$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Che, Shuai</creatorcontrib><creatorcontrib>Skadron, Kevin</creatorcontrib><title>BenchFriend: Correlating the performance of GPU benchmarks</title><title>The international journal of high performance computing applications</title><description>Graphics processing units (GPUs) have become an important platform for general-purpose computing, thanks to their high parallel throughput and high memory bandwidth. GPUs present significantly different architectures from CPUs and require specific mappings and optimizations to achieve high performance. This makes GPU workloads demonstrate application characteristics different from those of CPU workloads. It is critical for researchers to understand the first-order metrics that most influence GPU performance and scalability. Furthermore, methodologies and associated tools are needed to analyze and predict the performance of GPU applications and help guide users’ purchasing decisions.
In this work, we study the approach of predicting the performance of GPU applications by correlating them to existing workloads. One tenet of benchmark design, also a motivation of this paper, is that users should be given the capability to leverage standard workloads to infer the performance of applications of their interest. We first identify a set of important GPU application characteristics and then use them to predict performance of an arbitrary application by determining its most similar proxy benchmarks. We demonstrate the prediction methodology and conduct predictions with benchmarks from different suites to achieve better workload coverage. The experimental results show that we are able to achieve satisfactory performance predictions, although errors are higher for outlier applications. Finally, we discuss several considerations for systematically constructing future benchmark suites.</description><subject>Applied sciences</subject><subject>Architecture</subject><subject>Benchmarks</subject><subject>Central processing units</subject><subject>Computation</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. User interface</subject><subject>Correlation</subject><subject>Correlation analysis</subject><subject>Exact sciences and technology</subject><subject>Graphics processing units</subject><subject>High performance computing</subject><subject>Language processing and microprogramming</subject><subject>Parallel processing</subject><subject>Predictions</subject><subject>Purchasing</subject><subject>Scalability</subject><subject>Software</subject><subject>Studies</subject><subject>Workload</subject><subject>Workloads</subject><issn>1094-3420</issn><issn>1741-2846</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><recordid>eNp1kM1Lw0AQxRdRsFbv3kXwEp3Z2Y_kqMWqUPCi57DZTDQlTepue_C_d0uKSMHTDLzfezM8IS4RbhGtvUMoFCkJSBpsYeBITNAqzGSuzHHak5zt9FNxFuMSAIwiPRHTB-795zy03Nfn4qRxXeSL_ZyK9_nj2-w5W7w-vczuF5knqTeZ95acsoorU5uCwDcVS4MgtcodS8qZicE5hkbbBllTYdibvK49u9pXNBU3Y-46DF9bjpty1UbPXed6HraxRK1QUUGySOjVAboctqFP3yVK2kShkomCkfJhiDFwU65Du3Lhu0Qod-2Uh-0ky_U-2EXvuia43rfx1ydzDUbmlLhs5KL74D_H_8v9AYBNbfY</recordid><startdate>20140501</startdate><enddate>20140501</enddate><creator>Che, Shuai</creator><creator>Skadron, Kevin</creator><general>SAGE Publications</general><general>Sage Publications</general><general>SAGE PUBLICATIONS, INC</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20140501</creationdate><title>BenchFriend</title><author>Che, Shuai ; Skadron, Kevin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c325t-cc73a474eb6d6930cfbe26102548ae238ee3e0aae0f57f1e5396ec68ddceadcb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Applied sciences</topic><topic>Architecture</topic><topic>Benchmarks</topic><topic>Central processing units</topic><topic>Computation</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. User interface</topic><topic>Correlation</topic><topic>Correlation analysis</topic><topic>Exact sciences and technology</topic><topic>Graphics processing units</topic><topic>High performance computing</topic><topic>Language processing and microprogramming</topic><topic>Parallel processing</topic><topic>Predictions</topic><topic>Purchasing</topic><topic>Scalability</topic><topic>Software</topic><topic>Studies</topic><topic>Workload</topic><topic>Workloads</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Che, Shuai</creatorcontrib><creatorcontrib>Skadron, Kevin</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>The international journal of high performance computing applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Che, Shuai</au><au>Skadron, Kevin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>BenchFriend: Correlating the performance of GPU benchmarks</atitle><jtitle>The international journal of high performance computing applications</jtitle><date>2014-05-01</date><risdate>2014</risdate><volume>28</volume><issue>2</issue><spage>238</spage><epage>250</epage><pages>238-250</pages><issn>1094-3420</issn><eissn>1741-2846</eissn><abstract>Graphics processing units (GPUs) have become an important platform for general-purpose computing, thanks to their high parallel throughput and high memory bandwidth. GPUs present significantly different architectures from CPUs and require specific mappings and optimizations to achieve high performance. This makes GPU workloads demonstrate application characteristics different from those of CPU workloads. It is critical for researchers to understand the first-order metrics that most influence GPU performance and scalability. Furthermore, methodologies and associated tools are needed to analyze and predict the performance of GPU applications and help guide users’ purchasing decisions.
In this work, we study the approach of predicting the performance of GPU applications by correlating them to existing workloads. One tenet of benchmark design, also a motivation of this paper, is that users should be given the capability to leverage standard workloads to infer the performance of applications of their interest. We first identify a set of important GPU application characteristics and then use them to predict performance of an arbitrary application by determining its most similar proxy benchmarks. We demonstrate the prediction methodology and conduct predictions with benchmarks from different suites to achieve better workload coverage. The experimental results show that we are able to achieve satisfactory performance predictions, although errors are higher for outlier applications. Finally, we discuss several considerations for systematically constructing future benchmark suites.</abstract><cop>London, England</cop><pub>SAGE Publications</pub><doi>10.1177/1094342013507960</doi><tpages>13</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1094-3420 |
ispartof | The international journal of high performance computing applications, 2014-05, Vol.28 (2), p.238-250 |
issn | 1094-3420 1741-2846 |
language | eng |
recordid | cdi_proquest_miscellaneous_1541439329 |
source | SAGE Journals; Alma/SFX Local Collection |
subjects | Applied sciences Architecture Benchmarks Central processing units Computation Computer science control theory systems Computer systems and distributed systems. User interface Correlation Correlation analysis Exact sciences and technology Graphics processing units High performance computing Language processing and microprogramming Parallel processing Predictions Purchasing Scalability Software Studies Workload Workloads |
title | BenchFriend: Correlating the performance of GPU benchmarks |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T15%3A25%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=BenchFriend:%20Correlating%20the%20performance%20of%20GPU%20benchmarks&rft.jtitle=The%20international%20journal%20of%20high%20performance%20computing%20applications&rft.au=Che,%20Shuai&rft.date=2014-05-01&rft.volume=28&rft.issue=2&rft.spage=238&rft.epage=250&rft.pages=238-250&rft.issn=1094-3420&rft.eissn=1741-2846&rft_id=info:doi/10.1177/1094342013507960&rft_dat=%3Cproquest_cross%3E1541439329%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1527439142&rft_id=info:pmid/&rft_sage_id=10.1177_1094342013507960&rfr_iscdi=true |