Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization
There is a large space of NUMA and hardware prefetcher configurations that can significantly impact the performance of an application. Previous studies have demonstrated how a model can automatically select configurations based on the dynamic properties of the code to achieve speedups. This paper de...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | TehraniJamsaz, Ali Popov, Mihail Dutta, Akash Saillard, Emmanuelle Jannesari, Ali |
description | There is a large space of NUMA and hardware prefetcher configurations that
can significantly impact the performance of an application. Previous studies
have demonstrated how a model can automatically select configurations based on
the dynamic properties of the code to achieve speedups. This paper demonstrates
how the static Intermediate Representation (IR) of the code can guide
NUMA/prefetcher optimizations without the prohibitive cost of performance
profiling. We propose a method to create a comprehensive dataset that includes
a diverse set of intermediate representations along with optimum
configurations. We then apply a graph neural network model in order to validate
this dataset. We show that our static intermediate representation based model
achieves 80% of the performance gains provided by expensive dynamic performance
profiling based strategies. We further develop a hybrid model that uses both
static and dynamic information. Our hybrid model achieves the same gains as the
dynamic models but at a reduced cost by only profiling 30% of the programs. |
doi_str_mv | 10.48550/arxiv.2203.00611 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2203_00611</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2203_00611</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2203_006113</originalsourceid><addsrcrecordid>eNqFjrsOgkAQAK-xMOoHWLk_IB4ixtYYX4miMVqTjSxwEQ6yd_j6eoXYW00zmYwQfVc6k5nvyxHyU92d8Vh6jpRT122LZEfIWukEttoS5xQptAQnKpkMaYtWFdpAZWplzVimEFDFmH1hHwXfDMQFQ3DZzwF1BEemmOw1JTZwKK3K1btJdEUrxsxQ78eOGKyW58Vm2CyFJasc-RXWa2Gz5v03Prq5RYI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization</title><source>arXiv.org</source><creator>TehraniJamsaz, Ali ; Popov, Mihail ; Dutta, Akash ; Saillard, Emmanuelle ; Jannesari, Ali</creator><creatorcontrib>TehraniJamsaz, Ali ; Popov, Mihail ; Dutta, Akash ; Saillard, Emmanuelle ; Jannesari, Ali</creatorcontrib><description>There is a large space of NUMA and hardware prefetcher configurations that
can significantly impact the performance of an application. Previous studies
have demonstrated how a model can automatically select configurations based on
the dynamic properties of the code to achieve speedups. This paper demonstrates
how the static Intermediate Representation (IR) of the code can guide
NUMA/prefetcher optimizations without the prohibitive cost of performance
profiling. We propose a method to create a comprehensive dataset that includes
a diverse set of intermediate representations along with optimum
configurations. We then apply a graph neural network model in order to validate
this dataset. We show that our static intermediate representation based model
achieves 80% of the performance gains provided by expensive dynamic performance
profiling based strategies. We further develop a hybrid model that uses both
static and dynamic information. Our hybrid model achieves the same gains as the
dynamic models but at a reduced cost by only profiling 30% of the programs.</description><identifier>DOI: 10.48550/arxiv.2203.00611</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Distributed, Parallel, and Cluster Computing ; Computer Science - Learning</subject><creationdate>2022-03</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2203.00611$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2203.00611$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>TehraniJamsaz, Ali</creatorcontrib><creatorcontrib>Popov, Mihail</creatorcontrib><creatorcontrib>Dutta, Akash</creatorcontrib><creatorcontrib>Saillard, Emmanuelle</creatorcontrib><creatorcontrib>Jannesari, Ali</creatorcontrib><title>Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization</title><description>There is a large space of NUMA and hardware prefetcher configurations that
can significantly impact the performance of an application. Previous studies
have demonstrated how a model can automatically select configurations based on
the dynamic properties of the code to achieve speedups. This paper demonstrates
how the static Intermediate Representation (IR) of the code can guide
NUMA/prefetcher optimizations without the prohibitive cost of performance
profiling. We propose a method to create a comprehensive dataset that includes
a diverse set of intermediate representations along with optimum
configurations. We then apply a graph neural network model in order to validate
this dataset. We show that our static intermediate representation based model
achieves 80% of the performance gains provided by expensive dynamic performance
profiling based strategies. We further develop a hybrid model that uses both
static and dynamic information. Our hybrid model achieves the same gains as the
dynamic models but at a reduced cost by only profiling 30% of the programs.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Distributed, Parallel, and Cluster Computing</subject><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFjrsOgkAQAK-xMOoHWLk_IB4ixtYYX4miMVqTjSxwEQ6yd_j6eoXYW00zmYwQfVc6k5nvyxHyU92d8Vh6jpRT122LZEfIWukEttoS5xQptAQnKpkMaYtWFdpAZWplzVimEFDFmH1hHwXfDMQFQ3DZzwF1BEemmOw1JTZwKK3K1btJdEUrxsxQ78eOGKyW58Vm2CyFJasc-RXWa2Gz5v03Prq5RYI</recordid><startdate>20220301</startdate><enddate>20220301</enddate><creator>TehraniJamsaz, Ali</creator><creator>Popov, Mihail</creator><creator>Dutta, Akash</creator><creator>Saillard, Emmanuelle</creator><creator>Jannesari, Ali</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20220301</creationdate><title>Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization</title><author>TehraniJamsaz, Ali ; Popov, Mihail ; Dutta, Akash ; Saillard, Emmanuelle ; Jannesari, Ali</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2203_006113</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Distributed, Parallel, and Cluster Computing</topic><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>TehraniJamsaz, Ali</creatorcontrib><creatorcontrib>Popov, Mihail</creatorcontrib><creatorcontrib>Dutta, Akash</creatorcontrib><creatorcontrib>Saillard, Emmanuelle</creatorcontrib><creatorcontrib>Jannesari, Ali</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>TehraniJamsaz, Ali</au><au>Popov, Mihail</au><au>Dutta, Akash</au><au>Saillard, Emmanuelle</au><au>Jannesari, Ali</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization</atitle><date>2022-03-01</date><risdate>2022</risdate><abstract>There is a large space of NUMA and hardware prefetcher configurations that
can significantly impact the performance of an application. Previous studies
have demonstrated how a model can automatically select configurations based on
the dynamic properties of the code to achieve speedups. This paper demonstrates
how the static Intermediate Representation (IR) of the code can guide
NUMA/prefetcher optimizations without the prohibitive cost of performance
profiling. We propose a method to create a comprehensive dataset that includes
a diverse set of intermediate representations along with optimum
configurations. We then apply a graph neural network model in order to validate
this dataset. We show that our static intermediate representation based model
achieves 80% of the performance gains provided by expensive dynamic performance
profiling based strategies. We further develop a hybrid model that uses both
static and dynamic information. Our hybrid model achieves the same gains as the
dynamic models but at a reduced cost by only profiling 30% of the programs.</abstract><doi>10.48550/arxiv.2203.00611</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2203.00611 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2203_00611 |
source | arXiv.org |
subjects | Computer Science - Artificial Intelligence Computer Science - Distributed, Parallel, and Cluster Computing Computer Science - Learning |
title | Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T18%3A32%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Learning%20Intermediate%20Representations%20using%20Graph%20Neural%20Networks%20for%20NUMA%20and%20Prefetchers%20Optimization&rft.au=TehraniJamsaz,%20Ali&rft.date=2022-03-01&rft_id=info:doi/10.48550/arxiv.2203.00611&rft_dat=%3Carxiv_GOX%3E2203_00611%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |