Ballooning Graphics Memory Space in Full GPU Virtualization Environments

Advances in virtualization technology have enabled multiple virtual machines (VMs) to share resources in a physical machine (PM). With the widespread use of graphics-intensive applications, such as two-dimensional (2D) or 3D rendering, many graphics processing unit (GPU) virtualization solutions hav...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Scientific programming 2019-01, Vol.2019 (2019), p.1-11
Hauptverfasser:	Park, Younghun, Park, Sungyong, Gu, Minwoo
Format:	Artikel
Sprache:	eng
Schlagworte:	Application programming interface Benchmarks Elasticity Graphics boards Graphics processing units International conferences Linux Operating systems Performance degradation Performance enhancement Product development Servers Software Virtual environments
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	11
container_issue	2019
container_start_page	1
container_title	Scientific programming
container_volume	2019
creator	Park, Younghun Park, Sungyong Gu, Minwoo
description	Advances in virtualization technology have enabled multiple virtual machines (VMs) to share resources in a physical machine (PM). With the widespread use of graphics-intensive applications, such as two-dimensional (2D) or 3D rendering, many graphics processing unit (GPU) virtualization solutions have been proposed to provide high-performance GPU services in a virtualized environment. Although elasticity is one of the major benefits in this environment, the allocation of GPU memory is still static in the sense that after the GPU memory is allocated to a VM, it is not possible to change the memory size at runtime. This causes underutilization of GPU memory or performance degradation of a GPU application due to the lack of GPU memory when an application requires a large amount of GPU memory. In this paper, we propose a GPU memory ballooning solution called gBalloon that dynamically adjusts the GPU memory size at runtime according to the GPU memory requirement of each VM and the GPU memory sharing overhead. The gBalloon extends the GPU memory size of a VM by detecting performance degradation due to the lack of GPU memory. The gBalloon also reduces the GPU memory size when the overcommitted or underutilized GPU memory of a VM creates additional overhead for the GPU context switch or the CPU load due to GPU memory sharing among the VMs. We implemented the gBalloon by modifying the gVirt, a full GPU virtualization solution for Intel’s integrated GPUs. Benchmarking results show that the gBalloon dynamically adjusts the GPU memory size at runtime, which improves the performance by up to 8% against the gVirt with 384 MB of high global graphics memory and 32% against the gVirt with 1024 MB of high global graphics memory.
doi_str_mv	10.1155/2019/5240956
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2220168946</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2220168946</sourcerecordid><originalsourceid>FETCH-LOGICAL-c317t-7998a2b53f89e59fbe7144b4be2bc2aa00e021d6b36a3078eaa5e23285565ce53</originalsourceid><addsrcrecordid>eNqF0M9LwzAUwPEgCs7pzbMEPGpdkiZtctSxH8JEQSfeymuXuowuqUmrzL_ejgoePb13-PAefBE6p-SGUiFGjFA1EowTJZIDNKAyFZGi6u2w24mQkWKcH6OTEDaEUEkJGaD5HVSVc9bYdzzzUK9NEfCD3jq_w881FBobi6dtVeHZ0xK_Gt-0UJlvaIyzeGI_jXd2q20TTtFRCVXQZ79ziJbTyct4Hi0eZ_fj20VUxDRtolQpCSwXcSmVFqrMdUo5z3muWV4wAEI0YXSV5HECMUmlBhCaxUwKkYhCi3iILvu7tXcfrQ5NtnGtt93LjLEuQCIVTzp13avCuxC8LrPamy34XUZJtm-V7Vtlv606ftXztbEr-DL_6Yte687oEv40oyTlNP4B6z5x7w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2220168946</pqid></control><display><type>article</type><title>Ballooning Graphics Memory Space in Full GPU Virtualization Environments</title><source>EZB-FREE-00999 freely available EZB journals</source><source>Wiley Online Library (Open Access Collection)</source><source>Alma/SFX Local Collection</source><creator>Park, Younghun ; Park, Sungyong ; Gu, Minwoo</creator><contributor>Abdelrahman, Tarek ; Tarek Abdelrahman</contributor><creatorcontrib>Park, Younghun ; Park, Sungyong ; Gu, Minwoo ; Abdelrahman, Tarek ; Tarek Abdelrahman</creatorcontrib><description>Advances in virtualization technology have enabled multiple virtual machines (VMs) to share resources in a physical machine (PM). With the widespread use of graphics-intensive applications, such as two-dimensional (2D) or 3D rendering, many graphics processing unit (GPU) virtualization solutions have been proposed to provide high-performance GPU services in a virtualized environment. Although elasticity is one of the major benefits in this environment, the allocation of GPU memory is still static in the sense that after the GPU memory is allocated to a VM, it is not possible to change the memory size at runtime. This causes underutilization of GPU memory or performance degradation of a GPU application due to the lack of GPU memory when an application requires a large amount of GPU memory. In this paper, we propose a GPU memory ballooning solution called gBalloon that dynamically adjusts the GPU memory size at runtime according to the GPU memory requirement of each VM and the GPU memory sharing overhead. The gBalloon extends the GPU memory size of a VM by detecting performance degradation due to the lack of GPU memory. The gBalloon also reduces the GPU memory size when the overcommitted or underutilized GPU memory of a VM creates additional overhead for the GPU context switch or the CPU load due to GPU memory sharing among the VMs. We implemented the gBalloon by modifying the gVirt, a full GPU virtualization solution for Intel’s integrated GPUs. Benchmarking results show that the gBalloon dynamically adjusts the GPU memory size at runtime, which improves the performance by up to 8% against the gVirt with 384 MB of high global graphics memory and 32% against the gVirt with 1024 MB of high global graphics memory.</description><identifier>ISSN: 1058-9244</identifier><identifier>EISSN: 1875-919X</identifier><identifier>DOI: 10.1155/2019/5240956</identifier><language>eng</language><publisher>Cairo, Egypt: Hindawi Publishing Corporation</publisher><subject>Application programming interface ; Benchmarks ; Elasticity ; Graphics boards ; Graphics processing units ; International conferences ; Linux ; Operating systems ; Performance degradation ; Performance enhancement ; Product development ; Servers ; Software ; Virtual environments</subject><ispartof>Scientific programming, 2019-01, Vol.2019 (2019), p.1-11</ispartof><rights>Copyright © 2019 Younghun Park et al.</rights><rights>Copyright © 2019 Younghun Park et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c317t-7998a2b53f89e59fbe7144b4be2bc2aa00e021d6b36a3078eaa5e23285565ce53</cites><orcidid>0000-0002-0309-1820</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><contributor>Abdelrahman, Tarek</contributor><contributor>Tarek Abdelrahman</contributor><creatorcontrib>Park, Younghun</creatorcontrib><creatorcontrib>Park, Sungyong</creatorcontrib><creatorcontrib>Gu, Minwoo</creatorcontrib><title>Ballooning Graphics Memory Space in Full GPU Virtualization Environments</title><title>Scientific programming</title><description>Advances in virtualization technology have enabled multiple virtual machines (VMs) to share resources in a physical machine (PM). With the widespread use of graphics-intensive applications, such as two-dimensional (2D) or 3D rendering, many graphics processing unit (GPU) virtualization solutions have been proposed to provide high-performance GPU services in a virtualized environment. Although elasticity is one of the major benefits in this environment, the allocation of GPU memory is still static in the sense that after the GPU memory is allocated to a VM, it is not possible to change the memory size at runtime. This causes underutilization of GPU memory or performance degradation of a GPU application due to the lack of GPU memory when an application requires a large amount of GPU memory. In this paper, we propose a GPU memory ballooning solution called gBalloon that dynamically adjusts the GPU memory size at runtime according to the GPU memory requirement of each VM and the GPU memory sharing overhead. The gBalloon extends the GPU memory size of a VM by detecting performance degradation due to the lack of GPU memory. The gBalloon also reduces the GPU memory size when the overcommitted or underutilized GPU memory of a VM creates additional overhead for the GPU context switch or the CPU load due to GPU memory sharing among the VMs. We implemented the gBalloon by modifying the gVirt, a full GPU virtualization solution for Intel’s integrated GPUs. Benchmarking results show that the gBalloon dynamically adjusts the GPU memory size at runtime, which improves the performance by up to 8% against the gVirt with 384 MB of high global graphics memory and 32% against the gVirt with 1024 MB of high global graphics memory.</description><subject>Application programming interface</subject><subject>Benchmarks</subject><subject>Elasticity</subject><subject>Graphics boards</subject><subject>Graphics processing units</subject><subject>International conferences</subject><subject>Linux</subject><subject>Operating systems</subject><subject>Performance degradation</subject><subject>Performance enhancement</subject><subject>Product development</subject><subject>Servers</subject><subject>Software</subject><subject>Virtual environments</subject><issn>1058-9244</issn><issn>1875-919X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>RHX</sourceid><recordid>eNqF0M9LwzAUwPEgCs7pzbMEPGpdkiZtctSxH8JEQSfeymuXuowuqUmrzL_ejgoePb13-PAefBE6p-SGUiFGjFA1EowTJZIDNKAyFZGi6u2w24mQkWKcH6OTEDaEUEkJGaD5HVSVc9bYdzzzUK9NEfCD3jq_w881FBobi6dtVeHZ0xK_Gt-0UJlvaIyzeGI_jXd2q20TTtFRCVXQZ79ziJbTyct4Hi0eZ_fj20VUxDRtolQpCSwXcSmVFqrMdUo5z3muWV4wAEI0YXSV5HECMUmlBhCaxUwKkYhCi3iILvu7tXcfrQ5NtnGtt93LjLEuQCIVTzp13avCuxC8LrPamy34XUZJtm-V7Vtlv606ftXztbEr-DL_6Yte687oEv40oyTlNP4B6z5x7w</recordid><startdate>20190101</startdate><enddate>20190101</enddate><creator>Park, Younghun</creator><creator>Park, Sungyong</creator><creator>Gu, Minwoo</creator><general>Hindawi Publishing Corporation</general><general>Hindawi</general><general>Hindawi Limited</general><scope>ADJCN</scope><scope>AHFXO</scope><scope>RHU</scope><scope>RHW</scope><scope>RHX</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-0309-1820</orcidid></search><sort><creationdate>20190101</creationdate><title>Ballooning Graphics Memory Space in Full GPU Virtualization Environments</title><author>Park, Younghun ; Park, Sungyong ; Gu, Minwoo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c317t-7998a2b53f89e59fbe7144b4be2bc2aa00e021d6b36a3078eaa5e23285565ce53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Application programming interface</topic><topic>Benchmarks</topic><topic>Elasticity</topic><topic>Graphics boards</topic><topic>Graphics processing units</topic><topic>International conferences</topic><topic>Linux</topic><topic>Operating systems</topic><topic>Performance degradation</topic><topic>Performance enhancement</topic><topic>Product development</topic><topic>Servers</topic><topic>Software</topic><topic>Virtual environments</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Park, Younghun</creatorcontrib><creatorcontrib>Park, Sungyong</creatorcontrib><creatorcontrib>Gu, Minwoo</creatorcontrib><collection>الدوريات العلمية والإحصائية - e-Marefa Academic and Statistical Periodicals</collection><collection>معرفة - المحتوى العربي الأكاديمي المتكامل - e-Marefa Academic Complete</collection><collection>Hindawi Publishing Complete</collection><collection>Hindawi Publishing Subscription Journals</collection><collection>Hindawi Publishing Open Access Journals</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Scientific programming</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Park, Younghun</au><au>Park, Sungyong</au><au>Gu, Minwoo</au><au>Abdelrahman, Tarek</au><au>Tarek Abdelrahman</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Ballooning Graphics Memory Space in Full GPU Virtualization Environments</atitle><jtitle>Scientific programming</jtitle><date>2019-01-01</date><risdate>2019</risdate><volume>2019</volume><issue>2019</issue><spage>1</spage><epage>11</epage><pages>1-11</pages><issn>1058-9244</issn><eissn>1875-919X</eissn><abstract>Advances in virtualization technology have enabled multiple virtual machines (VMs) to share resources in a physical machine (PM). With the widespread use of graphics-intensive applications, such as two-dimensional (2D) or 3D rendering, many graphics processing unit (GPU) virtualization solutions have been proposed to provide high-performance GPU services in a virtualized environment. Although elasticity is one of the major benefits in this environment, the allocation of GPU memory is still static in the sense that after the GPU memory is allocated to a VM, it is not possible to change the memory size at runtime. This causes underutilization of GPU memory or performance degradation of a GPU application due to the lack of GPU memory when an application requires a large amount of GPU memory. In this paper, we propose a GPU memory ballooning solution called gBalloon that dynamically adjusts the GPU memory size at runtime according to the GPU memory requirement of each VM and the GPU memory sharing overhead. The gBalloon extends the GPU memory size of a VM by detecting performance degradation due to the lack of GPU memory. The gBalloon also reduces the GPU memory size when the overcommitted or underutilized GPU memory of a VM creates additional overhead for the GPU context switch or the CPU load due to GPU memory sharing among the VMs. We implemented the gBalloon by modifying the gVirt, a full GPU virtualization solution for Intel’s integrated GPUs. Benchmarking results show that the gBalloon dynamically adjusts the GPU memory size at runtime, which improves the performance by up to 8% against the gVirt with 384 MB of high global graphics memory and 32% against the gVirt with 1024 MB of high global graphics memory.</abstract><cop>Cairo, Egypt</cop><pub>Hindawi Publishing Corporation</pub><doi>10.1155/2019/5240956</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0002-0309-1820</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1058-9244
ispartof	Scientific programming, 2019-01, Vol.2019 (2019), p.1-11
issn	1058-9244 1875-919X
language	eng
recordid	cdi_proquest_journals_2220168946
source	EZB-FREE-00999 freely available EZB journals; Wiley Online Library (Open Access Collection); Alma/SFX Local Collection
subjects	Application programming interface Benchmarks Elasticity Graphics boards Graphics processing units International conferences Linux Operating systems Performance degradation Performance enhancement Product development Servers Software Virtual environments
title	Ballooning Graphics Memory Space in Full GPU Virtualization Environments
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T12%3A19%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Ballooning%20Graphics%20Memory%20Space%20in%20Full%20GPU%20Virtualization%20Environments&rft.jtitle=Scientific%20programming&rft.au=Park,%20Younghun&rft.date=2019-01-01&rft.volume=2019&rft.issue=2019&rft.spage=1&rft.epage=11&rft.pages=1-11&rft.issn=1058-9244&rft.eissn=1875-919X&rft_id=info:doi/10.1155/2019/5240956&rft_dat=%3Cproquest_cross%3E2220168946%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2220168946&rft_id=info:pmid/&rfr_iscdi=true