Ballooning Graphics Memory Space in Full GPU Virtualization Environments

Advances in virtualization technology have enabled multiple virtual machines (VMs) to share resources in a physical machine (PM). With the widespread use of graphics-intensive applications, such as two-dimensional (2D) or 3D rendering, many graphics processing unit (GPU) virtualization solutions hav...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientific programming 2019-01, Vol.2019 (2019), p.1-11
Hauptverfasser: Park, Younghun, Park, Sungyong, Gu, Minwoo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 11
container_issue 2019
container_start_page 1
container_title Scientific programming
container_volume 2019
creator Park, Younghun
Park, Sungyong
Gu, Minwoo
description Advances in virtualization technology have enabled multiple virtual machines (VMs) to share resources in a physical machine (PM). With the widespread use of graphics-intensive applications, such as two-dimensional (2D) or 3D rendering, many graphics processing unit (GPU) virtualization solutions have been proposed to provide high-performance GPU services in a virtualized environment. Although elasticity is one of the major benefits in this environment, the allocation of GPU memory is still static in the sense that after the GPU memory is allocated to a VM, it is not possible to change the memory size at runtime. This causes underutilization of GPU memory or performance degradation of a GPU application due to the lack of GPU memory when an application requires a large amount of GPU memory. In this paper, we propose a GPU memory ballooning solution called gBalloon that dynamically adjusts the GPU memory size at runtime according to the GPU memory requirement of each VM and the GPU memory sharing overhead. The gBalloon extends the GPU memory size of a VM by detecting performance degradation due to the lack of GPU memory. The gBalloon also reduces the GPU memory size when the overcommitted or underutilized GPU memory of a VM creates additional overhead for the GPU context switch or the CPU load due to GPU memory sharing among the VMs. We implemented the gBalloon by modifying the gVirt, a full GPU virtualization solution for Intel’s integrated GPUs. Benchmarking results show that the gBalloon dynamically adjusts the GPU memory size at runtime, which improves the performance by up to 8% against the gVirt with 384 MB of high global graphics memory and 32% against the gVirt with 1024 MB of high global graphics memory.
doi_str_mv 10.1155/2019/5240956
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2220168946</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2220168946</sourcerecordid><originalsourceid>FETCH-LOGICAL-c317t-7998a2b53f89e59fbe7144b4be2bc2aa00e021d6b36a3078eaa5e23285565ce53</originalsourceid><addsrcrecordid>eNqF0M9LwzAUwPEgCs7pzbMEPGpdkiZtctSxH8JEQSfeymuXuowuqUmrzL_ejgoePb13-PAefBE6p-SGUiFGjFA1EowTJZIDNKAyFZGi6u2w24mQkWKcH6OTEDaEUEkJGaD5HVSVc9bYdzzzUK9NEfCD3jq_w881FBobi6dtVeHZ0xK_Gt-0UJlvaIyzeGI_jXd2q20TTtFRCVXQZ79ziJbTyct4Hi0eZ_fj20VUxDRtolQpCSwXcSmVFqrMdUo5z3muWV4wAEI0YXSV5HECMUmlBhCaxUwKkYhCi3iILvu7tXcfrQ5NtnGtt93LjLEuQCIVTzp13avCuxC8LrPamy34XUZJtm-V7Vtlv606ftXztbEr-DL_6Yte687oEv40oyTlNP4B6z5x7w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2220168946</pqid></control><display><type>article</type><title>Ballooning Graphics Memory Space in Full GPU Virtualization Environments</title><source>EZB-FREE-00999 freely available EZB journals</source><source>Wiley Online Library (Open Access Collection)</source><source>Alma/SFX Local Collection</source><creator>Park, Younghun ; Park, Sungyong ; Gu, Minwoo</creator><contributor>Abdelrahman, Tarek ; Tarek Abdelrahman</contributor><creatorcontrib>Park, Younghun ; Park, Sungyong ; Gu, Minwoo ; Abdelrahman, Tarek ; Tarek Abdelrahman</creatorcontrib><description>Advances in virtualization technology have enabled multiple virtual machines (VMs) to share resources in a physical machine (PM). With the widespread use of graphics-intensive applications, such as two-dimensional (2D) or 3D rendering, many graphics processing unit (GPU) virtualization solutions have been proposed to provide high-performance GPU services in a virtualized environment. Although elasticity is one of the major benefits in this environment, the allocation of GPU memory is still static in the sense that after the GPU memory is allocated to a VM, it is not possible to change the memory size at runtime. This causes underutilization of GPU memory or performance degradation of a GPU application due to the lack of GPU memory when an application requires a large amount of GPU memory. In this paper, we propose a GPU memory ballooning solution called gBalloon that dynamically adjusts the GPU memory size at runtime according to the GPU memory requirement of each VM and the GPU memory sharing overhead. The gBalloon extends the GPU memory size of a VM by detecting performance degradation due to the lack of GPU memory. The gBalloon also reduces the GPU memory size when the overcommitted or underutilized GPU memory of a VM creates additional overhead for the GPU context switch or the CPU load due to GPU memory sharing among the VMs. We implemented the gBalloon by modifying the gVirt, a full GPU virtualization solution for Intel’s integrated GPUs. Benchmarking results show that the gBalloon dynamically adjusts the GPU memory size at runtime, which improves the performance by up to 8% against the gVirt with 384 MB of high global graphics memory and 32% against the gVirt with 1024 MB of high global graphics memory.</description><identifier>ISSN: 1058-9244</identifier><identifier>EISSN: 1875-919X</identifier><identifier>DOI: 10.1155/2019/5240956</identifier><language>eng</language><publisher>Cairo, Egypt: Hindawi Publishing Corporation</publisher><subject>Application programming interface ; Benchmarks ; Elasticity ; Graphics boards ; Graphics processing units ; International conferences ; Linux ; Operating systems ; Performance degradation ; Performance enhancement ; Product development ; Servers ; Software ; Virtual environments</subject><ispartof>Scientific programming, 2019-01, Vol.2019 (2019), p.1-11</ispartof><rights>Copyright © 2019 Younghun Park et al.</rights><rights>Copyright © 2019 Younghun Park et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c317t-7998a2b53f89e59fbe7144b4be2bc2aa00e021d6b36a3078eaa5e23285565ce53</cites><orcidid>0000-0002-0309-1820</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><contributor>Abdelrahman, Tarek</contributor><contributor>Tarek Abdelrahman</contributor><creatorcontrib>Park, Younghun</creatorcontrib><creatorcontrib>Park, Sungyong</creatorcontrib><creatorcontrib>Gu, Minwoo</creatorcontrib><title>Ballooning Graphics Memory Space in Full GPU Virtualization Environments</title><title>Scientific programming</title><description>Advances in virtualization technology have enabled multiple virtual machines (VMs) to share resources in a physical machine (PM). With the widespread use of graphics-intensive applications, such as two-dimensional (2D) or 3D rendering, many graphics processing unit (GPU) virtualization solutions have been proposed to provide high-performance GPU services in a virtualized environment. Although elasticity is one of the major benefits in this environment, the allocation of GPU memory is still static in the sense that after the GPU memory is allocated to a VM, it is not possible to change the memory size at runtime. This causes underutilization of GPU memory or performance degradation of a GPU application due to the lack of GPU memory when an application requires a large amount of GPU memory. In this paper, we propose a GPU memory ballooning solution called gBalloon that dynamically adjusts the GPU memory size at runtime according to the GPU memory requirement of each VM and the GPU memory sharing overhead. The gBalloon extends the GPU memory size of a VM by detecting performance degradation due to the lack of GPU memory. The gBalloon also reduces the GPU memory size when the overcommitted or underutilized GPU memory of a VM creates additional overhead for the GPU context switch or the CPU load due to GPU memory sharing among the VMs. We implemented the gBalloon by modifying the gVirt, a full GPU virtualization solution for Intel’s integrated GPUs. Benchmarking results show that the gBalloon dynamically adjusts the GPU memory size at runtime, which improves the performance by up to 8% against the gVirt with 384 MB of high global graphics memory and 32% against the gVirt with 1024 MB of high global graphics memory.</description><subject>Application programming interface</subject><subject>Benchmarks</subject><subject>Elasticity</subject><subject>Graphics boards</subject><subject>Graphics processing units</subject><subject>International conferences</subject><subject>Linux</subject><subject>Operating systems</subject><subject>Performance degradation</subject><subject>Performance enhancement</subject><subject>Product development</subject><subject>Servers</subject><subject>Software</subject><subject>Virtual environments</subject><issn>1058-9244</issn><issn>1875-919X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>RHX</sourceid><recordid>eNqF0M9LwzAUwPEgCs7pzbMEPGpdkiZtctSxH8JEQSfeymuXuowuqUmrzL_ejgoePb13-PAefBE6p-SGUiFGjFA1EowTJZIDNKAyFZGi6u2w24mQkWKcH6OTEDaEUEkJGaD5HVSVc9bYdzzzUK9NEfCD3jq_w881FBobi6dtVeHZ0xK_Gt-0UJlvaIyzeGI_jXd2q20TTtFRCVXQZ79ziJbTyct4Hi0eZ_fj20VUxDRtolQpCSwXcSmVFqrMdUo5z3muWV4wAEI0YXSV5HECMUmlBhCaxUwKkYhCi3iILvu7tXcfrQ5NtnGtt93LjLEuQCIVTzp13avCuxC8LrPamy34XUZJtm-V7Vtlv606ftXztbEr-DL_6Yte687oEv40oyTlNP4B6z5x7w</recordid><startdate>20190101</startdate><enddate>20190101</enddate><creator>Park, Younghun</creator><creator>Park, Sungyong</creator><creator>Gu, Minwoo</creator><general>Hindawi Publishing Corporation</general><general>Hindawi</general><general>Hindawi Limited</general><scope>ADJCN</scope><scope>AHFXO</scope><scope>RHU</scope><scope>RHW</scope><scope>RHX</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-0309-1820</orcidid></search><sort><creationdate>20190101</creationdate><title>Ballooning Graphics Memory Space in Full GPU Virtualization Environments</title><author>Park, Younghun ; Park, Sungyong ; Gu, Minwoo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c317t-7998a2b53f89e59fbe7144b4be2bc2aa00e021d6b36a3078eaa5e23285565ce53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Application programming interface</topic><topic>Benchmarks</topic><topic>Elasticity</topic><topic>Graphics boards</topic><topic>Graphics processing units</topic><topic>International conferences</topic><topic>Linux</topic><topic>Operating systems</topic><topic>Performance degradation</topic><topic>Performance enhancement</topic><topic>Product development</topic><topic>Servers</topic><topic>Software</topic><topic>Virtual environments</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Park, Younghun</creatorcontrib><creatorcontrib>Park, Sungyong</creatorcontrib><creatorcontrib>Gu, Minwoo</creatorcontrib><collection>الدوريات العلمية والإحصائية - e-Marefa Academic and Statistical Periodicals</collection><collection>معرفة - المحتوى العربي الأكاديمي المتكامل - e-Marefa Academic Complete</collection><collection>Hindawi Publishing Complete</collection><collection>Hindawi Publishing Subscription Journals</collection><collection>Hindawi Publishing Open Access Journals</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Scientific programming</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Park, Younghun</au><au>Park, Sungyong</au><au>Gu, Minwoo</au><au>Abdelrahman, Tarek</au><au>Tarek Abdelrahman</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Ballooning Graphics Memory Space in Full GPU Virtualization Environments</atitle><jtitle>Scientific programming</jtitle><date>2019-01-01</date><risdate>2019</risdate><volume>2019</volume><issue>2019</issue><spage>1</spage><epage>11</epage><pages>1-11</pages><issn>1058-9244</issn><eissn>1875-919X</eissn><abstract>Advances in virtualization technology have enabled multiple virtual machines (VMs) to share resources in a physical machine (PM). With the widespread use of graphics-intensive applications, such as two-dimensional (2D) or 3D rendering, many graphics processing unit (GPU) virtualization solutions have been proposed to provide high-performance GPU services in a virtualized environment. Although elasticity is one of the major benefits in this environment, the allocation of GPU memory is still static in the sense that after the GPU memory is allocated to a VM, it is not possible to change the memory size at runtime. This causes underutilization of GPU memory or performance degradation of a GPU application due to the lack of GPU memory when an application requires a large amount of GPU memory. In this paper, we propose a GPU memory ballooning solution called gBalloon that dynamically adjusts the GPU memory size at runtime according to the GPU memory requirement of each VM and the GPU memory sharing overhead. The gBalloon extends the GPU memory size of a VM by detecting performance degradation due to the lack of GPU memory. The gBalloon also reduces the GPU memory size when the overcommitted or underutilized GPU memory of a VM creates additional overhead for the GPU context switch or the CPU load due to GPU memory sharing among the VMs. We implemented the gBalloon by modifying the gVirt, a full GPU virtualization solution for Intel’s integrated GPUs. Benchmarking results show that the gBalloon dynamically adjusts the GPU memory size at runtime, which improves the performance by up to 8% against the gVirt with 384 MB of high global graphics memory and 32% against the gVirt with 1024 MB of high global graphics memory.</abstract><cop>Cairo, Egypt</cop><pub>Hindawi Publishing Corporation</pub><doi>10.1155/2019/5240956</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0002-0309-1820</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1058-9244
ispartof Scientific programming, 2019-01, Vol.2019 (2019), p.1-11
issn 1058-9244
1875-919X
language eng
recordid cdi_proquest_journals_2220168946
source EZB-FREE-00999 freely available EZB journals; Wiley Online Library (Open Access Collection); Alma/SFX Local Collection
subjects Application programming interface
Benchmarks
Elasticity
Graphics boards
Graphics processing units
International conferences
Linux
Operating systems
Performance degradation
Performance enhancement
Product development
Servers
Software
Virtual environments
title Ballooning Graphics Memory Space in Full GPU Virtualization Environments
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T12%3A19%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Ballooning%20Graphics%20Memory%20Space%20in%20Full%20GPU%20Virtualization%20Environments&rft.jtitle=Scientific%20programming&rft.au=Park,%20Younghun&rft.date=2019-01-01&rft.volume=2019&rft.issue=2019&rft.spage=1&rft.epage=11&rft.pages=1-11&rft.issn=1058-9244&rft.eissn=1875-919X&rft_id=info:doi/10.1155/2019/5240956&rft_dat=%3Cproquest_cross%3E2220168946%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2220168946&rft_id=info:pmid/&rfr_iscdi=true