Cellular disco: resource management using virtual clusters on shared-memory multiprocessors

Despite the fact that large-scale shared-memory multiprocessors have been commercially available for several years, system software that fully utilizes all their features is still not available, mostly due to the complexity and cost of making the required changes to the operating system. A recently...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ACM transactions on computer systems 2000-08, Vol.18 (3), p.229-262
Hauptverfasser: Govil, Kinshuk, Teodosiu, Dan, Huang, Yongqiang, Rosenblum, Mendel
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 262
container_issue 3
container_start_page 229
container_title ACM transactions on computer systems
container_volume 18
creator Govil, Kinshuk
Teodosiu, Dan
Huang, Yongqiang
Rosenblum, Mendel
description Despite the fact that large-scale shared-memory multiprocessors have been commercially available for several years, system software that fully utilizes all their features is still not available, mostly due to the complexity and cost of making the required changes to the operating system. A recently proposed approach, called Disco, substantially reduces this development cost by using a virtual machine monitor that laverages the existing operating system technology. In this paper we present a system called Cellular Disco that extends the Disco work to provide all the advantages of the hardware partitioning and scalable operating system approaches. We argue that Cellular Disco can achieve these benefits at only a small fraction of the development cost of modifying the operating system. Cellular Disco effectively turns a large-scale shared-memory multiprocessor into a virtual cluster that supports fault containment and heterogeneity, while avoiding operating system scalability bottlenecks. Yet at the same time, Cellular Disco preserves the benefits of a shared-memory multiprocessor by implementing dynamic, fine-grained resource sharing, and by allowing users to overcommit resources such as processors and memory. This hybrid approach requires a scalable resource manager that makes local decisions with limited information while still providing good global performance and fault containment. In this paper we describe our experience with a Cellular Disco prototype on a 32-processor SGI Origin 2000 system. We show that the execution time penalty for this approach is low, typically within 10% of the best available commercial operating system formost workloads, and that it can manage the CPU and memory resources of the machine significantly better than the hardware partitioning approach.
doi_str_mv 10.1145/354871.354873
format Article
fullrecord <record><control><sourceid>gale_proqu</sourceid><recordid>TN_cdi_proquest_journals_228016243</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A69200126</galeid><sourcerecordid>A69200126</sourcerecordid><originalsourceid>FETCH-LOGICAL-a338t-ec2523db0f60391b7b5e0da2e79d60711ad404fddd1f770ff0b332097680a8583</originalsourceid><addsrcrecordid>eNp9kL1PwzAQxS0EEqHAxsCGmEk5f8XOWFV8SZVYYLac2K5SJXGxk4H_HpdUwFChG55099O9p4fQFYY5xozfU86kwPNvoUcow5yLXFBKj1EGgrKcgMCn6CzGDQCkPcnQ5dK27djqcGOaWPtzdOJ0G-3FXmfo_fHhbfmcr16fXpaLVa4plUNua8IJNRW4AmiJK1FxC0YTK0pTJBOsDQPmjDHYCQHOQZXcoBSFBC25pDN0O_3dBv8x2jiojR9DnywVIRJwQRj9hda6tarpnR-CrruUUy2KkgBgUiTo7gC0tr0NuvW9dU1a_8XzA3gaY7um_oevg48xWKe2oel0-FQY1K51NbU-yS7z9cSnoD_o_vYFxjB3Iw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>228016243</pqid></control><display><type>article</type><title>Cellular disco: resource management using virtual clusters on shared-memory multiprocessors</title><source>Access via ACM Digital Library</source><creator>Govil, Kinshuk ; Teodosiu, Dan ; Huang, Yongqiang ; Rosenblum, Mendel</creator><creatorcontrib>Govil, Kinshuk ; Teodosiu, Dan ; Huang, Yongqiang ; Rosenblum, Mendel</creatorcontrib><description>Despite the fact that large-scale shared-memory multiprocessors have been commercially available for several years, system software that fully utilizes all their features is still not available, mostly due to the complexity and cost of making the required changes to the operating system. A recently proposed approach, called Disco, substantially reduces this development cost by using a virtual machine monitor that laverages the existing operating system technology. In this paper we present a system called Cellular Disco that extends the Disco work to provide all the advantages of the hardware partitioning and scalable operating system approaches. We argue that Cellular Disco can achieve these benefits at only a small fraction of the development cost of modifying the operating system. Cellular Disco effectively turns a large-scale shared-memory multiprocessor into a virtual cluster that supports fault containment and heterogeneity, while avoiding operating system scalability bottlenecks. Yet at the same time, Cellular Disco preserves the benefits of a shared-memory multiprocessor by implementing dynamic, fine-grained resource sharing, and by allowing users to overcommit resources such as processors and memory. This hybrid approach requires a scalable resource manager that makes local decisions with limited information while still providing good global performance and fault containment. In this paper we describe our experience with a Cellular Disco prototype on a 32-processor SGI Origin 2000 system. We show that the execution time penalty for this approach is low, typically within 10% of the best available commercial operating system formost workloads, and that it can manage the CPU and memory resources of the machine significantly better than the hardware partitioning approach.</description><identifier>ISSN: 0734-2071</identifier><identifier>EISSN: 1557-7333</identifier><identifier>DOI: 10.1145/354871.354873</identifier><language>eng</language><publisher>New York, NY, USA: ACM</publisher><subject>Analysis ; Architectures ; Computer network equipment industry ; Computer software industry ; Computer systems organization ; Contextual software domains ; Cross-computing tools and techniques ; Design ; Extra-functional properties ; File systems management ; General and reference ; Information storage systems ; Information systems ; Memory management ; Multiprocessors ; Operating systems ; Process management ; Product information ; Reliability ; Software ; Software and its engineering ; Software organization and properties ; Software reliability ; Storage management ; Workloads</subject><ispartof>ACM transactions on computer systems, 2000-08, Vol.18 (3), p.229-262</ispartof><rights>ACM</rights><rights>COPYRIGHT 2000 Association for Computing Machinery, Inc.</rights><rights>Copyright Association for Computing Machinery Aug 2000</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-a338t-ec2523db0f60391b7b5e0da2e79d60711ad404fddd1f770ff0b332097680a8583</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://dl.acm.org/doi/pdf/10.1145/354871.354873$$EPDF$$P50$$Gacm$$H</linktopdf><link.rule.ids>315,781,785,2283,27928,27929,40200,76232</link.rule.ids></links><search><creatorcontrib>Govil, Kinshuk</creatorcontrib><creatorcontrib>Teodosiu, Dan</creatorcontrib><creatorcontrib>Huang, Yongqiang</creatorcontrib><creatorcontrib>Rosenblum, Mendel</creatorcontrib><title>Cellular disco: resource management using virtual clusters on shared-memory multiprocessors</title><title>ACM transactions on computer systems</title><addtitle>ACM TOCS</addtitle><description>Despite the fact that large-scale shared-memory multiprocessors have been commercially available for several years, system software that fully utilizes all their features is still not available, mostly due to the complexity and cost of making the required changes to the operating system. A recently proposed approach, called Disco, substantially reduces this development cost by using a virtual machine monitor that laverages the existing operating system technology. In this paper we present a system called Cellular Disco that extends the Disco work to provide all the advantages of the hardware partitioning and scalable operating system approaches. We argue that Cellular Disco can achieve these benefits at only a small fraction of the development cost of modifying the operating system. Cellular Disco effectively turns a large-scale shared-memory multiprocessor into a virtual cluster that supports fault containment and heterogeneity, while avoiding operating system scalability bottlenecks. Yet at the same time, Cellular Disco preserves the benefits of a shared-memory multiprocessor by implementing dynamic, fine-grained resource sharing, and by allowing users to overcommit resources such as processors and memory. This hybrid approach requires a scalable resource manager that makes local decisions with limited information while still providing good global performance and fault containment. In this paper we describe our experience with a Cellular Disco prototype on a 32-processor SGI Origin 2000 system. We show that the execution time penalty for this approach is low, typically within 10% of the best available commercial operating system formost workloads, and that it can manage the CPU and memory resources of the machine significantly better than the hardware partitioning approach.</description><subject>Analysis</subject><subject>Architectures</subject><subject>Computer network equipment industry</subject><subject>Computer software industry</subject><subject>Computer systems organization</subject><subject>Contextual software domains</subject><subject>Cross-computing tools and techniques</subject><subject>Design</subject><subject>Extra-functional properties</subject><subject>File systems management</subject><subject>General and reference</subject><subject>Information storage systems</subject><subject>Information systems</subject><subject>Memory management</subject><subject>Multiprocessors</subject><subject>Operating systems</subject><subject>Process management</subject><subject>Product information</subject><subject>Reliability</subject><subject>Software</subject><subject>Software and its engineering</subject><subject>Software organization and properties</subject><subject>Software reliability</subject><subject>Storage management</subject><subject>Workloads</subject><issn>0734-2071</issn><issn>1557-7333</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2000</creationdate><recordtype>article</recordtype><recordid>eNp9kL1PwzAQxS0EEqHAxsCGmEk5f8XOWFV8SZVYYLac2K5SJXGxk4H_HpdUwFChG55099O9p4fQFYY5xozfU86kwPNvoUcow5yLXFBKj1EGgrKcgMCn6CzGDQCkPcnQ5dK27djqcGOaWPtzdOJ0G-3FXmfo_fHhbfmcr16fXpaLVa4plUNua8IJNRW4AmiJK1FxC0YTK0pTJBOsDQPmjDHYCQHOQZXcoBSFBC25pDN0O_3dBv8x2jiojR9DnywVIRJwQRj9hda6tarpnR-CrruUUy2KkgBgUiTo7gC0tr0NuvW9dU1a_8XzA3gaY7um_oevg48xWKe2oel0-FQY1K51NbU-yS7z9cSnoD_o_vYFxjB3Iw</recordid><startdate>20000801</startdate><enddate>20000801</enddate><creator>Govil, Kinshuk</creator><creator>Teodosiu, Dan</creator><creator>Huang, Yongqiang</creator><creator>Rosenblum, Mendel</creator><general>ACM</general><general>Association for Computing Machinery, Inc</general><general>Association for Computing Machinery</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20000801</creationdate><title>Cellular disco</title><author>Govil, Kinshuk ; Teodosiu, Dan ; Huang, Yongqiang ; Rosenblum, Mendel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a338t-ec2523db0f60391b7b5e0da2e79d60711ad404fddd1f770ff0b332097680a8583</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2000</creationdate><topic>Analysis</topic><topic>Architectures</topic><topic>Computer network equipment industry</topic><topic>Computer software industry</topic><topic>Computer systems organization</topic><topic>Contextual software domains</topic><topic>Cross-computing tools and techniques</topic><topic>Design</topic><topic>Extra-functional properties</topic><topic>File systems management</topic><topic>General and reference</topic><topic>Information storage systems</topic><topic>Information systems</topic><topic>Memory management</topic><topic>Multiprocessors</topic><topic>Operating systems</topic><topic>Process management</topic><topic>Product information</topic><topic>Reliability</topic><topic>Software</topic><topic>Software and its engineering</topic><topic>Software organization and properties</topic><topic>Software reliability</topic><topic>Storage management</topic><topic>Workloads</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Govil, Kinshuk</creatorcontrib><creatorcontrib>Teodosiu, Dan</creatorcontrib><creatorcontrib>Huang, Yongqiang</creatorcontrib><creatorcontrib>Rosenblum, Mendel</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>ACM transactions on computer systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Govil, Kinshuk</au><au>Teodosiu, Dan</au><au>Huang, Yongqiang</au><au>Rosenblum, Mendel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Cellular disco: resource management using virtual clusters on shared-memory multiprocessors</atitle><jtitle>ACM transactions on computer systems</jtitle><stitle>ACM TOCS</stitle><date>2000-08-01</date><risdate>2000</risdate><volume>18</volume><issue>3</issue><spage>229</spage><epage>262</epage><pages>229-262</pages><issn>0734-2071</issn><eissn>1557-7333</eissn><abstract>Despite the fact that large-scale shared-memory multiprocessors have been commercially available for several years, system software that fully utilizes all their features is still not available, mostly due to the complexity and cost of making the required changes to the operating system. A recently proposed approach, called Disco, substantially reduces this development cost by using a virtual machine monitor that laverages the existing operating system technology. In this paper we present a system called Cellular Disco that extends the Disco work to provide all the advantages of the hardware partitioning and scalable operating system approaches. We argue that Cellular Disco can achieve these benefits at only a small fraction of the development cost of modifying the operating system. Cellular Disco effectively turns a large-scale shared-memory multiprocessor into a virtual cluster that supports fault containment and heterogeneity, while avoiding operating system scalability bottlenecks. Yet at the same time, Cellular Disco preserves the benefits of a shared-memory multiprocessor by implementing dynamic, fine-grained resource sharing, and by allowing users to overcommit resources such as processors and memory. This hybrid approach requires a scalable resource manager that makes local decisions with limited information while still providing good global performance and fault containment. In this paper we describe our experience with a Cellular Disco prototype on a 32-processor SGI Origin 2000 system. We show that the execution time penalty for this approach is low, typically within 10% of the best available commercial operating system formost workloads, and that it can manage the CPU and memory resources of the machine significantly better than the hardware partitioning approach.</abstract><cop>New York, NY, USA</cop><pub>ACM</pub><doi>10.1145/354871.354873</doi><tpages>34</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0734-2071
ispartof ACM transactions on computer systems, 2000-08, Vol.18 (3), p.229-262
issn 0734-2071
1557-7333
language eng
recordid cdi_proquest_journals_228016243
source Access via ACM Digital Library
subjects Analysis
Architectures
Computer network equipment industry
Computer software industry
Computer systems organization
Contextual software domains
Cross-computing tools and techniques
Design
Extra-functional properties
File systems management
General and reference
Information storage systems
Information systems
Memory management
Multiprocessors
Operating systems
Process management
Product information
Reliability
Software
Software and its engineering
Software organization and properties
Software reliability
Storage management
Workloads
title Cellular disco: resource management using virtual clusters on shared-memory multiprocessors
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-17T11%3A19%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Cellular%20disco:%20resource%20management%20using%20virtual%20clusters%20on%20shared-memory%20multiprocessors&rft.jtitle=ACM%20transactions%20on%20computer%20systems&rft.au=Govil,%20Kinshuk&rft.date=2000-08-01&rft.volume=18&rft.issue=3&rft.spage=229&rft.epage=262&rft.pages=229-262&rft.issn=0734-2071&rft.eissn=1557-7333&rft_id=info:doi/10.1145/354871.354873&rft_dat=%3Cgale_proqu%3EA69200126%3C/gale_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=228016243&rft_id=info:pmid/&rft_galeid=A69200126&rfr_iscdi=true