Scout: An Experienced Guide to Find the Best Cloud Configuration

Finding the right cloud configuration for workloads is an essential step to ensure good performance and contain running costs. A poor choice of cloud configuration decreases application performance and increases running cost significantly. While Bayesian Optimization is effective and applicable to a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Hsu, Chin-Jung, Nair, Vivek, Menzies, Tim, Freeh, Vincent W
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Hsu, Chin-Jung
Nair, Vivek
Menzies, Tim
Freeh, Vincent W
description Finding the right cloud configuration for workloads is an essential step to ensure good performance and contain running costs. A poor choice of cloud configuration decreases application performance and increases running cost significantly. While Bayesian Optimization is effective and applicable to any workloads, it is fragile because performance and workload are hard to model (to predict). In this paper, we propose a novel method, SCOUT. The central insight of SCOUT is that using prior measurements, even those for different workloads, improves search performance and reduces search cost. At its core, SCOUT extracts search hints (inference of resource requirements) from low-level performance metrics. Such hints enable SCOUT to navigate through the search space more efficiently---only spotlight region will be searched. We evaluate SCOUT with 107 workloads on Apache Hadoop and Spark. The experimental results demonstrate that our approach finds better cloud configurations with a lower search cost than state of the art methods. Based on this work, we conclude that (i) low-level performance information is necessary for finding the right cloud configuration in an effective, efficient and reliable way, and (ii) a search method can be guided by historical data, thereby reducing cost and improving performance.
doi_str_mv 10.48550/arxiv.1803.01296
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1803_01296</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1803_01296</sourcerecordid><originalsourceid>FETCH-LOGICAL-a676-3611ad147632d8b74f9aa000a5e9e7935c99c205563a85061e271da3a1f228c3</originalsourceid><addsrcrecordid>eNotz7tOwzAUgGEvHVDLAzDhF0jwJb4xUaJekCoxlD062CdgqdiV66Dy9ojS6d9-6SPkjrO2s0qxByjn-N1yy2TLuHD6hjztfZ7qI10mujofsURMHgPdTDEgrZmuYwq0fiJ9xlOl_SFPgfY5jfFjKlBjTgsyG-Fwwttr52S_Xr3122b3unnpl7sGtNGN1JxD4J3RUgT7brrRATDGQKFD46TyznnBlNISrGKaozA8gAQ-CmG9nJP7_-tFMBxL_ILyM_xJhotE_gLTlEGH</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Scout: An Experienced Guide to Find the Best Cloud Configuration</title><source>arXiv.org</source><creator>Hsu, Chin-Jung ; Nair, Vivek ; Menzies, Tim ; Freeh, Vincent W</creator><creatorcontrib>Hsu, Chin-Jung ; Nair, Vivek ; Menzies, Tim ; Freeh, Vincent W</creatorcontrib><description>Finding the right cloud configuration for workloads is an essential step to ensure good performance and contain running costs. A poor choice of cloud configuration decreases application performance and increases running cost significantly. While Bayesian Optimization is effective and applicable to any workloads, it is fragile because performance and workload are hard to model (to predict). In this paper, we propose a novel method, SCOUT. The central insight of SCOUT is that using prior measurements, even those for different workloads, improves search performance and reduces search cost. At its core, SCOUT extracts search hints (inference of resource requirements) from low-level performance metrics. Such hints enable SCOUT to navigate through the search space more efficiently---only spotlight region will be searched. We evaluate SCOUT with 107 workloads on Apache Hadoop and Spark. The experimental results demonstrate that our approach finds better cloud configurations with a lower search cost than state of the art methods. Based on this work, we conclude that (i) low-level performance information is necessary for finding the right cloud configuration in an effective, efficient and reliable way, and (ii) a search method can be guided by historical data, thereby reducing cost and improving performance.</description><identifier>DOI: 10.48550/arxiv.1803.01296</identifier><language>eng</language><subject>Computer Science - Distributed, Parallel, and Cluster Computing</subject><creationdate>2018-03</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,782,887</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1803.01296$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1803.01296$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Hsu, Chin-Jung</creatorcontrib><creatorcontrib>Nair, Vivek</creatorcontrib><creatorcontrib>Menzies, Tim</creatorcontrib><creatorcontrib>Freeh, Vincent W</creatorcontrib><title>Scout: An Experienced Guide to Find the Best Cloud Configuration</title><description>Finding the right cloud configuration for workloads is an essential step to ensure good performance and contain running costs. A poor choice of cloud configuration decreases application performance and increases running cost significantly. While Bayesian Optimization is effective and applicable to any workloads, it is fragile because performance and workload are hard to model (to predict). In this paper, we propose a novel method, SCOUT. The central insight of SCOUT is that using prior measurements, even those for different workloads, improves search performance and reduces search cost. At its core, SCOUT extracts search hints (inference of resource requirements) from low-level performance metrics. Such hints enable SCOUT to navigate through the search space more efficiently---only spotlight region will be searched. We evaluate SCOUT with 107 workloads on Apache Hadoop and Spark. The experimental results demonstrate that our approach finds better cloud configurations with a lower search cost than state of the art methods. Based on this work, we conclude that (i) low-level performance information is necessary for finding the right cloud configuration in an effective, efficient and reliable way, and (ii) a search method can be guided by historical data, thereby reducing cost and improving performance.</description><subject>Computer Science - Distributed, Parallel, and Cluster Computing</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz7tOwzAUgGEvHVDLAzDhF0jwJb4xUaJekCoxlD062CdgqdiV66Dy9ojS6d9-6SPkjrO2s0qxByjn-N1yy2TLuHD6hjztfZ7qI10mujofsURMHgPdTDEgrZmuYwq0fiJ9xlOl_SFPgfY5jfFjKlBjTgsyG-Fwwttr52S_Xr3122b3unnpl7sGtNGN1JxD4J3RUgT7brrRATDGQKFD46TyznnBlNISrGKaozA8gAQ-CmG9nJP7_-tFMBxL_ILyM_xJhotE_gLTlEGH</recordid><startdate>20180303</startdate><enddate>20180303</enddate><creator>Hsu, Chin-Jung</creator><creator>Nair, Vivek</creator><creator>Menzies, Tim</creator><creator>Freeh, Vincent W</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20180303</creationdate><title>Scout: An Experienced Guide to Find the Best Cloud Configuration</title><author>Hsu, Chin-Jung ; Nair, Vivek ; Menzies, Tim ; Freeh, Vincent W</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a676-3611ad147632d8b74f9aa000a5e9e7935c99c205563a85061e271da3a1f228c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Computer Science - Distributed, Parallel, and Cluster Computing</topic><toplevel>online_resources</toplevel><creatorcontrib>Hsu, Chin-Jung</creatorcontrib><creatorcontrib>Nair, Vivek</creatorcontrib><creatorcontrib>Menzies, Tim</creatorcontrib><creatorcontrib>Freeh, Vincent W</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hsu, Chin-Jung</au><au>Nair, Vivek</au><au>Menzies, Tim</au><au>Freeh, Vincent W</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Scout: An Experienced Guide to Find the Best Cloud Configuration</atitle><date>2018-03-03</date><risdate>2018</risdate><abstract>Finding the right cloud configuration for workloads is an essential step to ensure good performance and contain running costs. A poor choice of cloud configuration decreases application performance and increases running cost significantly. While Bayesian Optimization is effective and applicable to any workloads, it is fragile because performance and workload are hard to model (to predict). In this paper, we propose a novel method, SCOUT. The central insight of SCOUT is that using prior measurements, even those for different workloads, improves search performance and reduces search cost. At its core, SCOUT extracts search hints (inference of resource requirements) from low-level performance metrics. Such hints enable SCOUT to navigate through the search space more efficiently---only spotlight region will be searched. We evaluate SCOUT with 107 workloads on Apache Hadoop and Spark. The experimental results demonstrate that our approach finds better cloud configurations with a lower search cost than state of the art methods. Based on this work, we conclude that (i) low-level performance information is necessary for finding the right cloud configuration in an effective, efficient and reliable way, and (ii) a search method can be guided by historical data, thereby reducing cost and improving performance.</abstract><doi>10.48550/arxiv.1803.01296</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1803.01296
ispartof
issn
language eng
recordid cdi_arxiv_primary_1803_01296
source arXiv.org
subjects Computer Science - Distributed, Parallel, and Cluster Computing
title Scout: An Experienced Guide to Find the Best Cloud Configuration
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-04T20%3A37%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Scout:%20An%20Experienced%20Guide%20to%20Find%20the%20Best%20Cloud%20Configuration&rft.au=Hsu,%20Chin-Jung&rft.date=2018-03-03&rft_id=info:doi/10.48550/arxiv.1803.01296&rft_dat=%3Carxiv_GOX%3E1803_01296%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true