Profit Maximization of Big Data Jobs in Cloud Using Stochastic Optimization

Reserved instances offered by cloud providers make it possible to reserve resources and computing capacity for a specific period of time. One should pay for all the hours of that time interval; in exchange, the hourly rate is significantly lower than on-demand instances. Reserved Instances can signi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on cloud computing 2021-10, Vol.9 (4), p.1563-1574
Hauptverfasser: Nabavinejad, Seyed Morteza, Goudarzi, Maziar
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1574
container_issue 4
container_start_page 1563
container_title IEEE transactions on cloud computing
container_volume 9
creator Nabavinejad, Seyed Morteza
Goudarzi, Maziar
description Reserved instances offered by cloud providers make it possible to reserve resources and computing capacity for a specific period of time. One should pay for all the hours of that time interval; in exchange, the hourly rate is significantly lower than on-demand instances. Reserved Instances can significantly reduce the monetary cost of resources needed to process big data applications in cloud. However, purchases of these instances are non-refundable, and hence, one should be able to estimate the required resources prior to purchase to avoid over-payment. It becomes important especially when the results obtained by big data job has monetary value, such as business intelligence applications. But, estimating the resource demand of big data processing jobs is hard because of numerous factors that affect them such as data locality, data skew, stragglers, internal settings of big data processing framework, interference among instances, instances availability, etc. To maximize the profit of processing such big data jobs in cloud considering fluctuating nature of their resource demand, as well as reserved instances limitations, we propose Reserved Instances Stochastic Allocation (RISA) approach. Using historical traces of resource demand of big data jobs submitted by user, RISA leverages stochastic optimization to determine the amount of resources needed to be reserved for that user to maximize the profit. Our evaluation using real-world traces shows that RISA can increase the net profit by up to 10x, compared to previous approaches. RISA can also find solutions as close as 2 percent to the best possible solution.
doi_str_mv 10.1109/TCC.2019.2926254
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_8752400</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8752400</ieee_id><sourcerecordid>2605708652</sourcerecordid><originalsourceid>FETCH-LOGICAL-c244t-54c897f1e7518f5ecbed71a217b4a5d606fa7660a1581e3f8ca13ced1073fa663</originalsourceid><addsrcrecordid>eNpNkE1LAzEQhoMoWGrvgpeA562Z7OZjj7pavyoVbM8hzSY1pd3UTQrqr3dLS3Eu7xyedwYehC6BDAFIeTOtqiElUA5pSTllxQnqUeAyE8Dh9N9-jgYxLkk3kkEJZQ-9vrfB-YTf9Ldf-1-dfGhwcPjOL_C9Thq_hHnEvsHVKmxrPIu-WeCPFMynjskbPNmkY-8CnTm9inZwyD6ajR6m1VM2njw-V7fjzNCiSBkrjCyFAysYSMesmdtagKYg5oVmNSfcacE50cAk2NxJoyE3tgYicqc5z_voen9304avrY1JLcO2bbqXinLCBJGc0Y4ie8q0IcbWOrVp_Vq3PwqI2llTnTW1s6YO1rrK1b7irbVHXApGC0LyP0K_ZzY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2605708652</pqid></control><display><type>article</type><title>Profit Maximization of Big Data Jobs in Cloud Using Stochastic Optimization</title><source>IEEE Electronic Library (IEL)</source><creator>Nabavinejad, Seyed Morteza ; Goudarzi, Maziar</creator><creatorcontrib>Nabavinejad, Seyed Morteza ; Goudarzi, Maziar</creatorcontrib><description>Reserved instances offered by cloud providers make it possible to reserve resources and computing capacity for a specific period of time. One should pay for all the hours of that time interval; in exchange, the hourly rate is significantly lower than on-demand instances. Reserved Instances can significantly reduce the monetary cost of resources needed to process big data applications in cloud. However, purchases of these instances are non-refundable, and hence, one should be able to estimate the required resources prior to purchase to avoid over-payment. It becomes important especially when the results obtained by big data job has monetary value, such as business intelligence applications. But, estimating the resource demand of big data processing jobs is hard because of numerous factors that affect them such as data locality, data skew, stragglers, internal settings of big data processing framework, interference among instances, instances availability, etc. To maximize the profit of processing such big data jobs in cloud considering fluctuating nature of their resource demand, as well as reserved instances limitations, we propose Reserved Instances Stochastic Allocation (RISA) approach. Using historical traces of resource demand of big data jobs submitted by user, RISA leverages stochastic optimization to determine the amount of resources needed to be reserved for that user to maximize the profit. Our evaluation using real-world traces shows that RISA can increase the net profit by up to 10x, compared to previous approaches. RISA can also find solutions as close as 2 percent to the best possible solution.</description><identifier>ISSN: 2168-7161</identifier><identifier>EISSN: 2168-7161</identifier><identifier>EISSN: 2372-0018</identifier><identifier>DOI: 10.1109/TCC.2019.2926254</identifier><identifier>CODEN: ITCCF6</identifier><language>eng</language><publisher>Piscataway: IEEE Computer Society</publisher><subject>Big Data ; big data processing ; Cloud computing ; Contracts ; Data processing ; Information management ; Intelligence (information) ; Optimization ; Probability distribution ; profit maximization ; reserved instances ; Resource management ; stochastic optimization</subject><ispartof>IEEE transactions on cloud computing, 2021-10, Vol.9 (4), p.1563-1574</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c244t-54c897f1e7518f5ecbed71a217b4a5d606fa7660a1581e3f8ca13ced1073fa663</cites><orcidid>0000-0002-5123-6318 ; 0000-0002-1272-4589</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8752400$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27903,27904,54736</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8752400$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Nabavinejad, Seyed Morteza</creatorcontrib><creatorcontrib>Goudarzi, Maziar</creatorcontrib><title>Profit Maximization of Big Data Jobs in Cloud Using Stochastic Optimization</title><title>IEEE transactions on cloud computing</title><addtitle>TCC</addtitle><description>Reserved instances offered by cloud providers make it possible to reserve resources and computing capacity for a specific period of time. One should pay for all the hours of that time interval; in exchange, the hourly rate is significantly lower than on-demand instances. Reserved Instances can significantly reduce the monetary cost of resources needed to process big data applications in cloud. However, purchases of these instances are non-refundable, and hence, one should be able to estimate the required resources prior to purchase to avoid over-payment. It becomes important especially when the results obtained by big data job has monetary value, such as business intelligence applications. But, estimating the resource demand of big data processing jobs is hard because of numerous factors that affect them such as data locality, data skew, stragglers, internal settings of big data processing framework, interference among instances, instances availability, etc. To maximize the profit of processing such big data jobs in cloud considering fluctuating nature of their resource demand, as well as reserved instances limitations, we propose Reserved Instances Stochastic Allocation (RISA) approach. Using historical traces of resource demand of big data jobs submitted by user, RISA leverages stochastic optimization to determine the amount of resources needed to be reserved for that user to maximize the profit. Our evaluation using real-world traces shows that RISA can increase the net profit by up to 10x, compared to previous approaches. RISA can also find solutions as close as 2 percent to the best possible solution.</description><subject>Big Data</subject><subject>big data processing</subject><subject>Cloud computing</subject><subject>Contracts</subject><subject>Data processing</subject><subject>Information management</subject><subject>Intelligence (information)</subject><subject>Optimization</subject><subject>Probability distribution</subject><subject>profit maximization</subject><subject>reserved instances</subject><subject>Resource management</subject><subject>stochastic optimization</subject><issn>2168-7161</issn><issn>2168-7161</issn><issn>2372-0018</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkE1LAzEQhoMoWGrvgpeA562Z7OZjj7pavyoVbM8hzSY1pd3UTQrqr3dLS3Eu7xyedwYehC6BDAFIeTOtqiElUA5pSTllxQnqUeAyE8Dh9N9-jgYxLkk3kkEJZQ-9vrfB-YTf9Ldf-1-dfGhwcPjOL_C9Thq_hHnEvsHVKmxrPIu-WeCPFMynjskbPNmkY-8CnTm9inZwyD6ajR6m1VM2njw-V7fjzNCiSBkrjCyFAysYSMesmdtagKYg5oVmNSfcacE50cAk2NxJoyE3tgYicqc5z_voen9304avrY1JLcO2bbqXinLCBJGc0Y4ie8q0IcbWOrVp_Vq3PwqI2llTnTW1s6YO1rrK1b7irbVHXApGC0LyP0K_ZzY</recordid><startdate>20211001</startdate><enddate>20211001</enddate><creator>Nabavinejad, Seyed Morteza</creator><creator>Goudarzi, Maziar</creator><general>IEEE Computer Society</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-5123-6318</orcidid><orcidid>https://orcid.org/0000-0002-1272-4589</orcidid></search><sort><creationdate>20211001</creationdate><title>Profit Maximization of Big Data Jobs in Cloud Using Stochastic Optimization</title><author>Nabavinejad, Seyed Morteza ; Goudarzi, Maziar</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c244t-54c897f1e7518f5ecbed71a217b4a5d606fa7660a1581e3f8ca13ced1073fa663</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Big Data</topic><topic>big data processing</topic><topic>Cloud computing</topic><topic>Contracts</topic><topic>Data processing</topic><topic>Information management</topic><topic>Intelligence (information)</topic><topic>Optimization</topic><topic>Probability distribution</topic><topic>profit maximization</topic><topic>reserved instances</topic><topic>Resource management</topic><topic>stochastic optimization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Nabavinejad, Seyed Morteza</creatorcontrib><creatorcontrib>Goudarzi, Maziar</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on cloud computing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Nabavinejad, Seyed Morteza</au><au>Goudarzi, Maziar</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Profit Maximization of Big Data Jobs in Cloud Using Stochastic Optimization</atitle><jtitle>IEEE transactions on cloud computing</jtitle><stitle>TCC</stitle><date>2021-10-01</date><risdate>2021</risdate><volume>9</volume><issue>4</issue><spage>1563</spage><epage>1574</epage><pages>1563-1574</pages><issn>2168-7161</issn><eissn>2168-7161</eissn><eissn>2372-0018</eissn><coden>ITCCF6</coden><abstract>Reserved instances offered by cloud providers make it possible to reserve resources and computing capacity for a specific period of time. One should pay for all the hours of that time interval; in exchange, the hourly rate is significantly lower than on-demand instances. Reserved Instances can significantly reduce the monetary cost of resources needed to process big data applications in cloud. However, purchases of these instances are non-refundable, and hence, one should be able to estimate the required resources prior to purchase to avoid over-payment. It becomes important especially when the results obtained by big data job has monetary value, such as business intelligence applications. But, estimating the resource demand of big data processing jobs is hard because of numerous factors that affect them such as data locality, data skew, stragglers, internal settings of big data processing framework, interference among instances, instances availability, etc. To maximize the profit of processing such big data jobs in cloud considering fluctuating nature of their resource demand, as well as reserved instances limitations, we propose Reserved Instances Stochastic Allocation (RISA) approach. Using historical traces of resource demand of big data jobs submitted by user, RISA leverages stochastic optimization to determine the amount of resources needed to be reserved for that user to maximize the profit. Our evaluation using real-world traces shows that RISA can increase the net profit by up to 10x, compared to previous approaches. RISA can also find solutions as close as 2 percent to the best possible solution.</abstract><cop>Piscataway</cop><pub>IEEE Computer Society</pub><doi>10.1109/TCC.2019.2926254</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0002-5123-6318</orcidid><orcidid>https://orcid.org/0000-0002-1272-4589</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 2168-7161
ispartof IEEE transactions on cloud computing, 2021-10, Vol.9 (4), p.1563-1574
issn 2168-7161
2168-7161
2372-0018
language eng
recordid cdi_ieee_primary_8752400
source IEEE Electronic Library (IEL)
subjects Big Data
big data processing
Cloud computing
Contracts
Data processing
Information management
Intelligence (information)
Optimization
Probability distribution
profit maximization
reserved instances
Resource management
stochastic optimization
title Profit Maximization of Big Data Jobs in Cloud Using Stochastic Optimization
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T11%3A13%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Profit%20Maximization%20of%20Big%20Data%20Jobs%20in%20Cloud%20Using%20Stochastic%20Optimization&rft.jtitle=IEEE%20transactions%20on%20cloud%20computing&rft.au=Nabavinejad,%20Seyed%20Morteza&rft.date=2021-10-01&rft.volume=9&rft.issue=4&rft.spage=1563&rft.epage=1574&rft.pages=1563-1574&rft.issn=2168-7161&rft.eissn=2168-7161&rft.coden=ITCCF6&rft_id=info:doi/10.1109/TCC.2019.2926254&rft_dat=%3Cproquest_RIE%3E2605708652%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2605708652&rft_id=info:pmid/&rft_ieee_id=8752400&rfr_iscdi=true