Optimal Coding Scheme and Resource Allocation for Distributed Computation with Limited Resources
A central issue of distributed computing systems is how to optimally allocate computing and storage resources and design data shuffling strategies such that the total execution time for computing and data shuffling is minimized. This is extremely critical when the computation, storage and communicat...
Published in: | arXiv.org 2021-02 |
---|---|
Main authors: | Shu-Jie Cao; Yi, Lihui; Chen, Haoning; Wu, Youlong |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Shu-Jie Cao ; Yi, Lihui ; Chen, Haoning ; Wu, Youlong |
description | A central issue of distributed computing systems is how to optimally allocate computing and storage resources and design data shuffling strategies such that the total execution time for computing and data shuffling is minimized. This is extremely critical when the computation, storage and communication resources are limited. In this paper, we study the resource allocation and coding scheme for the MapReduce-type framework with limited resources. In particular, we focus on the coded distributed computing (CDC) approach proposed by Li et al. We first extend the asymmetric CDC (ACDC) scheme proposed by Yu et al. to the cascade case where each output function is computed by multiple servers. Then we demonstrate that whether CDC or ACDC is better depends on system parameters (e.g., number of computing servers) and task parameters (e.g., number of input files), implying that neither CDC nor ACDC is optimal. By merging the ideas of CDC and ACDC, we propose a hybrid scheme and show that it can strictly outperform CDC and ACDC. Furthermore, we derive an information-theoretic converse showing that for the MapReduce task using a type of weakly symmetric Reduce assignment, which includes the Reduce assignments of CDC and ACDC as special cases, the hybrid scheme with a corresponding resource allocation strategy is optimal, i.e., achieves the minimum execution time, for an arbitrary number of computing servers and storage memories. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2486136303</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2486136303</sourcerecordid><originalsourceid>FETCH-proquest_journals_24861363033</originalsourceid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2486136303</pqid></control><display><type>article</type><title>Optimal Coding Scheme and Resource Allocation for Distributed Computation with Limited Resources</title><source>Free E- Journals</source><creator>Shu-Jie Cao ; Yi, Lihui ; Chen, Haoning ; Wu, Youlong</creator><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Coding ; Computer networks ; Distributed processing ; Information theory ; Optimization ; Parameters ; Resource allocation</subject><ispartof>arXiv.org, 2021-02</ispartof><rights>2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa></display><addata><au>Shu-Jie Cao</au><au>Yi, Lihui</au><au>Chen, Haoning</au><au>Wu, Youlong</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Optimal Coding Scheme and Resource Allocation for Distributed Computation with Limited Resources</atitle><jtitle>arXiv.org</jtitle><date>2021-02-02</date><risdate>2021</risdate><eissn>2331-8422</eissn><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2021-02 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2486136303 |
source | Free E- Journals |
subjects | Coding; Computer networks; Distributed processing; Information theory; Optimization; Parameters; Resource allocation |
title | Optimal Coding Scheme and Resource Allocation for Distributed Computation with Limited Resources |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T01%3A06%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Optimal%20Coding%20Scheme%20and%20Resource%20Allocation%20for%20Distributed%20Computation%20with%20Limited%20Resources&rft.jtitle=arXiv.org&rft.au=Shu-Jie%20Cao&rft.date=2021-02-02&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2486136303%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2486136303&rft_id=info:pmid/&rfr_iscdi=true |