Edge/Cloud Infinite-Time Horizon Resource Allocation for Distributed Machine Learning and General Tasks

Edge computing has emerged as a computing paradigm where the application and data processing takes place close to the end devices. It decreases the distances over which data transfers are made, offering reduced delay and fast speed of action for general data processing and store/retrieve jobs. The b...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE eTransactions on network and service management 2024-02, Vol.21 (1), p.697-713
Hauptverfasser:	Sartzetakis, Ippokratis, Soumplis, Polyzois, Pantazopoulos, Panagiotis, Katsaros, Konstantinos V., Sourlas, Vasilis, Varvarigos, Emmanouel
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Algorithms Cloud and edge computing Cloud computing Computational modeling Costs Data processing Delay distributed computing distributed machine learning Edge computing inference Integer programming Internet of Things Linear programming Machine learning Parameters Resource allocation Resource management Task analysis Training
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Edge computing has emerged as a computing paradigm where the application and data processing takes place close to the end devices. It decreases the distances over which data transfers are made, offering reduced delay and fast speed of action for general data processing and store/retrieve jobs. The benefits of edge computing can also be reaped for distributed computation algorithms, where the cloud also plays an assistive role. In this context, an important challenge is to allocate the required resources at both edge and cloud to carry out the processing of data that are generated over a continuous ("infinite") time horizon. This is a complex problem due to the variety of requirements (resource needs, accuracy, delay, etc.) that may be posed by each computation algorithm, as well as the heterogeneous resources' features (e.g., processing, bandwidth). In this work, we develop a solution for serving weakly coupled general distributed algorithms, with emphasis on machine learning algorithms, at the edge and/or the cloud. We present a dual-objective Integer Linear Programming formulation that optimizes monetary cost and computation accuracy. We also introduce efficient heuristics to perform the resource allocation. We examine various distributed ML allocation scenarios using realistic parameters from actual vendors. We quantify trade-offs related to accuracy, performance and cost of edge/cloud bandwidth and processing resources. Our results indicate that among the many parameters of interest, the processing costs seem to play the most important role for the allocation decisions. Finally, we explore interesting interactions between target accuracy, monetary cost and delay.
ISSN:	1932-4537 1932-4537
DOI:	10.1109/TNSM.2023.3312593