Online machine learning for auto-scaling in the edge computing

The evolution of edge computing devices has enabled machine intelligence techniques to process data close to its producers (the sensors) and end-users. Although edge devices are usually resource-constrained, the distribution of processing services among several nodes enables a processing capacity si...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pervasive and mobile computing 2022-12, Vol.87, p.101722, Article 101722
Hauptverfasser: Silva, Thiago Pereira da, Neto, Aluizio Rocha, Batista, Thais Vasconcelos, Delicato, Flávia C., Pires, Paulo F., Lopes, Frederico
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The evolution of edge computing devices has enabled machine intelligence techniques to process data close to its producers (the sensors) and end-users. Although edge devices are usually resource-constrained, the distribution of processing services among several nodes enables a processing capacity similar to cloud environments. However, the edge computing environment is highly dynamic, impacting the availability of nodes in the distributed system. In addition, the processing workload for each node can change constantly. Thus, the scaling of processing services needs to be rapidly adjusted, avoiding bottlenecks or wasted resources while meeting the applications’ QoS requirements. This paper presents an auto-scaling subsystem for container-based processing services using online machine learning. The auto-scaling follows the MAPE-K control loop to dynamically adjust the number of containers in response to workload changes. We designed the approach for scenarios where the number of processing requests is unknown beforehand. We developed a hybrid auto-scaling mechanism that behaves reactively while a prediction online machine learning model is continuously trained. When the prediction model reaches a desirable performance, the auto-scaling acts proactively, using predictions to anticipate scaling actions. An experimental evaluation has demonstrated the feasibility of the architecture. Our solution achieved fewer service level agreement (SLA) violations and scaling operations to meet demand than purely reactive and no scaling approaches using an actual application workload. Also, our solution wasted fewer resources compared to the other techniques. •We specify a software subsystem that incorporates an auto-scaling approach.•We designed an online machine learning-based auto-scaling.•The auto-scaling is incorporated into a distributed video analytics system.•The auto-scaling changes its behavior from reactive to proactive and vice versa.•The auto-scaling follows the MAPE-K autonomic control loop.•We present a performance evaluation using a video analytics scenario.
ISSN:1574-1192
1873-1589
DOI:10.1016/j.pmcj.2022.101722