A Roadmap towards Intelligent Operations for Reliable Cloud Computing Systems
Main authors: | , , , , |
---|---|
Format: | Article |
Language: | eng |
Keywords: | |
Online access: | Order full text |
Summary: | The increasing complexity and usage of cloud systems have made it challenging
for service providers to ensure reliability. This paper highlights two main
challenges, namely internal and external factors, that affect the reliability
of cloud microservices. Afterward, we discuss the data-driven approach that can
resolve these challenges from four key aspects: ticket management, log
management, multimodal analysis, and the microservice resilience testing
approach. The experiments conducted show that the proposed data-driven AIOps
solution significantly enhances system reliability from multiple angles. |
---|---|
DOI: | 10.48550/arxiv.2310.00677 |