Smart Caching in a Data Lake for High Energy Physics Analysis

Bibliographic Details
Published in: Journal of Grid Computing 2023-09, Vol. 21 (3), p. 42, Article 42
Main authors: Tedeschi, Tommaso; Baioletti, Marco; Ciangottini, Diego; Poggioni, Valentina; Spiga, Daniele; Storchi, Loriano; Tracolli, Mirco
Format: Article
Language: English
Online access: Full text
Description
Summary: The continuous growth of data production in almost all scientific areas raises new problems in data access and management, especially when the end users and the resources they can access are distributed worldwide. This work focuses on cache management in a Data Lake infrastructure in the context of High Energy Physics. We propose an autonomous method, based on Reinforcement Learning techniques, to improve the user experience and to contain the maintenance costs of the infrastructure.
ISSN: 1570-7873, 1572-9184
DOI: 10.1007/s10723-023-09664-z