Indexing Time-Evolving Data With Variable Lifetimes

Many applications store data items for a pre-determined, finite length of time. Examples include sliding windows over online data streams, where old data are dropped as the window slides forward. Previous research on management of data with finite lifetimes has emphasized online query processing in...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Golab, L., Prahladka, P., Ozsu, M.T.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Many applications store data items for a pre-determined, finite length of time. Examples include sliding windows over online data streams, where old data are dropped as the window slides forward. Previous research on management of data with finite lifetimes has emphasized online query processing in main memory. In this paper, we address the problem of indexing time-evolving data on disk for offline analysis. In order to reduce the I/O costs of index updates, existing work partitions the data chronologically. This way, only the oldest partition is examined for expirations, only the youngest partition incurs insertions, and the remaining partitions "in the middle" are not accessed. However, this solution is based upon the assumption that the order in which the data are inserted is equivalent to the expiration order, which means that the lifetime of each data item is the same. We motivate the need to break this assumption, demonstrate that the existing solutions no longer apply, and propose new index partitioning strategies that yield low update costs and fast access times
ISSN:1551-6393
DOI:10.1109/SSDBM.2006.29