Organizing on-disk layout of index structures to support historical keyword search queries over temporally evolving data

Methods, systems, and computer program products for organizing on-disk layout of index structures are provided herein. A method includes organizing an on-disk corpus of data index structures; partitioning each data index structure into an active and/or latest state data partition and an archived dat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Deshpande, Prasad M, Subramanian, Suriya, Nandi, Animesh, Sachindran, Narendran
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Methods, systems, and computer program products for organizing on-disk layout of index structures are provided herein. A method includes organizing an on-disk corpus of data index structures; partitioning each data index structure into an active and/or latest state data partition and an archived data partition; sub-partitioning each of the archived data partitions into sub-partitions based on exact lifespan of each data item; executing a search query that includes temporal predicates and keyword predicates across the corpus by: pruning a collection of identified items of data from the corpus that contain the keyword predicates using range metadata associated with the temporal predicates to generate a pruned list of data items; and performing a list intersection operation on the data index structures to generate a list of common data items from the pruned list; retrieving the common data items from the active and/or latest state data partitions and the archived data sub-partitions.