DIRECT CLOUD STORAGE INTAKE AND UPLOAD ARCHITECTURE

Embodiments of the present disclosure provide a data gathering and query method for collecting ongoing updates to large, unstructured or semi-structured databases. The method comprises gathering a plurality of events defined in a database syntax that is not structured and aggregating the plurality o...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Ben-Natan, Ron
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Embodiments of the present disclosure provide a data gathering and query method for collecting ongoing updates to large, unstructured or semi-structured databases. The method comprises gathering a plurality of events defined in a database syntax that is not structured and aggregating the plurality of events into one or more part files. Each of the one or more part files store a subset of the plurality of events in a columnar format, and each of the one or more part files comprises a header file that includes metadata corresponding to a subset of the plurality of events stored in the part file and is separate from the subset of events stored in the part file. The method further comprises uploading the one or more part files to a cloud storage repository configured to store the one or more part files so that they can be queried by a query server based on the header files.