LIFECYCLE SUPPORT FOR METADATA-BASED DATA INGESTION

Metadata-based parsing generates a schema that is used to ingest data from an incoming object. Embodiments extend metadata-based parsing to support changes to a generated schema based on the lifecycle of the object. A schema of an incoming data object is compared with existing schema (if any) to det...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Filho, Roberto Usberti, Wasiutinski, Vladimir
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Metadata-based parsing generates a schema that is used to ingest data from an incoming object. Embodiments extend metadata-based parsing to support changes to a generated schema based on the lifecycle of the object. A schema of an incoming data object is compared with existing schema (if any) to determine differences therebetween. Detection of minor differences reveals mere evolution of an existing data object, triggering adjustment of the existing schema (e.g., to ⋅add a new field, ⋅deprecate an existing field, ⋅change a format of an existing field, and/or ⋅delete an existing field). Detection of more substantive differences from an existing schema can result in more substantive changes being made according to stored logic. Embodiments improve system operation by conserving effort and cost that would otherwise be consumed to perform metadata-based parsing and data ingestion of all incoming objects, regardless of their similarity to existing schema.