Apache Calcite-based data consanguinity and impact analysis method and Apache Calcite-based data consanguinity and impact analysis system

Bibliographic details
Main authors: SHEN MINGJI, LYU LUMING, LI HUIYING, LI TIANJU, ZHOU QUAN, YOU YI, PENG GUANGLIANG, XU YUQIN, WANG YUANXING, GU ANYI, FENG MINGLIANG, LUO LIN, GU JIE, ZHAO HAOYU
Format: Patent
Language: chi; eng
Description
Summary: The invention provides a data consanguinity (lineage) and impact analysis method and system based on Apache Calcite. The method comprises the following steps: obtaining metadata information from collected metadata, the metadata information comprising tables and fields; integrating Apache Calcite to perform lexical and syntactic analysis on the SQL strings in the metadata information, converting each SQL string into an abstract syntax tree (AST); using the AST to obtain a relationship graph between tables and fields; and performing table-level and field-level lineage analysis and impact analysis on the relationship graph. The method parses the SQL (Structured Query Language) of multiple database types, such as Greenplum and GaussDB. Because it accommodates the special syntax of each database, such as keywords and functions, the method is highly extensible, and it can support the analysis of the special gr
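
The graph step of the summary can be illustrated with a minimal sketch. It does not use Apache Calcite and is not the patented implementation: the field-level edges are hypothetical stand-ins for what an AST walk might produce, and the names `EDGES`, `build_graph`, `reachable`, and `to_table_edges` are assumptions made for illustration. Lineage is modeled as upstream reachability and impact as downstream reachability.

```python
from collections import defaultdict, deque

# Hypothetical field-level edges (source field -> target field), standing in
# for what might be extracted from the AST of INSERT ... SELECT statements.
EDGES = [
    ("ods.orders.id", "dw.orders.order_id"),
    ("ods.orders.amount", "dw.orders.amount"),
    ("dw.orders.amount", "rpt.daily_sales.total"),
    ("ods.users.id", "dw.orders.user_id"),
]


def build_graph(edges):
    """Return (upstream, downstream) adjacency maps for the given edges."""
    upstream = defaultdict(set)
    downstream = defaultdict(set)
    for src, dst in edges:
        downstream[src].add(dst)
        upstream[dst].add(src)
    return upstream, downstream


def reachable(adjacency, start):
    """Breadth-first search: every node reachable from start, excluding start."""
    seen = set()
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in adjacency.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def table_of(field):
    """Strip the trailing field name: 'dw.orders.amount' -> 'dw.orders'."""
    return field.rsplit(".", 1)[0]


def to_table_edges(edges):
    """Collapse field-level edges into de-duplicated table-level edges."""
    return sorted(
        {(table_of(s), table_of(d)) for s, d in edges if table_of(s) != table_of(d)}
    )


upstream, downstream = build_graph(EDGES)

# Field-level lineage: which fields feed rpt.daily_sales.total?
lineage = reachable(upstream, "rpt.daily_sales.total")

# Field-level impact: which fields are affected if ods.orders.amount changes?
impact = reachable(downstream, "ods.orders.amount")

# Table-level view of the same relationship graph.
table_edges = to_table_edges(EDGES)
```

Keeping two adjacency maps makes both directions a plain breadth-first search. The table-level graph is derived from the field-level edges, so the two views cannot drift apart.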