Type- and Content-Driven Synthesis of SQL Queries from Natural Language
This paper presents a new technique for automatically synthesizing SQL queries from natural language. Our technique is fully automated, works for any database without requiring additional customization, and does not require users to know the underlying database schema. Our method achieves these goal...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This paper presents a new technique for automatically synthesizing SQL
queries from natural language. Our technique is fully automated, works for any
database without requiring additional customization, and does not require users
to know the underlying database schema. Our method achieves these goals by
combining natural language processing, program synthesis, and automated program
repair. Given the user's English description, our technique first uses semantic
parsing to generate a query sketch, which is subsequently completed using
type-directed program synthesis and assigned a confidence score using database
contents. However, since the user's description may not accurately reflect the
actual database schema, our approach also performs fault localization and
repairs the erroneous part of the sketch. This synthesize-repair loop is
repeated until the algorithm infers a query with a sufficiently high confidence
score. We have implemented the proposed technique in a tool called Sqlizer and
evaluate it on three different databases. Our experiments show that the desired
query is ranked within the top 5 candidates in close to 90% of the cases. |
---|---|
DOI: | 10.48550/arxiv.1702.01168 |