HFD: Hierarchical feature decoupling for SQL generation from text

Text-to-SQL, a computational linguistics task, seeks to facilitate the conversion of natural language queries into SQL queries. Recent methodologies have leveraged the concept of slot-filling in conjunction with predetermined SQL templates to effectively bridge the semantic gap between natural langu...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Intelligent data analysis 2024-01, Vol.28 (4), p.991-1005
Hauptverfasser:	Zhang, Xu, Hu, Xiaoyu, Liu, Zejie, Xiang, Yanzheng, Zhou, Deyu
Format:	Artikel
Sprache:	eng
Schlagworte:	Decoupling Linguistics Natural language Natural language processing Queries Query languages Speech recognition
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1005
container_issue	4
container_start_page	991
container_title	Intelligent data analysis
container_volume	28
creator	Zhang, Xu Hu, Xiaoyu Liu, Zejie Xiang, Yanzheng Zhou, Deyu
description	Text-to-SQL, a computational linguistics task, seeks to facilitate the conversion of natural language queries into SQL queries. Recent methodologies have leveraged the concept of slot-filling in conjunction with predetermined SQL templates to effectively bridge the semantic gap between natural language questions and structured database queries, achieving commendable performance by harnessing the power of multi-task learning. However, employing identical features across diverse tasks is an ill-suited practice, fraught with inherent drawbacks. Firstly, based on our observation, there are clear boundaries in the natural language corresponding to SELECT and WHERE clauses. Secondly, the exclusive features integral to each subtask are inadequately emphasized and underutilized, thereby hampering the acquisition of discriminative features for each specific subtask. In an endeavor to rectify these issues, the present work introduces an innovative approach: the hierarchical feature decoupling model for SQL query generation from natural language. This novel approach involves the deliberate separation of features pertaining to subtasks within both SELECT and WHERE clauses, further dissociating these features at the subtask level to foster better model performance. Empirical results derived from experiments conducted on the WikiSQL benchmark dataset reveal the superiority of the proposed approach over several state-of-the-art baseline methods in the context of text-to-SQL query generation.
doi_str_mv	10.3233/IDA-230390
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3082612099</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sage_id>10.3233_IDA-230390</sage_id><sourcerecordid>3082612099</sourcerecordid><originalsourceid>FETCH-LOGICAL-c180t-97ecb6572a8929e030b35d3ed4c633089bb83bf34271f0ff89a0beda4e7c0e9d3</originalsourceid><addsrcrecordid>eNpt0E1LAzEQBuAgCtbqxV8Q8CAIq5OP7ibeSmutUBBRwVvIZid1S7tbk13Qf29kBS-eZg7PvAMvIecMrgUX4uZhPs24AKHhgIzYpGCZZFwdph2UymRevB2Tkxg3ACA5yBGZLhfzW7qsMdjg3mtnt9Sj7fqAtELX9vtt3aypbwN9flrRNTYJdnXbUB_aHe3wszslR95uI579zjF5Xdy9zJbZ6vH-YTZdZY4p6DJdoCvzScGt0lwjCCjFpBJYSZcLAUqXpRKlF5IXzIP3SlsosbISCweoKzEmF0PuPrQfPcbObNo-NOmlSec8Zxy0TupqUC60MQb0Zh_qnQ1fhoH5qcikisxQUcKXA452jX9x_8hvUzpjqQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3082612099</pqid></control><display><type>article</type><title>HFD: Hierarchical feature decoupling for SQL generation from text</title><source>Business Source Complete</source><creator>Zhang, Xu ; Hu, Xiaoyu ; Liu, Zejie ; Xiang, Yanzheng ; Zhou, Deyu</creator><creatorcontrib>Zhang, Xu ; Hu, Xiaoyu ; Liu, Zejie ; Xiang, Yanzheng ; Zhou, Deyu</creatorcontrib><description>Text-to-SQL, a computational linguistics task, seeks to facilitate the conversion of natural language queries into SQL queries. Recent methodologies have leveraged the concept of slot-filling in conjunction with predetermined SQL templates to effectively bridge the semantic gap between natural language questions and structured database queries, achieving commendable performance by harnessing the power of multi-task learning. However, employing identical features across diverse tasks is an ill-suited practice, fraught with inherent drawbacks. Firstly, based on our observation, there are clear boundaries in the natural language corresponding to SELECT and WHERE clauses. Secondly, the exclusive features integral to each subtask are inadequately emphasized and underutilized, thereby hampering the acquisition of discriminative features for each specific subtask. In an endeavor to rectify these issues, the present work introduces an innovative approach: the hierarchical feature decoupling model for SQL query generation from natural language. This novel approach involves the deliberate separation of features pertaining to subtasks within both SELECT and WHERE clauses, further dissociating these features at the subtask level to foster better model performance. Empirical results derived from experiments conducted on the WikiSQL benchmark dataset reveal the superiority of the proposed approach over several state-of-the-art baseline methods in the context of text-to-SQL query generation.</description><identifier>ISSN: 1088-467X</identifier><identifier>EISSN: 1571-4128</identifier><identifier>DOI: 10.3233/IDA-230390</identifier><language>eng</language><publisher>London, England: SAGE Publications</publisher><subject>Decoupling ; Linguistics ; Natural language ; Natural language processing ; Queries ; Query languages ; Speech recognition</subject><ispartof>Intelligent data analysis, 2024-01, Vol.28 (4), p.991-1005</ispartof><rights>2024 – IOS Press. All rights reserved.</rights><rights>Copyright IOS Press BV 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c180t-97ecb6572a8929e030b35d3ed4c633089bb83bf34271f0ff89a0beda4e7c0e9d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><creatorcontrib>Zhang, Xu</creatorcontrib><creatorcontrib>Hu, Xiaoyu</creatorcontrib><creatorcontrib>Liu, Zejie</creatorcontrib><creatorcontrib>Xiang, Yanzheng</creatorcontrib><creatorcontrib>Zhou, Deyu</creatorcontrib><title>HFD: Hierarchical feature decoupling for SQL generation from text</title><title>Intelligent data analysis</title><description>Text-to-SQL, a computational linguistics task, seeks to facilitate the conversion of natural language queries into SQL queries. Recent methodologies have leveraged the concept of slot-filling in conjunction with predetermined SQL templates to effectively bridge the semantic gap between natural language questions and structured database queries, achieving commendable performance by harnessing the power of multi-task learning. However, employing identical features across diverse tasks is an ill-suited practice, fraught with inherent drawbacks. Firstly, based on our observation, there are clear boundaries in the natural language corresponding to SELECT and WHERE clauses. Secondly, the exclusive features integral to each subtask are inadequately emphasized and underutilized, thereby hampering the acquisition of discriminative features for each specific subtask. In an endeavor to rectify these issues, the present work introduces an innovative approach: the hierarchical feature decoupling model for SQL query generation from natural language. This novel approach involves the deliberate separation of features pertaining to subtasks within both SELECT and WHERE clauses, further dissociating these features at the subtask level to foster better model performance. Empirical results derived from experiments conducted on the WikiSQL benchmark dataset reveal the superiority of the proposed approach over several state-of-the-art baseline methods in the context of text-to-SQL query generation.</description><subject>Decoupling</subject><subject>Linguistics</subject><subject>Natural language</subject><subject>Natural language processing</subject><subject>Queries</subject><subject>Query languages</subject><subject>Speech recognition</subject><issn>1088-467X</issn><issn>1571-4128</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpt0E1LAzEQBuAgCtbqxV8Q8CAIq5OP7ibeSmutUBBRwVvIZid1S7tbk13Qf29kBS-eZg7PvAMvIecMrgUX4uZhPs24AKHhgIzYpGCZZFwdph2UymRevB2Tkxg3ACA5yBGZLhfzW7qsMdjg3mtnt9Sj7fqAtELX9vtt3aypbwN9flrRNTYJdnXbUB_aHe3wszslR95uI579zjF5Xdy9zJbZ6vH-YTZdZY4p6DJdoCvzScGt0lwjCCjFpBJYSZcLAUqXpRKlF5IXzIP3SlsosbISCweoKzEmF0PuPrQfPcbObNo-NOmlSec8Zxy0TupqUC60MQb0Zh_qnQ1fhoH5qcikisxQUcKXA452jX9x_8hvUzpjqQ</recordid><startdate>20240101</startdate><enddate>20240101</enddate><creator>Zhang, Xu</creator><creator>Hu, Xiaoyu</creator><creator>Liu, Zejie</creator><creator>Xiang, Yanzheng</creator><creator>Zhou, Deyu</creator><general>SAGE Publications</general><general>IOS Press BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20240101</creationdate><title>HFD: Hierarchical feature decoupling for SQL generation from text</title><author>Zhang, Xu ; Hu, Xiaoyu ; Liu, Zejie ; Xiang, Yanzheng ; Zhou, Deyu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c180t-97ecb6572a8929e030b35d3ed4c633089bb83bf34271f0ff89a0beda4e7c0e9d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Decoupling</topic><topic>Linguistics</topic><topic>Natural language</topic><topic>Natural language processing</topic><topic>Queries</topic><topic>Query languages</topic><topic>Speech recognition</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Xu</creatorcontrib><creatorcontrib>Hu, Xiaoyu</creatorcontrib><creatorcontrib>Liu, Zejie</creatorcontrib><creatorcontrib>Xiang, Yanzheng</creatorcontrib><creatorcontrib>Zhou, Deyu</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Intelligent data analysis</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Xu</au><au>Hu, Xiaoyu</au><au>Liu, Zejie</au><au>Xiang, Yanzheng</au><au>Zhou, Deyu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>HFD: Hierarchical feature decoupling for SQL generation from text</atitle><jtitle>Intelligent data analysis</jtitle><date>2024-01-01</date><risdate>2024</risdate><volume>28</volume><issue>4</issue><spage>991</spage><epage>1005</epage><pages>991-1005</pages><issn>1088-467X</issn><eissn>1571-4128</eissn><abstract>Text-to-SQL, a computational linguistics task, seeks to facilitate the conversion of natural language queries into SQL queries. Recent methodologies have leveraged the concept of slot-filling in conjunction with predetermined SQL templates to effectively bridge the semantic gap between natural language questions and structured database queries, achieving commendable performance by harnessing the power of multi-task learning. However, employing identical features across diverse tasks is an ill-suited practice, fraught with inherent drawbacks. Firstly, based on our observation, there are clear boundaries in the natural language corresponding to SELECT and WHERE clauses. Secondly, the exclusive features integral to each subtask are inadequately emphasized and underutilized, thereby hampering the acquisition of discriminative features for each specific subtask. In an endeavor to rectify these issues, the present work introduces an innovative approach: the hierarchical feature decoupling model for SQL query generation from natural language. This novel approach involves the deliberate separation of features pertaining to subtasks within both SELECT and WHERE clauses, further dissociating these features at the subtask level to foster better model performance. Empirical results derived from experiments conducted on the WikiSQL benchmark dataset reveal the superiority of the proposed approach over several state-of-the-art baseline methods in the context of text-to-SQL query generation.</abstract><cop>London, England</cop><pub>SAGE Publications</pub><doi>10.3233/IDA-230390</doi><tpages>15</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 1088-467X
ispartof	Intelligent data analysis, 2024-01, Vol.28 (4), p.991-1005
issn	1088-467X 1571-4128
language	eng
recordid	cdi_proquest_journals_3082612099
source	Business Source Complete
subjects	Decoupling Linguistics Natural language Natural language processing Queries Query languages Speech recognition
title	HFD: Hierarchical feature decoupling for SQL generation from text
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T03%3A15%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=HFD:%20Hierarchical%20feature%20decoupling%20for%20SQL%20generation%20from%20text&rft.jtitle=Intelligent%20data%20analysis&rft.au=Zhang,%20Xu&rft.date=2024-01-01&rft.volume=28&rft.issue=4&rft.spage=991&rft.epage=1005&rft.pages=991-1005&rft.issn=1088-467X&rft.eissn=1571-4128&rft_id=info:doi/10.3233/IDA-230390&rft_dat=%3Cproquest_cross%3E3082612099%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3082612099&rft_id=info:pmid/&rft_sage_id=10.3233_IDA-230390&rfr_iscdi=true