Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs

•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibi...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Expert systems with applications 2020-05, Vol.146, p.113205, Article 113205
Hauptverfasser:	Bakhshi, Mahdi, Nematbakhsh, Mohammadali, Mohsenzadeh, Mehran, Rahmani, Amir Masoud
Format:	Artikel
Sprache:	eng
Schlagworte:	Alignment Answering natural language questions Complexity Datasets Disambiguation of interpretations Graph theory Graphs Knowledge bases (artificial intelligence) Knowledge graph Mapping Pairwise graph alignment Queries Questions Similarity
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page	113205
container_title	Expert systems with applications
container_volume	146
creator	Bakhshi, Mahdi Nematbakhsh, Mohammadali Mohsenzadeh, Mehran Rahmani, Amir Masoud
description	•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly. As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly.
doi_str_mv	10.1016/j.eswa.2020.113205
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2431028710</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417420300312</els_id><sourcerecordid>2431028710</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</originalsourceid><addsrcrecordid>eNp9kE9rGzEQxUVJoI7bL9CTIOd1R9KutQu9mKRpA4b87VlotSNXri050tquL_3skb2B3nIamHm_mTePkC8MJgzY9OtygmmvJxx4bjDBofpARqyWopjKRpyRETSVLEomy4_kIqUlAJMAckT-XeteF110O_TUBJ_6uDW9C54GS5_uZ48Pc_qyxegw0fZA9WYTw1-31j0e2-mkXES9-U31yi38Gn1Pnf8_0z7tM-0XNOww0j8-7FfYLXCA0idybvUq4ee3Oia_br4_X_0s5nc_bq9m88IIXvdFqxvobM1tKRtWlgJqlGBRM1s3KNtKCybFtK55nnNr2ilAZbnh-fdKiMaKMbkc9mb7J29qGbbR55OKl4IBryWDrOKDysSQUkSrNjH_Gg-KgTrmrJbqmLM65qyGnDP0bYAw-985jCoZh95g5yKaXnXBvYe_AtO8h_M</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2431028710</pqid></control><display><type>article</type><title>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</title><source>ScienceDirect Journals (5 years ago - present)</source><creator>Bakhshi, Mahdi ; Nematbakhsh, Mohammadali ; Mohsenzadeh, Mehran ; Rahmani, Amir Masoud</creator><creatorcontrib>Bakhshi, Mahdi ; Nematbakhsh, Mohammadali ; Mohsenzadeh, Mehran ; Rahmani, Amir Masoud</creatorcontrib><description>•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly. As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2020.113205</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Alignment ; Answering natural language questions ; Complexity ; Datasets ; Disambiguation of interpretations ; Graph theory ; Graphs ; Knowledge bases (artificial intelligence) ; Knowledge graph ; Mapping ; Pairwise graph alignment ; Queries ; Questions ; Similarity</subject><ispartof>Expert systems with applications, 2020-05, Vol.146, p.113205, Article 113205</ispartof><rights>2020</rights><rights>Copyright Elsevier BV May 15, 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</citedby><cites>FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</cites><orcidid>0000-0001-8641-6119</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.eswa.2020.113205$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids></links><search><creatorcontrib>Bakhshi, Mahdi</creatorcontrib><creatorcontrib>Nematbakhsh, Mohammadali</creatorcontrib><creatorcontrib>Mohsenzadeh, Mehran</creatorcontrib><creatorcontrib>Rahmani, Amir Masoud</creatorcontrib><title>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</title><title>Expert systems with applications</title><description>•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly. As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly.</description><subject>Alignment</subject><subject>Answering natural language questions</subject><subject>Complexity</subject><subject>Datasets</subject><subject>Disambiguation of interpretations</subject><subject>Graph theory</subject><subject>Graphs</subject><subject>Knowledge bases (artificial intelligence)</subject><subject>Knowledge graph</subject><subject>Mapping</subject><subject>Pairwise graph alignment</subject><subject>Queries</subject><subject>Questions</subject><subject>Similarity</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kE9rGzEQxUVJoI7bL9CTIOd1R9KutQu9mKRpA4b87VlotSNXri050tquL_3skb2B3nIamHm_mTePkC8MJgzY9OtygmmvJxx4bjDBofpARqyWopjKRpyRETSVLEomy4_kIqUlAJMAckT-XeteF110O_TUBJ_6uDW9C54GS5_uZ48Pc_qyxegw0fZA9WYTw1-31j0e2-mkXES9-U31yi38Gn1Pnf8_0z7tM-0XNOww0j8-7FfYLXCA0idybvUq4ee3Oia_br4_X_0s5nc_bq9m88IIXvdFqxvobM1tKRtWlgJqlGBRM1s3KNtKCybFtK55nnNr2ilAZbnh-fdKiMaKMbkc9mb7J29qGbbR55OKl4IBryWDrOKDysSQUkSrNjH_Gg-KgTrmrJbqmLM65qyGnDP0bYAw-985jCoZh95g5yKaXnXBvYe_AtO8h_M</recordid><startdate>20200515</startdate><enddate>20200515</enddate><creator>Bakhshi, Mahdi</creator><creator>Nematbakhsh, Mohammadali</creator><creator>Mohsenzadeh, Mehran</creator><creator>Rahmani, Amir Masoud</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-8641-6119</orcidid></search><sort><creationdate>20200515</creationdate><title>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</title><author>Bakhshi, Mahdi ; Nematbakhsh, Mohammadali ; Mohsenzadeh, Mehran ; Rahmani, Amir Masoud</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Alignment</topic><topic>Answering natural language questions</topic><topic>Complexity</topic><topic>Datasets</topic><topic>Disambiguation of interpretations</topic><topic>Graph theory</topic><topic>Graphs</topic><topic>Knowledge bases (artificial intelligence)</topic><topic>Knowledge graph</topic><topic>Mapping</topic><topic>Pairwise graph alignment</topic><topic>Queries</topic><topic>Questions</topic><topic>Similarity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Bakhshi, Mahdi</creatorcontrib><creatorcontrib>Nematbakhsh, Mohammadali</creatorcontrib><creatorcontrib>Mohsenzadeh, Mehran</creatorcontrib><creatorcontrib>Rahmani, Amir Masoud</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Bakhshi, Mahdi</au><au>Nematbakhsh, Mohammadali</au><au>Mohsenzadeh, Mehran</au><au>Rahmani, Amir Masoud</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</atitle><jtitle>Expert systems with applications</jtitle><date>2020-05-15</date><risdate>2020</risdate><volume>146</volume><spage>113205</spage><pages>113205-</pages><artnum>113205</artnum><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly. As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2020.113205</doi><orcidid>https://orcid.org/0000-0001-8641-6119</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 0957-4174
ispartof	Expert systems with applications, 2020-05, Vol.146, p.113205, Article 113205
issn	0957-4174 1873-6793
language	eng
recordid	cdi_proquest_journals_2431028710
source	ScienceDirect Journals (5 years ago - present)
subjects	Alignment Answering natural language questions Complexity Datasets Disambiguation of interpretations Graph theory Graphs Knowledge bases (artificial intelligence) Knowledge graph Mapping Pairwise graph alignment Queries Questions Similarity
title	Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T08%3A09%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Data-driven%20construction%20of%20SPARQL%20queries%20by%20approximate%20question%20graph%20alignment%20in%20question%20answering%20over%20knowledge%20graphs&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Bakhshi,%20Mahdi&rft.date=2020-05-15&rft.volume=146&rft.spage=113205&rft.pages=113205-&rft.artnum=113205&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2020.113205&rft_dat=%3Cproquest_cross%3E2431028710%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2431028710&rft_id=info:pmid/&rft_els_id=S0957417420300312&rfr_iscdi=true