Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs

•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2020-05, Vol.146, p.113205, Article 113205
Hauptverfasser: Bakhshi, Mahdi, Nematbakhsh, Mohammadali, Mohsenzadeh, Mehran, Rahmani, Amir Masoud
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 113205
container_title Expert systems with applications
container_volume 146
creator Bakhshi, Mahdi
Nematbakhsh, Mohammadali
Mohsenzadeh, Mehran
Rahmani, Amir Masoud
description •Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly. As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly.
doi_str_mv 10.1016/j.eswa.2020.113205
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2431028710</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417420300312</els_id><sourcerecordid>2431028710</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</originalsourceid><addsrcrecordid>eNp9kE9rGzEQxUVJoI7bL9CTIOd1R9KutQu9mKRpA4b87VlotSNXri050tquL_3skb2B3nIamHm_mTePkC8MJgzY9OtygmmvJxx4bjDBofpARqyWopjKRpyRETSVLEomy4_kIqUlAJMAckT-XeteF110O_TUBJ_6uDW9C54GS5_uZ48Pc_qyxegw0fZA9WYTw1-31j0e2-mkXES9-U31yi38Gn1Pnf8_0z7tM-0XNOww0j8-7FfYLXCA0idybvUq4ee3Oia_br4_X_0s5nc_bq9m88IIXvdFqxvobM1tKRtWlgJqlGBRM1s3KNtKCybFtK55nnNr2ilAZbnh-fdKiMaKMbkc9mb7J29qGbbR55OKl4IBryWDrOKDysSQUkSrNjH_Gg-KgTrmrJbqmLM65qyGnDP0bYAw-985jCoZh95g5yKaXnXBvYe_AtO8h_M</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2431028710</pqid></control><display><type>article</type><title>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</title><source>ScienceDirect Journals (5 years ago - present)</source><creator>Bakhshi, Mahdi ; Nematbakhsh, Mohammadali ; Mohsenzadeh, Mehran ; Rahmani, Amir Masoud</creator><creatorcontrib>Bakhshi, Mahdi ; Nematbakhsh, Mohammadali ; Mohsenzadeh, Mehran ; Rahmani, Amir Masoud</creatorcontrib><description>•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly. As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2020.113205</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Alignment ; Answering natural language questions ; Complexity ; Datasets ; Disambiguation of interpretations ; Graph theory ; Graphs ; Knowledge bases (artificial intelligence) ; Knowledge graph ; Mapping ; Pairwise graph alignment ; Queries ; Questions ; Similarity</subject><ispartof>Expert systems with applications, 2020-05, Vol.146, p.113205, Article 113205</ispartof><rights>2020</rights><rights>Copyright Elsevier BV May 15, 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</citedby><cites>FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</cites><orcidid>0000-0001-8641-6119</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.eswa.2020.113205$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids></links><search><creatorcontrib>Bakhshi, Mahdi</creatorcontrib><creatorcontrib>Nematbakhsh, Mohammadali</creatorcontrib><creatorcontrib>Mohsenzadeh, Mehran</creatorcontrib><creatorcontrib>Rahmani, Amir Masoud</creatorcontrib><title>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</title><title>Expert systems with applications</title><description>•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly. As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly.</description><subject>Alignment</subject><subject>Answering natural language questions</subject><subject>Complexity</subject><subject>Datasets</subject><subject>Disambiguation of interpretations</subject><subject>Graph theory</subject><subject>Graphs</subject><subject>Knowledge bases (artificial intelligence)</subject><subject>Knowledge graph</subject><subject>Mapping</subject><subject>Pairwise graph alignment</subject><subject>Queries</subject><subject>Questions</subject><subject>Similarity</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kE9rGzEQxUVJoI7bL9CTIOd1R9KutQu9mKRpA4b87VlotSNXri050tquL_3skb2B3nIamHm_mTePkC8MJgzY9OtygmmvJxx4bjDBofpARqyWopjKRpyRETSVLEomy4_kIqUlAJMAckT-XeteF110O_TUBJ_6uDW9C54GS5_uZ48Pc_qyxegw0fZA9WYTw1-31j0e2-mkXES9-U31yi38Gn1Pnf8_0z7tM-0XNOww0j8-7FfYLXCA0idybvUq4ee3Oia_br4_X_0s5nc_bq9m88IIXvdFqxvobM1tKRtWlgJqlGBRM1s3KNtKCybFtK55nnNr2ilAZbnh-fdKiMaKMbkc9mb7J29qGbbR55OKl4IBryWDrOKDysSQUkSrNjH_Gg-KgTrmrJbqmLM65qyGnDP0bYAw-985jCoZh95g5yKaXnXBvYe_AtO8h_M</recordid><startdate>20200515</startdate><enddate>20200515</enddate><creator>Bakhshi, Mahdi</creator><creator>Nematbakhsh, Mohammadali</creator><creator>Mohsenzadeh, Mehran</creator><creator>Rahmani, Amir Masoud</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-8641-6119</orcidid></search><sort><creationdate>20200515</creationdate><title>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</title><author>Bakhshi, Mahdi ; Nematbakhsh, Mohammadali ; Mohsenzadeh, Mehran ; Rahmani, Amir Masoud</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Alignment</topic><topic>Answering natural language questions</topic><topic>Complexity</topic><topic>Datasets</topic><topic>Disambiguation of interpretations</topic><topic>Graph theory</topic><topic>Graphs</topic><topic>Knowledge bases (artificial intelligence)</topic><topic>Knowledge graph</topic><topic>Mapping</topic><topic>Pairwise graph alignment</topic><topic>Queries</topic><topic>Questions</topic><topic>Similarity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Bakhshi, Mahdi</creatorcontrib><creatorcontrib>Nematbakhsh, Mohammadali</creatorcontrib><creatorcontrib>Mohsenzadeh, Mehran</creatorcontrib><creatorcontrib>Rahmani, Amir Masoud</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Bakhshi, Mahdi</au><au>Nematbakhsh, Mohammadali</au><au>Mohsenzadeh, Mehran</au><au>Rahmani, Amir Masoud</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</atitle><jtitle>Expert systems with applications</jtitle><date>2020-05-15</date><risdate>2020</risdate><volume>146</volume><spage>113205</spage><pages>113205-</pages><artnum>113205</artnum><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly. As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2020.113205</doi><orcidid>https://orcid.org/0000-0001-8641-6119</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0957-4174
ispartof Expert systems with applications, 2020-05, Vol.146, p.113205, Article 113205
issn 0957-4174
1873-6793
language eng
recordid cdi_proquest_journals_2431028710
source ScienceDirect Journals (5 years ago - present)
subjects Alignment
Answering natural language questions
Complexity
Datasets
Disambiguation of interpretations
Graph theory
Graphs
Knowledge bases (artificial intelligence)
Knowledge graph
Mapping
Pairwise graph alignment
Queries
Questions
Similarity
title Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T08%3A09%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Data-driven%20construction%20of%20SPARQL%20queries%20by%20approximate%20question%20graph%20alignment%20in%20question%20answering%20over%20knowledge%20graphs&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Bakhshi,%20Mahdi&rft.date=2020-05-15&rft.volume=146&rft.spage=113205&rft.pages=113205-&rft.artnum=113205&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2020.113205&rft_dat=%3Cproquest_cross%3E2431028710%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2431028710&rft_id=info:pmid/&rft_els_id=S0957417420300312&rfr_iscdi=true