Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs
•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibi...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2020-05, Vol.146, p.113205, Article 113205 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | 113205 |
container_title | Expert systems with applications |
container_volume | 146 |
creator | Bakhshi, Mahdi Nematbakhsh, Mohammadali Mohsenzadeh, Mehran Rahmani, Amir Masoud |
description | •Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly.
As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly. |
doi_str_mv | 10.1016/j.eswa.2020.113205 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2431028710</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417420300312</els_id><sourcerecordid>2431028710</sourcerecordid><originalsourceid>FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</originalsourceid><addsrcrecordid>eNp9kE9rGzEQxUVJoI7bL9CTIOd1R9KutQu9mKRpA4b87VlotSNXri050tquL_3skb2B3nIamHm_mTePkC8MJgzY9OtygmmvJxx4bjDBofpARqyWopjKRpyRETSVLEomy4_kIqUlAJMAckT-XeteF110O_TUBJ_6uDW9C54GS5_uZ48Pc_qyxegw0fZA9WYTw1-31j0e2-mkXES9-U31yi38Gn1Pnf8_0z7tM-0XNOww0j8-7FfYLXCA0idybvUq4ee3Oia_br4_X_0s5nc_bq9m88IIXvdFqxvobM1tKRtWlgJqlGBRM1s3KNtKCybFtK55nnNr2ilAZbnh-fdKiMaKMbkc9mb7J29qGbbR55OKl4IBryWDrOKDysSQUkSrNjH_Gg-KgTrmrJbqmLM65qyGnDP0bYAw-985jCoZh95g5yKaXnXBvYe_AtO8h_M</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2431028710</pqid></control><display><type>article</type><title>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</title><source>ScienceDirect Journals (5 years ago - present)</source><creator>Bakhshi, Mahdi ; Nematbakhsh, Mohammadali ; Mohsenzadeh, Mehran ; Rahmani, Amir Masoud</creator><creatorcontrib>Bakhshi, Mahdi ; Nematbakhsh, Mohammadali ; Mohsenzadeh, Mehran ; Rahmani, Amir Masoud</creatorcontrib><description>•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly.
As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2020.113205</identifier><language>eng</language><publisher>New York: Elsevier Ltd</publisher><subject>Alignment ; Answering natural language questions ; Complexity ; Datasets ; Disambiguation of interpretations ; Graph theory ; Graphs ; Knowledge bases (artificial intelligence) ; Knowledge graph ; Mapping ; Pairwise graph alignment ; Queries ; Questions ; Similarity</subject><ispartof>Expert systems with applications, 2020-05, Vol.146, p.113205, Article 113205</ispartof><rights>2020</rights><rights>Copyright Elsevier BV May 15, 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</citedby><cites>FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</cites><orcidid>0000-0001-8641-6119</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.eswa.2020.113205$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids></links><search><creatorcontrib>Bakhshi, Mahdi</creatorcontrib><creatorcontrib>Nematbakhsh, Mohammadali</creatorcontrib><creatorcontrib>Mohsenzadeh, Mehran</creatorcontrib><creatorcontrib>Rahmani, Amir Masoud</creatorcontrib><title>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</title><title>Expert systems with applications</title><description>•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly.
As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly.</description><subject>Alignment</subject><subject>Answering natural language questions</subject><subject>Complexity</subject><subject>Datasets</subject><subject>Disambiguation of interpretations</subject><subject>Graph theory</subject><subject>Graphs</subject><subject>Knowledge bases (artificial intelligence)</subject><subject>Knowledge graph</subject><subject>Mapping</subject><subject>Pairwise graph alignment</subject><subject>Queries</subject><subject>Questions</subject><subject>Similarity</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kE9rGzEQxUVJoI7bL9CTIOd1R9KutQu9mKRpA4b87VlotSNXri050tquL_3skb2B3nIamHm_mTePkC8MJgzY9OtygmmvJxx4bjDBofpARqyWopjKRpyRETSVLEomy4_kIqUlAJMAckT-XeteF110O_TUBJ_6uDW9C54GS5_uZ48Pc_qyxegw0fZA9WYTw1-31j0e2-mkXES9-U31yi38Gn1Pnf8_0z7tM-0XNOww0j8-7FfYLXCA0idybvUq4ee3Oia_br4_X_0s5nc_bq9m88IIXvdFqxvobM1tKRtWlgJqlGBRM1s3KNtKCybFtK55nnNr2ilAZbnh-fdKiMaKMbkc9mb7J29qGbbR55OKl4IBryWDrOKDysSQUkSrNjH_Gg-KgTrmrJbqmLM65qyGnDP0bYAw-985jCoZh95g5yKaXnXBvYe_AtO8h_M</recordid><startdate>20200515</startdate><enddate>20200515</enddate><creator>Bakhshi, Mahdi</creator><creator>Nematbakhsh, Mohammadali</creator><creator>Mohsenzadeh, Mehran</creator><creator>Rahmani, Amir Masoud</creator><general>Elsevier Ltd</general><general>Elsevier BV</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-8641-6119</orcidid></search><sort><creationdate>20200515</creationdate><title>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</title><author>Bakhshi, Mahdi ; Nematbakhsh, Mohammadali ; Mohsenzadeh, Mehran ; Rahmani, Amir Masoud</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c328t-ba90df82f479144308e70fea1f89e7b5a317368824792fcb6005f2c27935339f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Alignment</topic><topic>Answering natural language questions</topic><topic>Complexity</topic><topic>Datasets</topic><topic>Disambiguation of interpretations</topic><topic>Graph theory</topic><topic>Graphs</topic><topic>Knowledge bases (artificial intelligence)</topic><topic>Knowledge graph</topic><topic>Mapping</topic><topic>Pairwise graph alignment</topic><topic>Queries</topic><topic>Questions</topic><topic>Similarity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Bakhshi, Mahdi</creatorcontrib><creatorcontrib>Nematbakhsh, Mohammadali</creatorcontrib><creatorcontrib>Mohsenzadeh, Mehran</creatorcontrib><creatorcontrib>Rahmani, Amir Masoud</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Bakhshi, Mahdi</au><au>Nematbakhsh, Mohammadali</au><au>Mohsenzadeh, Mehran</au><au>Rahmani, Amir Masoud</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs</atitle><jtitle>Expert systems with applications</jtitle><date>2020-05-15</date><risdate>2020</risdate><volume>146</volume><spage>113205</spage><pages>113205-</pages><artnum>113205</artnum><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>•Extracting the query graph patterns directly from the knowledge graph is proposed.•Pairwise graph alignment-based method verifies the interpretations of a question.•Relying on the intermediate semantic items is reduced to as least as possible.•The lexical and structural approximation causes flexibility in finding matches.•Results show a significant increase in the number of questions answered correctly.
As increasingly more semantic real-world data is stored in knowledge graphs, providing intuitive and effective query methods for end-users is a fundamental and challenging task. Since there is a gap between the plain natural language question (NLQ) and structured data, most RDF question/answering (Q/A) systems construct SPARQL queries from NLQs and obtain precise answers from knowledge graphs. A major challenge is how to disambiguate the mapping of phrases and relations in a question to the dataset items, especially in complex questions. In this paper, we propose a novel data-driven graph similarity framework for RDF Q/A to extract the query graph patterns directly from the knowledge graph instead of constructing them with semantically mapped items. An uncertain question graph is presented to model the interpretations of an NLQ, based on which our problem is reduced to a graph alignment problem. In formulating the alignment, both the lexical and structural similarity of graphs are considered, hence, the target RDF subgraph is used as a query graph pattern to construct the final query. We create a pruned entity graph dynamically based on the complexity of an input question to reduce the search space on the knowledge graph. Moreover, to reduce the calculating cost of the graph similarity, we compute the similarity scores only for same-distance graph elements and equip the process with an edge association-aware surface form extraction method. Empirical studies over real datasets indicate that our proposed approach is flexible and effective as it outperforms state-of-the-art methods significantly.</abstract><cop>New York</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2020.113205</doi><orcidid>https://orcid.org/0000-0001-8641-6119</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0957-4174 |
ispartof | Expert systems with applications, 2020-05, Vol.146, p.113205, Article 113205 |
issn | 0957-4174 1873-6793 |
language | eng |
recordid | cdi_proquest_journals_2431028710 |
source | ScienceDirect Journals (5 years ago - present) |
subjects | Alignment Answering natural language questions Complexity Datasets Disambiguation of interpretations Graph theory Graphs Knowledge bases (artificial intelligence) Knowledge graph Mapping Pairwise graph alignment Queries Questions Similarity |
title | Data-driven construction of SPARQL queries by approximate question graph alignment in question answering over knowledge graphs |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T08%3A09%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Data-driven%20construction%20of%20SPARQL%20queries%20by%20approximate%20question%20graph%20alignment%20in%20question%20answering%20over%20knowledge%20graphs&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Bakhshi,%20Mahdi&rft.date=2020-05-15&rft.volume=146&rft.spage=113205&rft.pages=113205-&rft.artnum=113205&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2020.113205&rft_dat=%3Cproquest_cross%3E2431028710%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2431028710&rft_id=info:pmid/&rft_els_id=S0957417420300312&rfr_iscdi=true |