Trav-SHACL: Efficiently Validating Networks of SHACL Constraints

Bibliographic details
Main authors: Figuera, Mónica; Rohde, Philipp D; Vidal, Maria-Esther
Format: Article
Language: eng
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Figuera, Mónica
Rohde, Philipp D
Vidal, Maria-Esther
description Knowledge graphs have emerged as expressive data structures for Web data. The potential of knowledge graphs and the demand for ecosystems to facilitate their creation, curation, and understanding are attested in diverse domains, e.g., biomedicine. The Shapes Constraint Language (SHACL) is the W3C recommendation language for integrity constraints over RDF knowledge graphs. Because it enables quality assessments of knowledge graphs, SHACL is rapidly gaining attention in real-world scenarios. SHACL models integrity constraints as a network of shapes, where a shape contains the constraints to be fulfilled by the same entities. Validating a SHACL shape schema can run into tractability issues, so efficient computational methods are required to facilitate full adoption. We present Trav-SHACL, a SHACL engine capable of planning the traversal and execution of a shape schema in a way that invalid entities are detected early and needless validations are minimized. Trav-SHACL reorders the shapes in a shape schema for efficient validation and rewrites target and constraint queries for the fast detection of invalid entities. Trav-SHACL is empirically evaluated on 27 testbeds executed against knowledge graphs of up to 34M triples. Our experimental results suggest that Trav-SHACL exhibits high performance gradually and reduces validation time by a factor of up to 28.93 compared to the state of the art.
doi_str_mv 10.48550/arxiv.2101.07136
format Article
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2101.07136
ispartof
issn
language eng
recordid cdi_arxiv_primary_2101_07136
source arXiv.org
subjects Computer Science - Databases
title Trav-SHACL: Efficiently Validating Networks of SHACL Constraints
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T03%3A51%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Trav-SHACL:%20Efficiently%20Validating%20Networks%20of%20SHACL%20Constraints&rft.au=Figuera,%20M%C3%B3nica&rft.date=2021-01-18&rft_id=info:doi/10.48550/arxiv.2101.07136&rft_dat=%3Carxiv_GOX%3E2101_07136%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true