Navigation- vs. index-based XML multi-query processing

XML path queries form the basis of complex filtering of XML data. Most current XML path query processing techniques can be divided in two groups. Navigation-based algorithms compute results by analyzing an input document one tag at a time. In contrast, index-based algorithms take advantage of precom...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Bruno, N., Gravano, L., Koudas, N., Divesh Srivastava
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 150
container_issue
container_start_page 139
container_title
container_volume
creator Bruno, N.
Gravano, L.
Koudas, N.
Divesh Srivastava
description XML path queries form the basis of complex filtering of XML data. Most current XML path query processing techniques can be divided in two groups. Navigation-based algorithms compute results by analyzing an input document one tag at a time. In contrast, index-based algorithms take advantage of precomputed numbering schemes over the input XML document. We introduce a new index-based technique, index-filter, to answer multiple XML path queries. Index-filter uses indexes built over the document tags to avoid processing large portions of the input document that are guaranteed not to be part of any match. We analyze index-filter and compare it against Y-filter, a state-of-the-art navigation-based technique. We show that both techniques have their advantages, and we discuss the scenarios under which each technique is superior to the other one. In particular, we show that while most XML path query processing techniques work off SAX events, in some cases it pays off to preprocess the input document, augmenting it with auxiliary information that can be used to evaluate the queries faster. We present experimental results over real and synthetic XML documents that validate our claims.
doi_str_mv 10.1109/ICDE.2003.1260788
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_1260788</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1260788</ieee_id><sourcerecordid>1260788</sourcerecordid><originalsourceid>FETCH-LOGICAL-c264t-39d3965402c3d1534948e68a7cb65f3d77764e9f25b9e7d3ad5826a512094e0b3</originalsourceid><addsrcrecordid>eNotj8FKAzEURQMiKHU-QNzMD2RM8pKXZClj1cKoGwV3JTN5UyLttE6mxf69BXs356wOXMZupaikFP5-UT_OKyUEVFKhsM5dsMJbdzIBFtHgFSty_hanaQPe6WuGb-GQVmFK24GXh1yVaYj0y9uQKZZfr0252a-nxH_2NB7L3bjtKOc0rG7YZR_WmYozZ-zzaf5Rv_Dm_XlRPzS8U6gnDj6CR6OF6iBKA9prR-iC7Vo0PURrLWryvTKtJxshROMUBiOV8JpECzN2999NRLTcjWkTxuPy_A7-AKiyQz8</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Navigation- vs. index-based XML multi-query processing</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Bruno, N. ; Gravano, L. ; Koudas, N. ; Divesh Srivastava</creator><creatorcontrib>Bruno, N. ; Gravano, L. ; Koudas, N. ; Divesh Srivastava</creatorcontrib><description>XML path queries form the basis of complex filtering of XML data. Most current XML path query processing techniques can be divided in two groups. Navigation-based algorithms compute results by analyzing an input document one tag at a time. In contrast, index-based algorithms take advantage of precomputed numbering schemes over the input XML document. We introduce a new index-based technique, index-filter, to answer multiple XML path queries. Index-filter uses indexes built over the document tags to avoid processing large portions of the input document that are guaranteed not to be part of any match. We analyze index-filter and compare it against Y-filter, a state-of-the-art navigation-based technique. We show that both techniques have their advantages, and we discuss the scenarios under which each technique is superior to the other one. In particular, we show that while most XML path query processing techniques work off SAX events, in some cases it pays off to preprocess the input document, augmenting it with auxiliary information that can be used to evaluate the queries faster. We present experimental results over real and synthetic XML documents that validate our claims.</description><identifier>ISBN: 9780780376656</identifier><identifier>ISBN: 078037665X</identifier><identifier>DOI: 10.1109/ICDE.2003.1260788</identifier><language>eng</language><publisher>IEEE</publisher><subject>Algorithm design and analysis ; Data preprocessing ; Database languages ; Filtering ; Impedance matching ; Indexes ; Navigation ; Query processing ; XML</subject><ispartof>Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405), 2003, p.139-150</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c264t-39d3965402c3d1534948e68a7cb65f3d77764e9f25b9e7d3ad5826a512094e0b3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1260788$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2056,4047,4048,27923,54918</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1260788$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Bruno, N.</creatorcontrib><creatorcontrib>Gravano, L.</creatorcontrib><creatorcontrib>Koudas, N.</creatorcontrib><creatorcontrib>Divesh Srivastava</creatorcontrib><title>Navigation- vs. index-based XML multi-query processing</title><title>Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405)</title><addtitle>ICDE</addtitle><description>XML path queries form the basis of complex filtering of XML data. Most current XML path query processing techniques can be divided in two groups. Navigation-based algorithms compute results by analyzing an input document one tag at a time. In contrast, index-based algorithms take advantage of precomputed numbering schemes over the input XML document. We introduce a new index-based technique, index-filter, to answer multiple XML path queries. Index-filter uses indexes built over the document tags to avoid processing large portions of the input document that are guaranteed not to be part of any match. We analyze index-filter and compare it against Y-filter, a state-of-the-art navigation-based technique. We show that both techniques have their advantages, and we discuss the scenarios under which each technique is superior to the other one. In particular, we show that while most XML path query processing techniques work off SAX events, in some cases it pays off to preprocess the input document, augmenting it with auxiliary information that can be used to evaluate the queries faster. We present experimental results over real and synthetic XML documents that validate our claims.</description><subject>Algorithm design and analysis</subject><subject>Data preprocessing</subject><subject>Database languages</subject><subject>Filtering</subject><subject>Impedance matching</subject><subject>Indexes</subject><subject>Navigation</subject><subject>Query processing</subject><subject>XML</subject><isbn>9780780376656</isbn><isbn>078037665X</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2003</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotj8FKAzEURQMiKHU-QNzMD2RM8pKXZClj1cKoGwV3JTN5UyLttE6mxf69BXs356wOXMZupaikFP5-UT_OKyUEVFKhsM5dsMJbdzIBFtHgFSty_hanaQPe6WuGb-GQVmFK24GXh1yVaYj0y9uQKZZfr0252a-nxH_2NB7L3bjtKOc0rG7YZR_WmYozZ-zzaf5Rv_Dm_XlRPzS8U6gnDj6CR6OF6iBKA9prR-iC7Vo0PURrLWryvTKtJxshROMUBiOV8JpECzN2999NRLTcjWkTxuPy_A7-AKiyQz8</recordid><startdate>2003</startdate><enddate>2003</enddate><creator>Bruno, N.</creator><creator>Gravano, L.</creator><creator>Koudas, N.</creator><creator>Divesh Srivastava</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>2003</creationdate><title>Navigation- vs. index-based XML multi-query processing</title><author>Bruno, N. ; Gravano, L. ; Koudas, N. ; Divesh Srivastava</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c264t-39d3965402c3d1534948e68a7cb65f3d77764e9f25b9e7d3ad5826a512094e0b3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2003</creationdate><topic>Algorithm design and analysis</topic><topic>Data preprocessing</topic><topic>Database languages</topic><topic>Filtering</topic><topic>Impedance matching</topic><topic>Indexes</topic><topic>Navigation</topic><topic>Query processing</topic><topic>XML</topic><toplevel>online_resources</toplevel><creatorcontrib>Bruno, N.</creatorcontrib><creatorcontrib>Gravano, L.</creatorcontrib><creatorcontrib>Koudas, N.</creatorcontrib><creatorcontrib>Divesh Srivastava</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bruno, N.</au><au>Gravano, L.</au><au>Koudas, N.</au><au>Divesh Srivastava</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Navigation- vs. index-based XML multi-query processing</atitle><btitle>Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405)</btitle><stitle>ICDE</stitle><date>2003</date><risdate>2003</risdate><spage>139</spage><epage>150</epage><pages>139-150</pages><isbn>9780780376656</isbn><isbn>078037665X</isbn><abstract>XML path queries form the basis of complex filtering of XML data. Most current XML path query processing techniques can be divided in two groups. Navigation-based algorithms compute results by analyzing an input document one tag at a time. In contrast, index-based algorithms take advantage of precomputed numbering schemes over the input XML document. We introduce a new index-based technique, index-filter, to answer multiple XML path queries. Index-filter uses indexes built over the document tags to avoid processing large portions of the input document that are guaranteed not to be part of any match. We analyze index-filter and compare it against Y-filter, a state-of-the-art navigation-based technique. We show that both techniques have their advantages, and we discuss the scenarios under which each technique is superior to the other one. In particular, we show that while most XML path query processing techniques work off SAX events, in some cases it pays off to preprocess the input document, augmenting it with auxiliary information that can be used to evaluate the queries faster. We present experimental results over real and synthetic XML documents that validate our claims.</abstract><pub>IEEE</pub><doi>10.1109/ICDE.2003.1260788</doi><tpages>12</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISBN: 9780780376656
ispartof Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405), 2003, p.139-150
issn
language eng
recordid cdi_ieee_primary_1260788
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Algorithm design and analysis
Data preprocessing
Database languages
Filtering
Impedance matching
Indexes
Navigation
Query processing
XML
title Navigation- vs. index-based XML multi-query processing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T19%3A08%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Navigation-%20vs.%20index-based%20XML%20multi-query%20processing&rft.btitle=Proceedings%2019th%20International%20Conference%20on%20Data%20Engineering%20(Cat.%20No.03CH37405)&rft.au=Bruno,%20N.&rft.date=2003&rft.spage=139&rft.epage=150&rft.pages=139-150&rft.isbn=9780780376656&rft.isbn_list=078037665X&rft_id=info:doi/10.1109/ICDE.2003.1260788&rft_dat=%3Cieee_6IE%3E1260788%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=1260788&rfr_iscdi=true