HOPI: An Efficient Connection Index for Complex XML Document Collections

In this paper we present HOPI, a new connection index for XML documents based on the concept of the 2–hop cover of a directed graph introduced by Cohen et al. In contrast to most of the prior work on XML indexing we consider not only paths with child or parent relationships between the nodes, but al...

Bibliographic details
Main authors: Schenkel, Ralf; Theobald, Anja; Weikum, Gerhard
Format: Book chapter
Language: English
Subjects:
Online access: Full text
container_end_page 255
container_issue
container_start_page 237
container_title
container_volume 2992
creator Schenkel, Ralf
Theobald, Anja
Weikum, Gerhard
description In this paper we present HOPI, a new connection index for XML documents based on the concept of the 2–hop cover of a directed graph introduced by Cohen et al. In contrast to most of the prior work on XML indexing we consider not only paths with child or parent relationships between the nodes, but also provide space– and time–efficient reachability tests along the ancestor, descendant, and link axes to support path expressions with wildcards in our XXL search engine. We improve the theoretical concept of a 2–hop cover by developing scalable methods for index creation on very large XML data collections with long paths and extensive cross–linkage. Our experiments show substantial savings in the query performance of the HOPI index over previously proposed index structures in combination with low space requirements.
doi_str_mv 10.1007/978-3-540-24741-8_15
format Book Chapter
fullrecord <record><control><sourceid>proquest_pasca</sourceid><recordid>TN_cdi_pascalfrancis_primary_15672055</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EBC3088220_21_254</sourcerecordid><originalsourceid>FETCH-LOGICAL-p272t-7c0b241f3ecc7efa768fcbde8a7c6769db4f87f19dd7ac58c6ecd0a50440b34d3</originalsourceid><addsrcrecordid>eNotkMlOwzAQhs0qQukbcMiFo2G8JHa4VaXQSkXlABI3y3FsCKRJiFMJ3h6n6Vw8-peR_CF0TeCWAIi7TEjMcMIBUy44wVKR5AhNg8yCuNfkMYpISghmjGcn6HJvEAoApygCBhRngrNzFGXBl5AyeoGm3n_BMJlkGY_Qcrl5Wd3HszpeOFea0tZ9PG_q2pq-bOp4VRf2N3ZNF8RtW4X9_XkdPzRmtx2TVTUm_RU6c7rydnp4J-jtcfE6X-L15mk1n61xSwXtsTCQU04cs8YI67RIpTN5YaUWJhVpVuTcSeFIVhRCm0Sa1JoCdAKcQ854wSboZrzbam905Tpdm9Krtiu3uvsLkFJBIUlCjo45H6z6w3Yqb5pvrwioga8KIBVTgZjas1QD31Bih-Nd87Ozvld2aJnw105X5lO3ve28YiAlpaFJFE04-we3fHgb</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>book_chapter</recordtype><pqid>EBC3088220_21_254</pqid></control><display><type>book_chapter</type><title>HOPI: An Efficient Connection Index for Complex XML Document Collections</title><source>Springer Books</source><creator>Schenkel, Ralf ; Theobald, Anja ; Weikum, Gerhard</creator><contributor>Bertino, Elisa ; Plexousakis, Dimitris ; Christodoulakis, Stavros ; Koubarakis, Manolis ; Ferrari, Elena ; Böhm, Klemens ; Vassilis, Christophides ; Bertino, Elisa ; Christodoulakis, Stavros ; Ferrari, Elena ; Böhm, Klemens ; Christophides, Vassilis ; Koubarakis, Manolis ; Plexousakis, Dimitris</contributor><creatorcontrib>Schenkel, Ralf ; Theobald, Anja ; Weikum, Gerhard ; Bertino, Elisa ; Plexousakis, Dimitris ; Christodoulakis, Stavros ; Koubarakis, Manolis ; Ferrari, Elena ; Böhm, Klemens ; Vassilis, Christophides ; Bertino, Elisa ; Christodoulakis, Stavros ; Ferrari, Elena ; Böhm, Klemens ; Christophides, Vassilis ; Koubarakis, Manolis ; Plexousakis, Dimitris</creatorcontrib><description>In this paper we present HOPI, a new connection index for XML documents based on the concept of 
the 2–hop cover of a directed graph introduced by Cohen et al. In contrast to most of the prior work on XML indexing we consider not only paths with child or parent relationships between the nodes, but also provide space– and time–efficient reachability tests along the ancestor, descendant, and link axes to support path expressions with wildcards in our XXL search engine. We improve the theoretical concept of a 2–hop cover by developing scalable methods for index creation on very large XML data collections with long paths and extensive cross–linkage. Our experiments show substantial savings in the query performance of the HOPI index over previously proposed index structures in combination with low space requirements.</description><identifier>ISSN: 0302-9743</identifier><identifier>ISBN: 3540212000</identifier><identifier>ISBN: 9783540212003</identifier><identifier>EISSN: 1611-3349</identifier><identifier>EISBN: 9783540247418</identifier><identifier>EISBN: 3540247416</identifier><identifier>DOI: 10.1007/978-3-540-24741-8_15</identifier><identifier>OCLC: 934980632</identifier><identifier>LCCallNum: QA76.9.D35</identifier><language>eng</language><publisher>Germany: Springer Berlin / Heidelberg</publisher><subject>Applied sciences ; Computer science; control theory; systems ; Directed Graph ; Exact sciences and technology ; Path Expression ; Path Query ; Query Performance ; Software ; Transitive Closure</subject><ispartof>Advances in Database Technology - EDBT 2004, 2004, Vol.2992, p.237-255</ispartof><rights>Springer-Verlag Berlin Heidelberg 2004</rights><rights>2004 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><relation>Lecture Notes in Computer 
Science</relation></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttps://ebookcentral.proquest.com/covers/3088220-l.jpg</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/978-3-540-24741-8_15$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/978-3-540-24741-8_15$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>309,310,779,780,784,789,790,793,4049,4050,27924,38254,41441,42510</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=15672055$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><contributor>Bertino, Elisa</contributor><contributor>Plexousakis, Dimitris</contributor><contributor>Christodoulakis, Stavros</contributor><contributor>Koubarakis, Manolis</contributor><contributor>Ferrari, Elena</contributor><contributor>Böhm, Klemens</contributor><contributor>Vassilis, Christophides</contributor><contributor>Bertino, Elisa</contributor><contributor>Christodoulakis, Stavros</contributor><contributor>Ferrari, Elena</contributor><contributor>Böhm, Klemens</contributor><contributor>Christophides, Vassilis</contributor><contributor>Koubarakis, Manolis</contributor><contributor>Plexousakis, Dimitris</contributor><creatorcontrib>Schenkel, Ralf</creatorcontrib><creatorcontrib>Theobald, Anja</creatorcontrib><creatorcontrib>Weikum, Gerhard</creatorcontrib><title>HOPI: An Efficient Connection Index for Complex XML Document Collections</title><title>Advances in Database Technology - EDBT 2004</title><description>In this paper we present HOPI, a new connection index for XML documents based on the concept of the 2–hop cover of a directed graph introduced by Cohen et al. 
In contrast to most of the prior work on XML indexing we consider not only paths with child or parent relationships between the nodes, but also provide space– and time–efficient reachability tests along the ancestor, descendant, and link axes to support path expressions with wildcards in our XXL search engine. We improve the theoretical concept of a 2–hop cover by developing scalable methods for index creation on very large XML data collections with long paths and extensive cross–linkage. Our experiments show substantial savings in the query performance of the HOPI index over previously proposed index structures in combination with low space requirements.</description><subject>Applied sciences</subject><subject>Computer science; control theory; systems</subject><subject>Directed Graph</subject><subject>Exact sciences and technology</subject><subject>Path Expression</subject><subject>Path Query</subject><subject>Query Performance</subject><subject>Software</subject><subject>Transitive Closure</subject><issn>0302-9743</issn><issn>1611-3349</issn><isbn>3540212000</isbn><isbn>9783540212003</isbn><isbn>9783540247418</isbn><isbn>3540247416</isbn><fulltext>true</fulltext><rsrctype>book_chapter</rsrctype><creationdate>2004</creationdate><recordtype>book_chapter</recordtype><recordid>eNotkMlOwzAQhs0qQukbcMiFo2G8JHa4VaXQSkXlABI3y3FsCKRJiFMJ3h6n6Vw8-peR_CF0TeCWAIi7TEjMcMIBUy44wVKR5AhNg8yCuNfkMYpISghmjGcn6HJvEAoApygCBhRngrNzFGXBl5AyeoGm3n_BMJlkGY_Qcrl5Wd3HszpeOFea0tZ9PG_q2pq-bOp4VRf2N3ZNF8RtW4X9_XkdPzRmtx2TVTUm_RU6c7rydnp4J-jtcfE6X-L15mk1n61xSwXtsTCQU04cs8YI67RIpTN5YaUWJhVpVuTcSeFIVhRCm0Sa1JoCdAKcQ854wSboZrzbam905Tpdm9Krtiu3uvsLkFJBIUlCjo45H6z6w3Yqb5pvrwioga8KIBVTgZjas1QD31Bih-Nd87Ozvld2aJnw105X5lO3ve28YiAlpaFJFE04-we3fHgb</recordid><startdate>2004</startdate><enddate>2004</enddate><creator>Schenkel, Ralf</creator><creator>Theobald, Anja</creator><creator>Weikum, Gerhard</creator><general>Springer Berlin / Heidelberg</general><general>Springer Berlin 
Heidelberg</general><general>Springer</general><scope>FFUUA</scope><scope>IQODW</scope></search><sort><creationdate>2004</creationdate><title>HOPI: An Efficient Connection Index for Complex XML Document Collections</title><author>Schenkel, Ralf ; Theobald, Anja ; Weikum, Gerhard</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p272t-7c0b241f3ecc7efa768fcbde8a7c6769db4f87f19dd7ac58c6ecd0a50440b34d3</frbrgroupid><rsrctype>book_chapters</rsrctype><prefilter>book_chapters</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Applied sciences</topic><topic>Computer science; control theory; systems</topic><topic>Directed Graph</topic><topic>Exact sciences and technology</topic><topic>Path Expression</topic><topic>Path Query</topic><topic>Query Performance</topic><topic>Software</topic><topic>Transitive Closure</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Schenkel, Ralf</creatorcontrib><creatorcontrib>Theobald, Anja</creatorcontrib><creatorcontrib>Weikum, Gerhard</creatorcontrib><collection>ProQuest Ebook Central - Book Chapters - Demo use only</collection><collection>Pascal-Francis</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Schenkel, Ralf</au><au>Theobald, Anja</au><au>Weikum, Gerhard</au><au>Bertino, Elisa</au><au>Plexousakis, Dimitris</au><au>Christodoulakis, Stavros</au><au>Koubarakis, Manolis</au><au>Ferrari, Elena</au><au>Böhm, Klemens</au><au>Vassilis, Christophides</au><au>Bertino, Elisa</au><au>Christodoulakis, Stavros</au><au>Ferrari, Elena</au><au>Böhm, Klemens</au><au>Christophides, Vassilis</au><au>Koubarakis, Manolis</au><au>Plexousakis, Dimitris</au><format>book</format><genre>bookitem</genre><ristype>CHAP</ristype><atitle>HOPI: An Efficient Connection Index for Complex XML Document Collections</atitle><btitle>Advances in Database Technology - EDBT 
2004</btitle><seriestitle>Lecture Notes in Computer Science</seriestitle><date>2004</date><risdate>2004</risdate><volume>2992</volume><spage>237</spage><epage>255</epage><pages>237-255</pages><issn>0302-9743</issn><eissn>1611-3349</eissn><isbn>3540212000</isbn><isbn>9783540212003</isbn><eisbn>9783540247418</eisbn><eisbn>3540247416</eisbn><abstract>In this paper we present HOPI, a new connection index for XML documents based on the concept of the 2–hop cover of a directed graph introduced by Cohen et al. In contrast to most of the prior work on XML indexing we consider not only paths with child or parent relationships between the nodes, but also provide space– and time–efficient reachability tests along the ancestor, descendant, and link axes to support path expressions with wildcards in our XXL search engine. We improve the theoretical concept of a 2–hop cover by developing scalable methods for index creation on very large XML data collections with long paths and extensive cross–linkage. Our experiments show substantial savings in the query performance of the HOPI index over previously proposed index structures in combination with low space requirements.</abstract><cop>Germany</cop><pub>Springer Berlin / Heidelberg</pub><doi>10.1007/978-3-540-24741-8_15</doi><oclcid>934980632</oclcid><tpages>19</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0302-9743
ispartof Advances in Database Technology - EDBT 2004, 2004, Vol.2992, p.237-255
issn 0302-9743
1611-3349
language eng
recordid cdi_pascalfrancis_primary_15672055
source Springer Books
subjects Applied sciences
Computer science; control theory; systems
Directed Graph
Exact sciences and technology
Path Expression
Path Query
Query Performance
Software
Transitive Closure
title HOPI: An Efficient Connection Index for Complex XML Document Collections
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T18%3A27%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pasca&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=bookitem&rft.atitle=HOPI:%20An%20Efficient%20Connection%20Index%20for%20Complex%20XML%20Document%20Collections&rft.btitle=Advances%20in%20Database%20Technology%20-%20EDBT%202004&rft.au=Schenkel,%20Ralf&rft.date=2004&rft.volume=2992&rft.spage=237&rft.epage=255&rft.pages=237-255&rft.issn=0302-9743&rft.eissn=1611-3349&rft.isbn=3540212000&rft.isbn_list=9783540212003&rft_id=info:doi/10.1007/978-3-540-24741-8_15&rft_dat=%3Cproquest_pasca%3EEBC3088220_21_254%3C/proquest_pasca%3E%3Curl%3E%3C/url%3E&rft.eisbn=9783540247418&rft.eisbn_list=3540247416&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=EBC3088220_21_254&rft_id=info:pmid/&rfr_iscdi=true