Exploring XML web collections with DescribeX

As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge nee...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ACM transactions on the web 2010-07, Vol.4 (3)
Hauptverfasser: Consens, Mariano P, Miller, Renee J, Rizzolo, Flavio, Vaisman, Alejandro A
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 3
container_start_page
container_title ACM transactions on the web
container_volume 4
creator Consens, Mariano P
Miller, Renee J
Rizzolo, Flavio
Vaisman, Alejandro A
description As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema.
doi_str_mv 10.1145/564691.564727
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_miscellaneous_901653312</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>901653312</sourcerecordid><originalsourceid>FETCH-proquest_miscellaneous_9016533123</originalsourceid><addsrcrecordid>eNqNyrsOgjAUANAOmoiP0b2bi2gvpSXMinHQzYGNQHPVmkqRC8HP18EPcDrLYWwJYgMQq63SsU5h8yWJkhELQKk0BJAwYVOihxBKR0IHbJ29G-dbW994fj7xAStuvHNoOutr4oPt7nyPZFpbYT5n42vpCBc_Z2x1yC67Y9i0_tUjdcXTkkHnyhp9T0UqQCspIZL_zw_ZcDir</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>901653312</pqid></control><display><type>article</type><title>Exploring XML web collections with DescribeX</title><source>ACM Digital Library</source><creator>Consens, Mariano P ; Miller, Renee J ; Rizzolo, Flavio ; Vaisman, Alejandro A</creator><creatorcontrib>Consens, Mariano P ; Miller, Renee J ; Rizzolo, Flavio ; Vaisman, Alejandro A</creatorcontrib><description>As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema.</description><identifier>ISSN: 1559-1131</identifier><identifier>DOI: 10.1145/564691.564727</identifier><language>eng</language><subject>Collection ; Extensible Markup Language ; Flexibility ; Industry standards ; Query processing ; Trends ; XML</subject><ispartof>ACM transactions on the web, 2010-07, Vol.4 (3)</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids></links><search><creatorcontrib>Consens, Mariano P</creatorcontrib><creatorcontrib>Miller, Renee J</creatorcontrib><creatorcontrib>Rizzolo, Flavio</creatorcontrib><creatorcontrib>Vaisman, Alejandro A</creatorcontrib><title>Exploring XML web collections with DescribeX</title><title>ACM transactions on the web</title><description>As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema.</description><subject>Collection</subject><subject>Extensible Markup Language</subject><subject>Flexibility</subject><subject>Industry standards</subject><subject>Query processing</subject><subject>Trends</subject><subject>XML</subject><issn>1559-1131</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNqNyrsOgjAUANAOmoiP0b2bi2gvpSXMinHQzYGNQHPVmkqRC8HP18EPcDrLYWwJYgMQq63SsU5h8yWJkhELQKk0BJAwYVOihxBKR0IHbJ29G-dbW994fj7xAStuvHNoOutr4oPt7nyPZFpbYT5n42vpCBc_Z2x1yC67Y9i0_tUjdcXTkkHnyhp9T0UqQCspIZL_zw_ZcDir</recordid><startdate>20100701</startdate><enddate>20100701</enddate><creator>Consens, Mariano P</creator><creator>Miller, Renee J</creator><creator>Rizzolo, Flavio</creator><creator>Vaisman, Alejandro A</creator><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20100701</creationdate><title>Exploring XML web collections with DescribeX</title><author>Consens, Mariano P ; Miller, Renee J ; Rizzolo, Flavio ; Vaisman, Alejandro A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_miscellaneous_9016533123</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Collection</topic><topic>Extensible Markup Language</topic><topic>Flexibility</topic><topic>Industry standards</topic><topic>Query processing</topic><topic>Trends</topic><topic>XML</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Consens, Mariano P</creatorcontrib><creatorcontrib>Miller, Renee J</creatorcontrib><creatorcontrib>Rizzolo, Flavio</creatorcontrib><creatorcontrib>Vaisman, Alejandro A</creatorcontrib><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>ACM transactions on the web</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Consens, Mariano P</au><au>Miller, Renee J</au><au>Rizzolo, Flavio</au><au>Vaisman, Alejandro A</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Exploring XML web collections with DescribeX</atitle><jtitle>ACM transactions on the web</jtitle><date>2010-07-01</date><risdate>2010</risdate><volume>4</volume><issue>3</issue><issn>1559-1131</issn><abstract>As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema.</abstract><doi>10.1145/564691.564727</doi></addata></record>
fulltext fulltext
identifier ISSN: 1559-1131
ispartof ACM transactions on the web, 2010-07, Vol.4 (3)
issn 1559-1131
language eng
recordid cdi_proquest_miscellaneous_901653312
source ACM Digital Library
subjects Collection
Extensible Markup Language
Flexibility
Industry standards
Query processing
Trends
XML
title Exploring XML web collections with DescribeX
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T15%3A00%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Exploring%20XML%20web%20collections%20with%20DescribeX&rft.jtitle=ACM%20transactions%20on%20the%20web&rft.au=Consens,%20Mariano%20P&rft.date=2010-07-01&rft.volume=4&rft.issue=3&rft.issn=1559-1131&rft_id=info:doi/10.1145/564691.564727&rft_dat=%3Cproquest%3E901653312%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=901653312&rft_id=info:pmid/&rfr_iscdi=true