Exploring XML web collections with DescribeX

As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge nee...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	ACM transactions on the web 2010-07, Vol.4 (3)
Hauptverfasser:	Consens, Mariano P, Miller, Renee J, Rizzolo, Flavio, Vaisman, Alejandro A
Format:	Artikel
Sprache:	eng
Schlagworte:	Collection Extensible Markup Language Flexibility Industry standards Query processing Trends XML
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue	3
container_start_page
container_title	ACM transactions on the web
container_volume	4
creator	Consens, Mariano P Miller, Renee J Rizzolo, Flavio Vaisman, Alejandro A
description	As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema.
doi_str_mv	10.1145/564691.564727
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_miscellaneous_901653312</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>901653312</sourcerecordid><originalsourceid>FETCH-proquest_miscellaneous_9016533123</originalsourceid><addsrcrecordid>eNqNyrsOgjAUANAOmoiP0b2bi2gvpSXMinHQzYGNQHPVmkqRC8HP18EPcDrLYWwJYgMQq63SsU5h8yWJkhELQKk0BJAwYVOihxBKR0IHbJ29G-dbW994fj7xAStuvHNoOutr4oPt7nyPZFpbYT5n42vpCBc_Z2x1yC67Y9i0_tUjdcXTkkHnyhp9T0UqQCspIZL_zw_ZcDir</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>901653312</pqid></control><display><type>article</type><title>Exploring XML web collections with DescribeX</title><source>ACM Digital Library</source><creator>Consens, Mariano P ; Miller, Renee J ; Rizzolo, Flavio ; Vaisman, Alejandro A</creator><creatorcontrib>Consens, Mariano P ; Miller, Renee J ; Rizzolo, Flavio ; Vaisman, Alejandro A</creatorcontrib><description>As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema.</description><identifier>ISSN: 1559-1131</identifier><identifier>DOI: 10.1145/564691.564727</identifier><language>eng</language><subject>Collection ; Extensible Markup Language ; Flexibility ; Industry standards ; Query processing ; Trends ; XML</subject><ispartof>ACM transactions on the web, 2010-07, Vol.4 (3)</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids></links><search><creatorcontrib>Consens, Mariano P</creatorcontrib><creatorcontrib>Miller, Renee J</creatorcontrib><creatorcontrib>Rizzolo, Flavio</creatorcontrib><creatorcontrib>Vaisman, Alejandro A</creatorcontrib><title>Exploring XML web collections with DescribeX</title><title>ACM transactions on the web</title><description>As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema.</description><subject>Collection</subject><subject>Extensible Markup Language</subject><subject>Flexibility</subject><subject>Industry standards</subject><subject>Query processing</subject><subject>Trends</subject><subject>XML</subject><issn>1559-1131</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNqNyrsOgjAUANAOmoiP0b2bi2gvpSXMinHQzYGNQHPVmkqRC8HP18EPcDrLYWwJYgMQq63SsU5h8yWJkhELQKk0BJAwYVOihxBKR0IHbJ29G-dbW994fj7xAStuvHNoOutr4oPt7nyPZFpbYT5n42vpCBc_Z2x1yC67Y9i0_tUjdcXTkkHnyhp9T0UqQCspIZL_zw_ZcDir</recordid><startdate>20100701</startdate><enddate>20100701</enddate><creator>Consens, Mariano P</creator><creator>Miller, Renee J</creator><creator>Rizzolo, Flavio</creator><creator>Vaisman, Alejandro A</creator><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20100701</creationdate><title>Exploring XML web collections with DescribeX</title><author>Consens, Mariano P ; Miller, Renee J ; Rizzolo, Flavio ; Vaisman, Alejandro A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_miscellaneous_9016533123</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Collection</topic><topic>Extensible Markup Language</topic><topic>Flexibility</topic><topic>Industry standards</topic><topic>Query processing</topic><topic>Trends</topic><topic>XML</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Consens, Mariano P</creatorcontrib><creatorcontrib>Miller, Renee J</creatorcontrib><creatorcontrib>Rizzolo, Flavio</creatorcontrib><creatorcontrib>Vaisman, Alejandro A</creatorcontrib><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>ACM transactions on the web</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Consens, Mariano P</au><au>Miller, Renee J</au><au>Rizzolo, Flavio</au><au>Vaisman, Alejandro A</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Exploring XML web collections with DescribeX</atitle><jtitle>ACM transactions on the web</jtitle><date>2010-07-01</date><risdate>2010</risdate><volume>4</volume><issue>3</issue><issn>1559-1131</issn><abstract>As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema.</abstract><doi>10.1145/564691.564727</doi></addata></record>
fulltext	fulltext
identifier	ISSN: 1559-1131
ispartof	ACM transactions on the web, 2010-07, Vol.4 (3)
issn	1559-1131
language	eng
recordid	cdi_proquest_miscellaneous_901653312
source	ACM Digital Library
subjects	Collection Extensible Markup Language Flexibility Industry standards Query processing Trends XML
title	Exploring XML web collections with DescribeX
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T15%3A00%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Exploring%20XML%20web%20collections%20with%20DescribeX&rft.jtitle=ACM%20transactions%20on%20the%20web&rft.au=Consens,%20Mariano%20P&rft.date=2010-07-01&rft.volume=4&rft.issue=3&rft.issn=1559-1131&rft_id=info:doi/10.1145/564691.564727&rft_dat=%3Cproquest%3E901653312%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=901653312&rft_id=info:pmid/&rfr_iscdi=true