Exploring XML web collections with DescribeX
As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge nee...
Gespeichert in:
Veröffentlicht in: | ACM transactions on the web 2010-07, Vol.4 (3) |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 3 |
container_start_page | |
container_title | ACM transactions on the web |
container_volume | 4 |
creator | Consens, Mariano P Miller, Renee J Rizzolo, Flavio Vaisman, Alejandro A |
description | As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema. |
doi_str_mv | 10.1145/564691.564727 |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_miscellaneous_901653312</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>901653312</sourcerecordid><originalsourceid>FETCH-proquest_miscellaneous_9016533123</originalsourceid><addsrcrecordid>eNqNyrsOgjAUANAOmoiP0b2bi2gvpSXMinHQzYGNQHPVmkqRC8HP18EPcDrLYWwJYgMQq63SsU5h8yWJkhELQKk0BJAwYVOihxBKR0IHbJ29G-dbW994fj7xAStuvHNoOutr4oPt7nyPZFpbYT5n42vpCBc_Z2x1yC67Y9i0_tUjdcXTkkHnyhp9T0UqQCspIZL_zw_ZcDir</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>901653312</pqid></control><display><type>article</type><title>Exploring XML web collections with DescribeX</title><source>ACM Digital Library</source><creator>Consens, Mariano P ; Miller, Renee J ; Rizzolo, Flavio ; Vaisman, Alejandro A</creator><creatorcontrib>Consens, Mariano P ; Miller, Renee J ; Rizzolo, Flavio ; Vaisman, Alejandro A</creatorcontrib><description>As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema.</description><identifier>ISSN: 1559-1131</identifier><identifier>DOI: 10.1145/564691.564727</identifier><language>eng</language><subject>Collection ; Extensible Markup Language ; Flexibility ; Industry standards ; Query processing ; Trends ; XML</subject><ispartof>ACM transactions on the web, 2010-07, Vol.4 (3)</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids></links><search><creatorcontrib>Consens, Mariano P</creatorcontrib><creatorcontrib>Miller, Renee J</creatorcontrib><creatorcontrib>Rizzolo, Flavio</creatorcontrib><creatorcontrib>Vaisman, Alejandro A</creatorcontrib><title>Exploring XML web collections with DescribeX</title><title>ACM transactions on the web</title><description>As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema.</description><subject>Collection</subject><subject>Extensible Markup Language</subject><subject>Flexibility</subject><subject>Industry standards</subject><subject>Query processing</subject><subject>Trends</subject><subject>XML</subject><issn>1559-1131</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNqNyrsOgjAUANAOmoiP0b2bi2gvpSXMinHQzYGNQHPVmkqRC8HP18EPcDrLYWwJYgMQq63SsU5h8yWJkhELQKk0BJAwYVOihxBKR0IHbJ29G-dbW994fj7xAStuvHNoOutr4oPt7nyPZFpbYT5n42vpCBc_Z2x1yC67Y9i0_tUjdcXTkkHnyhp9T0UqQCspIZL_zw_ZcDir</recordid><startdate>20100701</startdate><enddate>20100701</enddate><creator>Consens, Mariano P</creator><creator>Miller, Renee J</creator><creator>Rizzolo, Flavio</creator><creator>Vaisman, Alejandro A</creator><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20100701</creationdate><title>Exploring XML web collections with DescribeX</title><author>Consens, Mariano P ; Miller, Renee J ; Rizzolo, Flavio ; Vaisman, Alejandro A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_miscellaneous_9016533123</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Collection</topic><topic>Extensible Markup Language</topic><topic>Flexibility</topic><topic>Industry standards</topic><topic>Query processing</topic><topic>Trends</topic><topic>XML</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Consens, Mariano P</creatorcontrib><creatorcontrib>Miller, Renee J</creatorcontrib><creatorcontrib>Rizzolo, Flavio</creatorcontrib><creatorcontrib>Vaisman, Alejandro A</creatorcontrib><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>ACM transactions on the web</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Consens, Mariano P</au><au>Miller, Renee J</au><au>Rizzolo, Flavio</au><au>Vaisman, Alejandro A</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Exploring XML web collections with DescribeX</atitle><jtitle>ACM transactions on the web</jtitle><date>2010-07-01</date><risdate>2010</risdate><volume>4</volume><issue>3</issue><issn>1559-1131</issn><abstract>As Web applications mature and evolve, the nature of the semistructured data that drives these applications also changes. An important trend is the need for increased flexibility in the structure of Web documents. Hence, applications cannot rely solely on schemas to provide the complex knowledge needed to visualize, use, query and manage documents. Even when XML Web documents are valid with regard to a schema, the actual structure of such documents may exhibit significant variations across collections for several reasons: the schema may be very lax (e.g., RSS feeds), the schema may be large and different subsets of it may be used in different documents (e.g., industry standards like UBL), or open content models may allow arbitrary schemas to be mixed (e.g., RSS extensions like those used for podcasting). For these reasons, many applications that incorporate XPath queries to process a large Web document collection require an understanding of the actual structure present in the collection, and not just the schema.</abstract><doi>10.1145/564691.564727</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1559-1131 |
ispartof | ACM transactions on the web, 2010-07, Vol.4 (3) |
issn | 1559-1131 |
language | eng |
recordid | cdi_proquest_miscellaneous_901653312 |
source | ACM Digital Library |
subjects | Collection Extensible Markup Language Flexibility Industry standards Query processing Trends XML |
title | Exploring XML web collections with DescribeX |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T15%3A00%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Exploring%20XML%20web%20collections%20with%20DescribeX&rft.jtitle=ACM%20transactions%20on%20the%20web&rft.au=Consens,%20Mariano%20P&rft.date=2010-07-01&rft.volume=4&rft.issue=3&rft.issn=1559-1131&rft_id=info:doi/10.1145/564691.564727&rft_dat=%3Cproquest%3E901653312%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=901653312&rft_id=info:pmid/&rfr_iscdi=true |