Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams
Real-time surveillance systems, telecommunication systems, and other dynamic environments often generate tremendous (potentially infinite) volume of stream data: the volume is too huge to be scanned multiple times. Much of such data resides at rather low level of abstraction, whereas most analysts a...
Gespeichert in:
Veröffentlicht in: | Distributed and parallel databases : an international journal 2005-09, Vol.18 (2), p.173-197 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 197 |
---|---|
container_issue | 2 |
container_start_page | 173 |
container_title | Distributed and parallel databases : an international journal |
container_volume | 18 |
creator | Han, Jiawei Chen, Yixin Dong, Guozhu Pei, Jian Wah, Benjamin W. Wang, Jianyong Cai, Y. Dora |
description | Real-time surveillance systems, telecommunication systems, and other dynamic environments often generate tremendous (potentially infinite) volume of stream data: the volume is too huge to be scanned multiple times. Much of such data resides at rather low level of abstraction, whereas most analysts are interested in relatively high-level dynamic changes (such as trends and outliers). To discover such high-level characteristics, one may need to perform on-line multi-level, multi-dimensional analytical processing of stream data. In this paper, we propose an architecture, called stream_cube, to facilitate on-line, multi-dimensional, multi-level analysis of stream data.For fast online multi-dimensional analysis of stream data, three important techniques are proposed for efficient and effective computation of stream cubes. First, a tilted time frame model is proposed as a multi-resolution model to register time-related data: the more recent data are registered at finer resolution, whereas the more distant data are registered at coarser resolution. This design reduces the overall storage of time-related data and adapts nicely to the data analysis tasks commonly encountered in practice. Second, instead of materializing cuboids at all levels, we propose to maintain a small number of critical layers. Flexible analysis can be efficiently performed based on the concept of observation layer and minimal interesting layer. Third, an efficient stream data cubing algorithm is developed which computes only the layers (cuboids) along a popular path and leaves the other cuboids for query-driven, on-line computation. Based on this design methodology, stream data cube can be constructed and maintained incrementally with a reasonable amount of memory, computation cost, and query response time. This is verified by our substantial performance study. |
doi_str_mv | 10.1007/s10619-005-3296-1 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_34940742</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>34940742</sourcerecordid><originalsourceid>FETCH-LOGICAL-c315t-671315afd82f41019bbebc530ebf00fb76944b833421d566ea9a19fe598b4fbe3</originalsourceid><addsrcrecordid>eNotkLtOwzAUQC0EEqXwAWye2Az3xk4cs1UtL1HEAMyWnV6LoDyKnQz9e1KF6SxHZziMXSPcIoC-SwgFGgGQC5mZQuAJW2CupdC5Lk_ZAkxWiFKX2Tm7SOkHAIxGvWCvH0Mk1_L16Omerzq-itV3PVA1jJF46CN_G5uhFpu6pS7VfeeayXLNIdWJ94Fv3OD43EiX7Cy4JtHVP5fs6_Hhc_0stu9PL-vVVlQS80EUGie6sCuzoBDQeE--yiWQDwDB68Io5UspVYa7vCjIGYcmUG5Kr4InuWQ3c3cf-9-R0mDbOlXUNK6jfkxWKqNAq2wScRar2KcUKdh9rFsXDxbBHrfZeZudttnjNovyD4MwX8M</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>34940742</pqid></control><display><type>article</type><title>Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams</title><source>SpringerLink Journals</source><creator>Han, Jiawei ; Chen, Yixin ; Dong, Guozhu ; Pei, Jian ; Wah, Benjamin W. ; Wang, Jianyong ; Cai, Y. Dora</creator><creatorcontrib>Han, Jiawei ; Chen, Yixin ; Dong, Guozhu ; Pei, Jian ; Wah, Benjamin W. ; Wang, Jianyong ; Cai, Y. Dora</creatorcontrib><description>Real-time surveillance systems, telecommunication systems, and other dynamic environments often generate tremendous (potentially infinite) volume of stream data: the volume is too huge to be scanned multiple times. Much of such data resides at rather low level of abstraction, whereas most analysts are interested in relatively high-level dynamic changes (such as trends and outliers). To discover such high-level characteristics, one may need to perform on-line multi-level, multi-dimensional analytical processing of stream data. In this paper, we propose an architecture, called stream_cube, to facilitate on-line, multi-dimensional, multi-level analysis of stream data.For fast online multi-dimensional analysis of stream data, three important techniques are proposed for efficient and effective computation of stream cubes. First, a tilted time frame model is proposed as a multi-resolution model to register time-related data: the more recent data are registered at finer resolution, whereas the more distant data are registered at coarser resolution. This design reduces the overall storage of time-related data and adapts nicely to the data analysis tasks commonly encountered in practice. Second, instead of materializing cuboids at all levels, we propose to maintain a small number of critical layers. Flexible analysis can be efficiently performed based on the concept of observation layer and minimal interesting layer. Third, an efficient stream data cubing algorithm is developed which computes only the layers (cuboids) along a popular path and leaves the other cuboids for query-driven, on-line computation. Based on this design methodology, stream data cube can be constructed and maintained incrementally with a reasonable amount of memory, computation cost, and query response time. This is verified by our substantial performance study.</description><identifier>ISSN: 0926-8782</identifier><identifier>EISSN: 1573-7578</identifier><identifier>DOI: 10.1007/s10619-005-3296-1</identifier><language>eng</language><ispartof>Distributed and parallel databases : an international journal, 2005-09, Vol.18 (2), p.173-197</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c315t-671315afd82f41019bbebc530ebf00fb76944b833421d566ea9a19fe598b4fbe3</citedby><cites>FETCH-LOGICAL-c315t-671315afd82f41019bbebc530ebf00fb76944b833421d566ea9a19fe598b4fbe3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27903,27904</link.rule.ids></links><search><creatorcontrib>Han, Jiawei</creatorcontrib><creatorcontrib>Chen, Yixin</creatorcontrib><creatorcontrib>Dong, Guozhu</creatorcontrib><creatorcontrib>Pei, Jian</creatorcontrib><creatorcontrib>Wah, Benjamin W.</creatorcontrib><creatorcontrib>Wang, Jianyong</creatorcontrib><creatorcontrib>Cai, Y. Dora</creatorcontrib><title>Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams</title><title>Distributed and parallel databases : an international journal</title><description>Real-time surveillance systems, telecommunication systems, and other dynamic environments often generate tremendous (potentially infinite) volume of stream data: the volume is too huge to be scanned multiple times. Much of such data resides at rather low level of abstraction, whereas most analysts are interested in relatively high-level dynamic changes (such as trends and outliers). To discover such high-level characteristics, one may need to perform on-line multi-level, multi-dimensional analytical processing of stream data. In this paper, we propose an architecture, called stream_cube, to facilitate on-line, multi-dimensional, multi-level analysis of stream data.For fast online multi-dimensional analysis of stream data, three important techniques are proposed for efficient and effective computation of stream cubes. First, a tilted time frame model is proposed as a multi-resolution model to register time-related data: the more recent data are registered at finer resolution, whereas the more distant data are registered at coarser resolution. This design reduces the overall storage of time-related data and adapts nicely to the data analysis tasks commonly encountered in practice. Second, instead of materializing cuboids at all levels, we propose to maintain a small number of critical layers. Flexible analysis can be efficiently performed based on the concept of observation layer and minimal interesting layer. Third, an efficient stream data cubing algorithm is developed which computes only the layers (cuboids) along a popular path and leaves the other cuboids for query-driven, on-line computation. Based on this design methodology, stream data cube can be constructed and maintained incrementally with a reasonable amount of memory, computation cost, and query response time. This is verified by our substantial performance study.</description><issn>0926-8782</issn><issn>1573-7578</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2005</creationdate><recordtype>article</recordtype><recordid>eNotkLtOwzAUQC0EEqXwAWye2Az3xk4cs1UtL1HEAMyWnV6LoDyKnQz9e1KF6SxHZziMXSPcIoC-SwgFGgGQC5mZQuAJW2CupdC5Lk_ZAkxWiFKX2Tm7SOkHAIxGvWCvH0Mk1_L16Omerzq-itV3PVA1jJF46CN_G5uhFpu6pS7VfeeayXLNIdWJ94Fv3OD43EiX7Cy4JtHVP5fs6_Hhc_0stu9PL-vVVlQS80EUGie6sCuzoBDQeE--yiWQDwDB68Io5UspVYa7vCjIGYcmUG5Kr4InuWQ3c3cf-9-R0mDbOlXUNK6jfkxWKqNAq2wScRar2KcUKdh9rFsXDxbBHrfZeZudttnjNovyD4MwX8M</recordid><startdate>200509</startdate><enddate>200509</enddate><creator>Han, Jiawei</creator><creator>Chen, Yixin</creator><creator>Dong, Guozhu</creator><creator>Pei, Jian</creator><creator>Wah, Benjamin W.</creator><creator>Wang, Jianyong</creator><creator>Cai, Y. Dora</creator><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>200509</creationdate><title>Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams</title><author>Han, Jiawei ; Chen, Yixin ; Dong, Guozhu ; Pei, Jian ; Wah, Benjamin W. ; Wang, Jianyong ; Cai, Y. Dora</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c315t-671315afd82f41019bbebc530ebf00fb76944b833421d566ea9a19fe598b4fbe3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2005</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Han, Jiawei</creatorcontrib><creatorcontrib>Chen, Yixin</creatorcontrib><creatorcontrib>Dong, Guozhu</creatorcontrib><creatorcontrib>Pei, Jian</creatorcontrib><creatorcontrib>Wah, Benjamin W.</creatorcontrib><creatorcontrib>Wang, Jianyong</creatorcontrib><creatorcontrib>Cai, Y. Dora</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Distributed and parallel databases : an international journal</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Han, Jiawei</au><au>Chen, Yixin</au><au>Dong, Guozhu</au><au>Pei, Jian</au><au>Wah, Benjamin W.</au><au>Wang, Jianyong</au><au>Cai, Y. Dora</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams</atitle><jtitle>Distributed and parallel databases : an international journal</jtitle><date>2005-09</date><risdate>2005</risdate><volume>18</volume><issue>2</issue><spage>173</spage><epage>197</epage><pages>173-197</pages><issn>0926-8782</issn><eissn>1573-7578</eissn><abstract>Real-time surveillance systems, telecommunication systems, and other dynamic environments often generate tremendous (potentially infinite) volume of stream data: the volume is too huge to be scanned multiple times. Much of such data resides at rather low level of abstraction, whereas most analysts are interested in relatively high-level dynamic changes (such as trends and outliers). To discover such high-level characteristics, one may need to perform on-line multi-level, multi-dimensional analytical processing of stream data. In this paper, we propose an architecture, called stream_cube, to facilitate on-line, multi-dimensional, multi-level analysis of stream data.For fast online multi-dimensional analysis of stream data, three important techniques are proposed for efficient and effective computation of stream cubes. First, a tilted time frame model is proposed as a multi-resolution model to register time-related data: the more recent data are registered at finer resolution, whereas the more distant data are registered at coarser resolution. This design reduces the overall storage of time-related data and adapts nicely to the data analysis tasks commonly encountered in practice. Second, instead of materializing cuboids at all levels, we propose to maintain a small number of critical layers. Flexible analysis can be efficiently performed based on the concept of observation layer and minimal interesting layer. Third, an efficient stream data cubing algorithm is developed which computes only the layers (cuboids) along a popular path and leaves the other cuboids for query-driven, on-line computation. Based on this design methodology, stream data cube can be constructed and maintained incrementally with a reasonable amount of memory, computation cost, and query response time. This is verified by our substantial performance study.</abstract><doi>10.1007/s10619-005-3296-1</doi><tpages>25</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0926-8782 |
ispartof | Distributed and parallel databases : an international journal, 2005-09, Vol.18 (2), p.173-197 |
issn | 0926-8782 1573-7578 |
language | eng |
recordid | cdi_proquest_miscellaneous_34940742 |
source | SpringerLink Journals |
title | Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-25T16%3A33%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Stream%20Cube:%20An%20Architecture%20for%20Multi-Dimensional%20Analysis%20of%20Data%20Streams&rft.jtitle=Distributed%20and%20parallel%20databases%20:%20an%20international%20journal&rft.au=Han,%20Jiawei&rft.date=2005-09&rft.volume=18&rft.issue=2&rft.spage=173&rft.epage=197&rft.pages=173-197&rft.issn=0926-8782&rft.eissn=1573-7578&rft_id=info:doi/10.1007/s10619-005-3296-1&rft_dat=%3Cproquest_cross%3E34940742%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=34940742&rft_id=info:pmid/&rfr_iscdi=true |