A unifying framework for detecting outliers and change points from time series

We are concerned with the issue of detecting outliers and change points from time series. In the area of data mining, there have been increased interest in these issues since outlier detection is related to fraud detection, rare event discovery, etc., while change-point detection is related to event...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on knowledge and data engineering 2006-04, Vol.18 (4), p.482-492
Hauptverfasser: Takeuchi, J., Yamanishi, K.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 492
container_issue 4
container_start_page 482
container_title IEEE transactions on knowledge and data engineering
container_volume 18
creator Takeuchi, J.
Yamanishi, K.
description We are concerned with the issue of detecting outliers and change points from time series. In the area of data mining, there have been increased interest in these issues since outlier detection is related to fraud detection, rare event discovery, etc., while change-point detection is related to event/trend change detection, activity monitoring, etc. Although, in most previous work, outlier detection and change point detection have not been related explicitly, this paper presents a unifying framework for dealing with both of them. In this framework, a probabilistic model of time series is incrementally learned using an online discounting learning algorithm, which can track a drifting data source adaptively by forgetting out-of-date statistics gradually. A score for any given data is calculated in terms of its deviation from the learned model, with a higher score indicating a high possibility of being an outlier. By taking an average of the scores over a window of a fixed length and sliding the window, we may obtain a new time series consisting of moving-averaged scores. Change point detection is then reduced to the issue of detecting outliers in that time series. We compare the performance of our framework with those of conventional methods to demonstrate its validity through simulation and experimental applications to incidents detection in network security.
doi_str_mv 10.1109/TKDE.2006.1599387
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_29430187</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1599387</ieee_id><sourcerecordid>896171056</sourcerecordid><originalsourceid>FETCH-LOGICAL-c416t-d30eccec57ca45173d4fd3cb9ce17f1140d17a3b09e9630c8f080aadb4a92dfa3</originalsourceid><addsrcrecordid>eNqFkcFqFTEUhgdRsFYfQNwEQV3N9ZxJMkmWpa1VLLqp65CbOampM5NrMoP07c3lXih0oasTTr7_h8PXNK8RNohgPt58vbjcdAD9BqUxXKsnzQlKqdsODT6tbxDYCi7U8-ZFKXcAoJXGk-bbGVvnGO7jfMtCdhP9SfkXCymzgRbyy36f1mWMlAtz88D8TzffEtulOC-lRtLEljgRK5QjlZfNs-DGQq-O87T58eny5vxze_396sv52XXrBfZLO3Ag78lL5Z2QqPggwsD91nhCFRAFDKgc34Ih03PwOoAG54atcKYbguOnzYdD7y6n3yuVxU6xeBpHN1Nai9WmR4Ug-0q-_yfZGcEBtfo_qEGikV0F3z4C79Ka53quNdh10BuUFcID5HMqJVOwuxwnl-8tgt0bs3tjdm_MHo3VzLtjsSvejVXH7GN5CKoewKCp3JsDF4no4fvY8hc0LZ69</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>912206915</pqid></control><display><type>article</type><title>A unifying framework for detecting outliers and change points from time series</title><source>IEEE Electronic Library (IEL)</source><creator>Takeuchi, J. ; Yamanishi, K.</creator><creatorcontrib>Takeuchi, J. ; Yamanishi, K.</creatorcontrib><description>We are concerned with the issue of detecting outliers and change points from time series. In the area of data mining, there have been increased interest in these issues since outlier detection is related to fraud detection, rare event discovery, etc., while change-point detection is related to event/trend change detection, activity monitoring, etc. Although, in most previous work, outlier detection and change point detection have not been related explicitly, this paper presents a unifying framework for dealing with both of them. In this framework, a probabilistic model of time series is incrementally learned using an online discounting learning algorithm, which can track a drifting data source adaptively by forgetting out-of-date statistics gradually. A score for any given data is calculated in terms of its deviation from the learned model, with a higher score indicating a high possibility of being an outlier. By taking an average of the scores over a window of a fixed length and sliding the window, we may obtain a new time series consisting of moving-averaged scores. Change point detection is then reduced to the issue of detecting outliers in that time series. We compare the performance of our framework with those of conventional methods to demonstrate its validity through simulation and experimental applications to incidents detection in network security.</description><identifier>ISSN: 1041-4347</identifier><identifier>EISSN: 1558-2191</identifier><identifier>DOI: 10.1109/TKDE.2006.1599387</identifier><identifier>CODEN: ITKEEH</identifier><language>eng</language><publisher>New York, NY: IEEE</publisher><subject>Algorithms ; Applied sciences ; AR model ; Change detection ; Change detection algorithms ; change point ; Computer science; control theory; systems ; Data mining ; Data processing. List processing. Character string processing ; Data security ; Deviation ; Economic models ; Event detection ; Exact sciences and technology ; Histograms ; Intrusion detection ; Mathematical models ; Memory and file management (including protection and security) ; Memory organisation. Data processing ; Monitoring ; network security ; Software ; Statistics ; Studies ; Time series ; Windows (intervals)</subject><ispartof>IEEE transactions on knowledge and data engineering, 2006-04, Vol.18 (4), p.482-492</ispartof><rights>2006 INIST-CNRS</rights><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2006</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c416t-d30eccec57ca45173d4fd3cb9ce17f1140d17a3b09e9630c8f080aadb4a92dfa3</citedby><cites>FETCH-LOGICAL-c416t-d30eccec57ca45173d4fd3cb9ce17f1140d17a3b09e9630c8f080aadb4a92dfa3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1599387$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1599387$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=17600919$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Takeuchi, J.</creatorcontrib><creatorcontrib>Yamanishi, K.</creatorcontrib><title>A unifying framework for detecting outliers and change points from time series</title><title>IEEE transactions on knowledge and data engineering</title><addtitle>TKDE</addtitle><description>We are concerned with the issue of detecting outliers and change points from time series. In the area of data mining, there have been increased interest in these issues since outlier detection is related to fraud detection, rare event discovery, etc., while change-point detection is related to event/trend change detection, activity monitoring, etc. Although, in most previous work, outlier detection and change point detection have not been related explicitly, this paper presents a unifying framework for dealing with both of them. In this framework, a probabilistic model of time series is incrementally learned using an online discounting learning algorithm, which can track a drifting data source adaptively by forgetting out-of-date statistics gradually. A score for any given data is calculated in terms of its deviation from the learned model, with a higher score indicating a high possibility of being an outlier. By taking an average of the scores over a window of a fixed length and sliding the window, we may obtain a new time series consisting of moving-averaged scores. Change point detection is then reduced to the issue of detecting outliers in that time series. We compare the performance of our framework with those of conventional methods to demonstrate its validity through simulation and experimental applications to incidents detection in network security.</description><subject>Algorithms</subject><subject>Applied sciences</subject><subject>AR model</subject><subject>Change detection</subject><subject>Change detection algorithms</subject><subject>change point</subject><subject>Computer science; control theory; systems</subject><subject>Data mining</subject><subject>Data processing. List processing. Character string processing</subject><subject>Data security</subject><subject>Deviation</subject><subject>Economic models</subject><subject>Event detection</subject><subject>Exact sciences and technology</subject><subject>Histograms</subject><subject>Intrusion detection</subject><subject>Mathematical models</subject><subject>Memory and file management (including protection and security)</subject><subject>Memory organisation. Data processing</subject><subject>Monitoring</subject><subject>network security</subject><subject>Software</subject><subject>Statistics</subject><subject>Studies</subject><subject>Time series</subject><subject>Windows (intervals)</subject><issn>1041-4347</issn><issn>1558-2191</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2006</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNqFkcFqFTEUhgdRsFYfQNwEQV3N9ZxJMkmWpa1VLLqp65CbOampM5NrMoP07c3lXih0oasTTr7_h8PXNK8RNohgPt58vbjcdAD9BqUxXKsnzQlKqdsODT6tbxDYCi7U8-ZFKXcAoJXGk-bbGVvnGO7jfMtCdhP9SfkXCymzgRbyy36f1mWMlAtz88D8TzffEtulOC-lRtLEljgRK5QjlZfNs-DGQq-O87T58eny5vxze_396sv52XXrBfZLO3Ag78lL5Z2QqPggwsD91nhCFRAFDKgc34Ih03PwOoAG54atcKYbguOnzYdD7y6n3yuVxU6xeBpHN1Nai9WmR4Ug-0q-_yfZGcEBtfo_qEGikV0F3z4C79Ka53quNdh10BuUFcID5HMqJVOwuxwnl-8tgt0bs3tjdm_MHo3VzLtjsSvejVXH7GN5CKoewKCp3JsDF4no4fvY8hc0LZ69</recordid><startdate>20060401</startdate><enddate>20060401</enddate><creator>Takeuchi, J.</creator><creator>Yamanishi, K.</creator><general>IEEE</general><general>IEEE Computer Society</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7TB</scope><scope>FR3</scope><scope>F28</scope></search><sort><creationdate>20060401</creationdate><title>A unifying framework for detecting outliers and change points from time series</title><author>Takeuchi, J. ; Yamanishi, K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c416t-d30eccec57ca45173d4fd3cb9ce17f1140d17a3b09e9630c8f080aadb4a92dfa3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Algorithms</topic><topic>Applied sciences</topic><topic>AR model</topic><topic>Change detection</topic><topic>Change detection algorithms</topic><topic>change point</topic><topic>Computer science; control theory; systems</topic><topic>Data mining</topic><topic>Data processing. List processing. Character string processing</topic><topic>Data security</topic><topic>Deviation</topic><topic>Economic models</topic><topic>Event detection</topic><topic>Exact sciences and technology</topic><topic>Histograms</topic><topic>Intrusion detection</topic><topic>Mathematical models</topic><topic>Memory and file management (including protection and security)</topic><topic>Memory organisation. Data processing</topic><topic>Monitoring</topic><topic>network security</topic><topic>Software</topic><topic>Statistics</topic><topic>Studies</topic><topic>Time series</topic><topic>Windows (intervals)</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Takeuchi, J.</creatorcontrib><creatorcontrib>Yamanishi, K.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Engineering Research Database</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><jtitle>IEEE transactions on knowledge and data engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Takeuchi, J.</au><au>Yamanishi, K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A unifying framework for detecting outliers and change points from time series</atitle><jtitle>IEEE transactions on knowledge and data engineering</jtitle><stitle>TKDE</stitle><date>2006-04-01</date><risdate>2006</risdate><volume>18</volume><issue>4</issue><spage>482</spage><epage>492</epage><pages>482-492</pages><issn>1041-4347</issn><eissn>1558-2191</eissn><coden>ITKEEH</coden><abstract>We are concerned with the issue of detecting outliers and change points from time series. In the area of data mining, there have been increased interest in these issues since outlier detection is related to fraud detection, rare event discovery, etc., while change-point detection is related to event/trend change detection, activity monitoring, etc. Although, in most previous work, outlier detection and change point detection have not been related explicitly, this paper presents a unifying framework for dealing with both of them. In this framework, a probabilistic model of time series is incrementally learned using an online discounting learning algorithm, which can track a drifting data source adaptively by forgetting out-of-date statistics gradually. A score for any given data is calculated in terms of its deviation from the learned model, with a higher score indicating a high possibility of being an outlier. By taking an average of the scores over a window of a fixed length and sliding the window, we may obtain a new time series consisting of moving-averaged scores. Change point detection is then reduced to the issue of detecting outliers in that time series. We compare the performance of our framework with those of conventional methods to demonstrate its validity through simulation and experimental applications to incidents detection in network security.</abstract><cop>New York, NY</cop><pub>IEEE</pub><doi>10.1109/TKDE.2006.1599387</doi><tpages>11</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1041-4347
ispartof IEEE transactions on knowledge and data engineering, 2006-04, Vol.18 (4), p.482-492
issn 1041-4347
1558-2191
language eng
recordid cdi_proquest_miscellaneous_29430187
source IEEE Electronic Library (IEL)
subjects Algorithms
Applied sciences
AR model
Change detection
Change detection algorithms
change point
Computer science
control theory
systems
Data mining
Data processing. List processing. Character string processing
Data security
Deviation
Economic models
Event detection
Exact sciences and technology
Histograms
Intrusion detection
Mathematical models
Memory and file management (including protection and security)
Memory organisation. Data processing
Monitoring
network security
Software
Statistics
Studies
Time series
Windows (intervals)
title A unifying framework for detecting outliers and change points from time series
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T08%3A34%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20unifying%20framework%20for%20detecting%20outliers%20and%20change%20points%20from%20time%20series&rft.jtitle=IEEE%20transactions%20on%20knowledge%20and%20data%20engineering&rft.au=Takeuchi,%20J.&rft.date=2006-04-01&rft.volume=18&rft.issue=4&rft.spage=482&rft.epage=492&rft.pages=482-492&rft.issn=1041-4347&rft.eissn=1558-2191&rft.coden=ITKEEH&rft_id=info:doi/10.1109/TKDE.2006.1599387&rft_dat=%3Cproquest_RIE%3E896171056%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=912206915&rft_id=info:pmid/&rft_ieee_id=1599387&rfr_iscdi=true