Navigating the metric maze: a taxonomy of evaluation metrics for anomaly detection in time series

The field of time series anomaly detection is constantly advancing, with several methods available, making it a challenge to determine the most appropriate method for a specific domain. The evaluation of these methods is facilitated by the use of metrics, which vary widely in their properties. Despi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Data mining and knowledge discovery 2024-05, Vol.38 (3), p.1027-1068
Hauptverfasser: Sørbø, Sondre, Ruocco, Massimiliano
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1068
container_issue 3
container_start_page 1027
container_title Data mining and knowledge discovery
container_volume 38
creator Sørbø, Sondre
Ruocco, Massimiliano
description The field of time series anomaly detection is constantly advancing, with several methods available, making it a challenge to determine the most appropriate method for a specific domain. The evaluation of these methods is facilitated by the use of metrics, which vary widely in their properties. Despite the existence of new evaluation metrics, there is limited agreement on which metrics are best suited for specific scenarios and domains, and the most commonly used metrics have faced criticism in the literature. This paper provides a comprehensive overview of the metrics used for the evaluation of time series anomaly detection methods, and also defines a taxonomy of these based on how they are calculated. By defining a set of properties for evaluation metrics and a set of specific case studies and experiments, twenty metrics are analyzed and discussed in detail, highlighting the unique suitability of each for specific tasks. Through extensive experimentation and analysis, this paper argues that the choice of evaluation metric must be made with care, taking into account the specific requirements of the task at hand.
doi_str_mv 10.1007/s10618-023-00988-8
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3050579131</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3050579131</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-15d0bc3e41867088eaffa99272ca3e7e17aac1075fca9c199e17f92c67dc826e3</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWKtfwFPA8-pk02yy3qT4D4peFLyFMZ3ULd1NTbbF-umNbcGbpxnm_d4beIydC7gUAPoqCaiEKaCUBUBtTGEO2EAoLQutqrfDvEszKpQRcMxOUpoDgColDBg-4bqZYd90M95_EG-pj43jLX7TNUfe41foQrvhwXNa42KVydDtqcR9iByzjosNn1JPbqs2He-blnii2FA6ZUceF4nO9nPIXu9uX8YPxeT5_nF8MymcrGRfCDWFdydpJEylwRhC77GuS106lKRJaEQnQCvvsHairvPF16Wr9NSZsiI5ZBe73GUMnytKvZ2HVezySytBgdK1kCJT5Y5yMaQUydtlbFqMGyvA_lZpd1XaXKXdVmlNNsmdKWW4m1H8i_7H9QPj1ngD</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3050579131</pqid></control><display><type>article</type><title>Navigating the metric maze: a taxonomy of evaluation metrics for anomaly detection in time series</title><source>Springer Nature - Complete Springer Journals</source><creator>Sørbø, Sondre ; Ruocco, Massimiliano</creator><creatorcontrib>Sørbø, Sondre ; Ruocco, Massimiliano</creatorcontrib><description>The field of time series anomaly detection is constantly advancing, with several methods available, making it a challenge to determine the most appropriate method for a specific domain. The evaluation of these methods is facilitated by the use of metrics, which vary widely in their properties. Despite the existence of new evaluation metrics, there is limited agreement on which metrics are best suited for specific scenarios and domains, and the most commonly used metrics have faced criticism in the literature. This paper provides a comprehensive overview of the metrics used for the evaluation of time series anomaly detection methods, and also defines a taxonomy of these based on how they are calculated. By defining a set of properties for evaluation metrics and a set of specific case studies and experiments, twenty metrics are analyzed and discussed in detail, highlighting the unique suitability of each for specific tasks. Through extensive experimentation and analysis, this paper argues that the choice of evaluation metric must be made with care, taking into account the specific requirements of the task at hand.</description><identifier>ISSN: 1384-5810</identifier><identifier>EISSN: 1573-756X</identifier><identifier>DOI: 10.1007/s10618-023-00988-8</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Anomalies ; Artificial Intelligence ; Chemistry and Earth Sciences ; Computer Science ; Data Mining and Knowledge Discovery ; Information Storage and Retrieval ; Physics ; Statistics for Engineering ; Taxonomy ; Time series</subject><ispartof>Data mining and knowledge discovery, 2024-05, Vol.38 (3), p.1027-1068</ispartof><rights>The Author(s) 2023</rights><rights>The Author(s) 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-15d0bc3e41867088eaffa99272ca3e7e17aac1075fca9c199e17f92c67dc826e3</citedby><cites>FETCH-LOGICAL-c363t-15d0bc3e41867088eaffa99272ca3e7e17aac1075fca9c199e17f92c67dc826e3</cites><orcidid>0000-0003-0673-5107</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10618-023-00988-8$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10618-023-00988-8$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>314,778,782,27907,27908,41471,42540,51302</link.rule.ids></links><search><creatorcontrib>Sørbø, Sondre</creatorcontrib><creatorcontrib>Ruocco, Massimiliano</creatorcontrib><title>Navigating the metric maze: a taxonomy of evaluation metrics for anomaly detection in time series</title><title>Data mining and knowledge discovery</title><addtitle>Data Min Knowl Disc</addtitle><description>The field of time series anomaly detection is constantly advancing, with several methods available, making it a challenge to determine the most appropriate method for a specific domain. The evaluation of these methods is facilitated by the use of metrics, which vary widely in their properties. Despite the existence of new evaluation metrics, there is limited agreement on which metrics are best suited for specific scenarios and domains, and the most commonly used metrics have faced criticism in the literature. This paper provides a comprehensive overview of the metrics used for the evaluation of time series anomaly detection methods, and also defines a taxonomy of these based on how they are calculated. By defining a set of properties for evaluation metrics and a set of specific case studies and experiments, twenty metrics are analyzed and discussed in detail, highlighting the unique suitability of each for specific tasks. Through extensive experimentation and analysis, this paper argues that the choice of evaluation metric must be made with care, taking into account the specific requirements of the task at hand.</description><subject>Anomalies</subject><subject>Artificial Intelligence</subject><subject>Chemistry and Earth Sciences</subject><subject>Computer Science</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Information Storage and Retrieval</subject><subject>Physics</subject><subject>Statistics for Engineering</subject><subject>Taxonomy</subject><subject>Time series</subject><issn>1384-5810</issn><issn>1573-756X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><recordid>eNp9kE9LAzEQxYMoWKtfwFPA8-pk02yy3qT4D4peFLyFMZ3ULd1NTbbF-umNbcGbpxnm_d4beIydC7gUAPoqCaiEKaCUBUBtTGEO2EAoLQutqrfDvEszKpQRcMxOUpoDgColDBg-4bqZYd90M95_EG-pj43jLX7TNUfe41foQrvhwXNa42KVydDtqcR9iByzjosNn1JPbqs2He-blnii2FA6ZUceF4nO9nPIXu9uX8YPxeT5_nF8MymcrGRfCDWFdydpJEylwRhC77GuS106lKRJaEQnQCvvsHairvPF16Wr9NSZsiI5ZBe73GUMnytKvZ2HVezySytBgdK1kCJT5Y5yMaQUydtlbFqMGyvA_lZpd1XaXKXdVmlNNsmdKWW4m1H8i_7H9QPj1ngD</recordid><startdate>20240501</startdate><enddate>20240501</enddate><creator>Sørbø, Sondre</creator><creator>Ruocco, Massimiliano</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0003-0673-5107</orcidid></search><sort><creationdate>20240501</creationdate><title>Navigating the metric maze: a taxonomy of evaluation metrics for anomaly detection in time series</title><author>Sørbø, Sondre ; Ruocco, Massimiliano</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-15d0bc3e41867088eaffa99272ca3e7e17aac1075fca9c199e17f92c67dc826e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Anomalies</topic><topic>Artificial Intelligence</topic><topic>Chemistry and Earth Sciences</topic><topic>Computer Science</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Information Storage and Retrieval</topic><topic>Physics</topic><topic>Statistics for Engineering</topic><topic>Taxonomy</topic><topic>Time series</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sørbø, Sondre</creatorcontrib><creatorcontrib>Ruocco, Massimiliano</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Data mining and knowledge discovery</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sørbø, Sondre</au><au>Ruocco, Massimiliano</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Navigating the metric maze: a taxonomy of evaluation metrics for anomaly detection in time series</atitle><jtitle>Data mining and knowledge discovery</jtitle><stitle>Data Min Knowl Disc</stitle><date>2024-05-01</date><risdate>2024</risdate><volume>38</volume><issue>3</issue><spage>1027</spage><epage>1068</epage><pages>1027-1068</pages><issn>1384-5810</issn><eissn>1573-756X</eissn><abstract>The field of time series anomaly detection is constantly advancing, with several methods available, making it a challenge to determine the most appropriate method for a specific domain. The evaluation of these methods is facilitated by the use of metrics, which vary widely in their properties. Despite the existence of new evaluation metrics, there is limited agreement on which metrics are best suited for specific scenarios and domains, and the most commonly used metrics have faced criticism in the literature. This paper provides a comprehensive overview of the metrics used for the evaluation of time series anomaly detection methods, and also defines a taxonomy of these based on how they are calculated. By defining a set of properties for evaluation metrics and a set of specific case studies and experiments, twenty metrics are analyzed and discussed in detail, highlighting the unique suitability of each for specific tasks. Through extensive experimentation and analysis, this paper argues that the choice of evaluation metric must be made with care, taking into account the specific requirements of the task at hand.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10618-023-00988-8</doi><tpages>42</tpages><orcidid>https://orcid.org/0000-0003-0673-5107</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1384-5810
ispartof Data mining and knowledge discovery, 2024-05, Vol.38 (3), p.1027-1068
issn 1384-5810
1573-756X
language eng
recordid cdi_proquest_journals_3050579131
source Springer Nature - Complete Springer Journals
subjects Anomalies
Artificial Intelligence
Chemistry and Earth Sciences
Computer Science
Data Mining and Knowledge Discovery
Information Storage and Retrieval
Physics
Statistics for Engineering
Taxonomy
Time series
title Navigating the metric maze: a taxonomy of evaluation metrics for anomaly detection in time series
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T09%3A14%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Navigating%20the%20metric%20maze:%20a%20taxonomy%20of%20evaluation%20metrics%20for%20anomaly%20detection%20in%20time%20series&rft.jtitle=Data%20mining%20and%20knowledge%20discovery&rft.au=S%C3%B8rb%C3%B8,%20Sondre&rft.date=2024-05-01&rft.volume=38&rft.issue=3&rft.spage=1027&rft.epage=1068&rft.pages=1027-1068&rft.issn=1384-5810&rft.eissn=1573-756X&rft_id=info:doi/10.1007/s10618-023-00988-8&rft_dat=%3Cproquest_cross%3E3050579131%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3050579131&rft_id=info:pmid/&rfr_iscdi=true