DTOR: Decision Tree Outlier Regressor to explain anomalies

Explaining outliers occurrence and mechanism of their occurrence can be extremely important in a variety of domains. Malfunctions, frauds, threats, in addition to being correctly identified, oftentimes need a valid explanation in order to effectively perform actionable counteracts. The ever more wid...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Crupi, Riccardo, Regoli, Daniele, Sabatino, Alessandro Damiano, Marano, Immacolata, Brinis, Massimiliano, Albertazzi, Luca, Cirillo, Andrea, Cosentini, Andrea Claudio
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Crupi, Riccardo
Regoli, Daniele
Sabatino, Alessandro Damiano
Marano, Immacolata
Brinis, Massimiliano
Albertazzi, Luca
Cirillo, Andrea
Cosentini, Andrea Claudio
description Explaining outliers occurrence and mechanism of their occurrence can be extremely important in a variety of domains. Malfunctions, frauds, threats, in addition to being correctly identified, oftentimes need a valid explanation in order to effectively perform actionable counteracts. The ever more widespread use of sophisticated Machine Learning approach to identify anomalies make such explanations more challenging. We present the Decision Tree Outlier Regressor (DTOR), a technique for producing rule-based explanations for individual data points by estimating anomaly scores generated by an anomaly detection model. This is accomplished by first applying a Decision Tree Regressor, which computes the estimation score, and then extracting the relative path associated with the data point score. Our results demonstrate the robustness of DTOR even in datasets with a large number of features. Additionally, in contrast to other rule-based approaches, the generated rules are consistently satisfied by the points to be explained. Furthermore, our evaluation metrics indicate comparable performance to Anchors in outlier explanation tasks, with reduced execution time.
doi_str_mv 10.48550/arxiv.2403.10903
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2403_10903</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2403_10903</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-d4098c1d05876d5d8ec6d6b245effdd73ac776ddf9f94b11019d4d8f4bab01dd3</originalsourceid><addsrcrecordid>eNotj8tqwzAQRbXpoqT9gK6iH7A7iiRbyq4kfUHAELw3Y8-oCBw7SGlJ_75p2tVdHLicI8SDgtI4a-ER0zl-lSsDulTgQd-K9bZt9mu55SHmOE-yTcyy-TyNkZPc80finOckT7Pk83HEOEmc5gNecL4TNwHHzPf_uxDty3O7eSt2zev75mlXYFXrggx4NygC6-qKLDkeKqr6lbEcAlGtcagvgIIP3vRKgfJkyAXTYw-KSC_E8u_2Kt8dUzxg-u5-I7prhP4B80JB7A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>DTOR: Decision Tree Outlier Regressor to explain anomalies</title><source>arXiv.org</source><creator>Crupi, Riccardo ; Regoli, Daniele ; Sabatino, Alessandro Damiano ; Marano, Immacolata ; Brinis, Massimiliano ; Albertazzi, Luca ; Cirillo, Andrea ; Cosentini, Andrea Claudio</creator><creatorcontrib>Crupi, Riccardo ; Regoli, Daniele ; Sabatino, Alessandro Damiano ; Marano, Immacolata ; Brinis, Massimiliano ; Albertazzi, Luca ; Cirillo, Andrea ; Cosentini, Andrea Claudio</creatorcontrib><description>Explaining outliers occurrence and mechanism of their occurrence can be extremely important in a variety of domains. Malfunctions, frauds, threats, in addition to being correctly identified, oftentimes need a valid explanation in order to effectively perform actionable counteracts. The ever more widespread use of sophisticated Machine Learning approach to identify anomalies make such explanations more challenging. We present the Decision Tree Outlier Regressor (DTOR), a technique for producing rule-based explanations for individual data points by estimating anomaly scores generated by an anomaly detection model. This is accomplished by first applying a Decision Tree Regressor, which computes the estimation score, and then extracting the relative path associated with the data point score. Our results demonstrate the robustness of DTOR even in datasets with a large number of features. Additionally, in contrast to other rule-based approaches, the generated rules are consistently satisfied by the points to be explained. Furthermore, our evaluation metrics indicate comparable performance to Anchors in outlier explanation tasks, with reduced execution time.</description><identifier>DOI: 10.48550/arxiv.2403.10903</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Learning ; Statistics - Machine Learning</subject><creationdate>2024-03</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2403.10903$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2403.10903$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Crupi, Riccardo</creatorcontrib><creatorcontrib>Regoli, Daniele</creatorcontrib><creatorcontrib>Sabatino, Alessandro Damiano</creatorcontrib><creatorcontrib>Marano, Immacolata</creatorcontrib><creatorcontrib>Brinis, Massimiliano</creatorcontrib><creatorcontrib>Albertazzi, Luca</creatorcontrib><creatorcontrib>Cirillo, Andrea</creatorcontrib><creatorcontrib>Cosentini, Andrea Claudio</creatorcontrib><title>DTOR: Decision Tree Outlier Regressor to explain anomalies</title><description>Explaining outliers occurrence and mechanism of their occurrence can be extremely important in a variety of domains. Malfunctions, frauds, threats, in addition to being correctly identified, oftentimes need a valid explanation in order to effectively perform actionable counteracts. The ever more widespread use of sophisticated Machine Learning approach to identify anomalies make such explanations more challenging. We present the Decision Tree Outlier Regressor (DTOR), a technique for producing rule-based explanations for individual data points by estimating anomaly scores generated by an anomaly detection model. This is accomplished by first applying a Decision Tree Regressor, which computes the estimation score, and then extracting the relative path associated with the data point score. Our results demonstrate the robustness of DTOR even in datasets with a large number of features. Additionally, in contrast to other rule-based approaches, the generated rules are consistently satisfied by the points to be explained. Furthermore, our evaluation metrics indicate comparable performance to Anchors in outlier explanation tasks, with reduced execution time.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Learning</subject><subject>Statistics - Machine Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tqwzAQRbXpoqT9gK6iH7A7iiRbyq4kfUHAELw3Y8-oCBw7SGlJ_75p2tVdHLicI8SDgtI4a-ER0zl-lSsDulTgQd-K9bZt9mu55SHmOE-yTcyy-TyNkZPc80finOckT7Pk83HEOEmc5gNecL4TNwHHzPf_uxDty3O7eSt2zev75mlXYFXrggx4NygC6-qKLDkeKqr6lbEcAlGtcagvgIIP3vRKgfJkyAXTYw-KSC_E8u_2Kt8dUzxg-u5-I7prhP4B80JB7A</recordid><startdate>20240316</startdate><enddate>20240316</enddate><creator>Crupi, Riccardo</creator><creator>Regoli, Daniele</creator><creator>Sabatino, Alessandro Damiano</creator><creator>Marano, Immacolata</creator><creator>Brinis, Massimiliano</creator><creator>Albertazzi, Luca</creator><creator>Cirillo, Andrea</creator><creator>Cosentini, Andrea Claudio</creator><scope>AKY</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20240316</creationdate><title>DTOR: Decision Tree Outlier Regressor to explain anomalies</title><author>Crupi, Riccardo ; Regoli, Daniele ; Sabatino, Alessandro Damiano ; Marano, Immacolata ; Brinis, Massimiliano ; Albertazzi, Luca ; Cirillo, Andrea ; Cosentini, Andrea Claudio</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-d4098c1d05876d5d8ec6d6b245effdd73ac776ddf9f94b11019d4d8f4bab01dd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Learning</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Crupi, Riccardo</creatorcontrib><creatorcontrib>Regoli, Daniele</creatorcontrib><creatorcontrib>Sabatino, Alessandro Damiano</creatorcontrib><creatorcontrib>Marano, Immacolata</creatorcontrib><creatorcontrib>Brinis, Massimiliano</creatorcontrib><creatorcontrib>Albertazzi, Luca</creatorcontrib><creatorcontrib>Cirillo, Andrea</creatorcontrib><creatorcontrib>Cosentini, Andrea Claudio</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Crupi, Riccardo</au><au>Regoli, Daniele</au><au>Sabatino, Alessandro Damiano</au><au>Marano, Immacolata</au><au>Brinis, Massimiliano</au><au>Albertazzi, Luca</au><au>Cirillo, Andrea</au><au>Cosentini, Andrea Claudio</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>DTOR: Decision Tree Outlier Regressor to explain anomalies</atitle><date>2024-03-16</date><risdate>2024</risdate><abstract>Explaining outliers occurrence and mechanism of their occurrence can be extremely important in a variety of domains. Malfunctions, frauds, threats, in addition to being correctly identified, oftentimes need a valid explanation in order to effectively perform actionable counteracts. The ever more widespread use of sophisticated Machine Learning approach to identify anomalies make such explanations more challenging. We present the Decision Tree Outlier Regressor (DTOR), a technique for producing rule-based explanations for individual data points by estimating anomaly scores generated by an anomaly detection model. This is accomplished by first applying a Decision Tree Regressor, which computes the estimation score, and then extracting the relative path associated with the data point score. Our results demonstrate the robustness of DTOR even in datasets with a large number of features. Additionally, in contrast to other rule-based approaches, the generated rules are consistently satisfied by the points to be explained. Furthermore, our evaluation metrics indicate comparable performance to Anchors in outlier explanation tasks, with reduced execution time.</abstract><doi>10.48550/arxiv.2403.10903</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2403.10903
ispartof
issn
language eng
recordid cdi_arxiv_primary_2403_10903
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Learning
Statistics - Machine Learning
title DTOR: Decision Tree Outlier Regressor to explain anomalies
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T01%3A02%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=DTOR:%20Decision%20Tree%20Outlier%20Regressor%20to%20explain%20anomalies&rft.au=Crupi,%20Riccardo&rft.date=2024-03-16&rft_id=info:doi/10.48550/arxiv.2403.10903&rft_dat=%3Carxiv_GOX%3E2403_10903%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true