Finding Minimum-Cost Explanations for Predictions made by Tree Ensembles

The ability to explain why a machine learning model arrives at a particular prediction is crucial when used as decision support by human operators of critical systems. The provided explanations must be provably correct, and preferably without redundant information, called minimal explanations. In th...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2023-03
Hauptverfasser:	Törnblom, John, Karlsson, Emil, Nadjm-Tehrani, Simin
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Computation Cost function Machine learning
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Törnblom, John Karlsson, Emil Nadjm-Tehrani, Simin
description	The ability to explain why a machine learning model arrives at a particular prediction is crucial when used as decision support by human operators of critical systems. The provided explanations must be provably correct, and preferably without redundant information, called minimal explanations. In this paper, we aim at finding explanations for predictions made by tree ensembles that are not only minimal, but also minimum with respect to a cost function. To this end, we first present a highly efficient oracle that can determine the correctness of explanations, surpassing the runtime performance of current state-of-the-art alternatives by several orders of magnitude when computing minimal explanations. Secondly, we adapt an algorithm called MARCO from related works (calling it m-MARCO) for the purpose of computing a single minimum explanation per prediction, and demonstrate an overall speedup factor of two compared to the MARCO algorithm which enumerates all minimal explanations. Finally, we study the obtained explanations from a range of use cases, leading to further insights of their characteristics. In particular, we observe that in several cases, there are more than 100,000 minimal explanations to choose from for a single prediction. In these cases, we see that only a small portion of the minimal explanations are also minimum, and that the minimum explanations are significantly less verbose, hence motivating the aim of this work.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2787735458</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2787735458</sourcerecordid><originalsourceid>FETCH-proquest_journals_27877354583</originalsourceid><addsrcrecordid>eNqNitEKgjAUQEcQJOU_XOhZsM0130XxJejBd5l5jYnbbFeh_r6gPqCnw-GcDYu4EKckzzjfsZhoTNOUnxWXUkSsrozrjbvDxThjV5sUnhYon_OknV6MdwSDD3AN2Jvb163uEboXNAERSkdouwnpwLaDngjjH_fsWJVNUSdz8I8VaWlHvwb3SS1XuVJCZjIX_11vN547_w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2787735458</pqid></control><display><type>article</type><title>Finding Minimum-Cost Explanations for Predictions made by Tree Ensembles</title><source>Free E- Journals</source><creator>Törnblom, John ; Karlsson, Emil ; Nadjm-Tehrani, Simin</creator><creatorcontrib>Törnblom, John ; Karlsson, Emil ; Nadjm-Tehrani, Simin</creatorcontrib><description>The ability to explain why a machine learning model arrives at a particular prediction is crucial when used as decision support by human operators of critical systems. The provided explanations must be provably correct, and preferably without redundant information, called minimal explanations. In this paper, we aim at finding explanations for predictions made by tree ensembles that are not only minimal, but also minimum with respect to a cost function. To this end, we first present a highly efficient oracle that can determine the correctness of explanations, surpassing the runtime performance of current state-of-the-art alternatives by several orders of magnitude when computing minimal explanations. Secondly, we adapt an algorithm called MARCO from related works (calling it m-MARCO) for the purpose of computing a single minimum explanation per prediction, and demonstrate an overall speedup factor of two compared to the MARCO algorithm which enumerates all minimal explanations. Finally, we study the obtained explanations from a range of use cases, leading to further insights of their characteristics. In particular, we observe that in several cases, there are more than 100,000 minimal explanations to choose from for a single prediction. In these cases, we see that only a small portion of the minimal explanations are also minimum, and that the minimum explanations are significantly less verbose, hence motivating the aim of this work.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Computation ; Cost function ; Machine learning</subject><ispartof>arXiv.org, 2023-03</ispartof><rights>2023. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Törnblom, John</creatorcontrib><creatorcontrib>Karlsson, Emil</creatorcontrib><creatorcontrib>Nadjm-Tehrani, Simin</creatorcontrib><title>Finding Minimum-Cost Explanations for Predictions made by Tree Ensembles</title><title>arXiv.org</title><description>The ability to explain why a machine learning model arrives at a particular prediction is crucial when used as decision support by human operators of critical systems. The provided explanations must be provably correct, and preferably without redundant information, called minimal explanations. In this paper, we aim at finding explanations for predictions made by tree ensembles that are not only minimal, but also minimum with respect to a cost function. To this end, we first present a highly efficient oracle that can determine the correctness of explanations, surpassing the runtime performance of current state-of-the-art alternatives by several orders of magnitude when computing minimal explanations. Secondly, we adapt an algorithm called MARCO from related works (calling it m-MARCO) for the purpose of computing a single minimum explanation per prediction, and demonstrate an overall speedup factor of two compared to the MARCO algorithm which enumerates all minimal explanations. Finally, we study the obtained explanations from a range of use cases, leading to further insights of their characteristics. In particular, we observe that in several cases, there are more than 100,000 minimal explanations to choose from for a single prediction. In these cases, we see that only a small portion of the minimal explanations are also minimum, and that the minimum explanations are significantly less verbose, hence motivating the aim of this work.</description><subject>Algorithms</subject><subject>Computation</subject><subject>Cost function</subject><subject>Machine learning</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNitEKgjAUQEcQJOU_XOhZsM0130XxJejBd5l5jYnbbFeh_r6gPqCnw-GcDYu4EKckzzjfsZhoTNOUnxWXUkSsrozrjbvDxThjV5sUnhYon_OknV6MdwSDD3AN2Jvb163uEboXNAERSkdouwnpwLaDngjjH_fsWJVNUSdz8I8VaWlHvwb3SS1XuVJCZjIX_11vN547_w</recordid><startdate>20230316</startdate><enddate>20230316</enddate><creator>Törnblom, John</creator><creator>Karlsson, Emil</creator><creator>Nadjm-Tehrani, Simin</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20230316</creationdate><title>Finding Minimum-Cost Explanations for Predictions made by Tree Ensembles</title><author>Törnblom, John ; Karlsson, Emil ; Nadjm-Tehrani, Simin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_27877354583</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Computation</topic><topic>Cost function</topic><topic>Machine learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Törnblom, John</creatorcontrib><creatorcontrib>Karlsson, Emil</creatorcontrib><creatorcontrib>Nadjm-Tehrani, Simin</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Törnblom, John</au><au>Karlsson, Emil</au><au>Nadjm-Tehrani, Simin</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Finding Minimum-Cost Explanations for Predictions made by Tree Ensembles</atitle><jtitle>arXiv.org</jtitle><date>2023-03-16</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>The ability to explain why a machine learning model arrives at a particular prediction is crucial when used as decision support by human operators of critical systems. The provided explanations must be provably correct, and preferably without redundant information, called minimal explanations. In this paper, we aim at finding explanations for predictions made by tree ensembles that are not only minimal, but also minimum with respect to a cost function. To this end, we first present a highly efficient oracle that can determine the correctness of explanations, surpassing the runtime performance of current state-of-the-art alternatives by several orders of magnitude when computing minimal explanations. Secondly, we adapt an algorithm called MARCO from related works (calling it m-MARCO) for the purpose of computing a single minimum explanation per prediction, and demonstrate an overall speedup factor of two compared to the MARCO algorithm which enumerates all minimal explanations. Finally, we study the obtained explanations from a range of use cases, leading to further insights of their characteristics. In particular, we observe that in several cases, there are more than 100,000 minimal explanations to choose from for a single prediction. In these cases, we see that only a small portion of the minimal explanations are also minimum, and that the minimum explanations are significantly less verbose, hence motivating the aim of this work.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2023-03
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2787735458
source	Free E- Journals
subjects	Algorithms Computation Cost function Machine learning
title	Finding Minimum-Cost Explanations for Predictions made by Tree Ensembles
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T07%3A11%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Finding%20Minimum-Cost%20Explanations%20for%20Predictions%20made%20by%20Tree%20Ensembles&rft.jtitle=arXiv.org&rft.au=T%C3%B6rnblom,%20John&rft.date=2023-03-16&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2787735458%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2787735458&rft_id=info:pmid/&rfr_iscdi=true