Imbalance accuracy metric for model selection in multi-class imbalance classification problems

The overall accuracy, macro precision, macro recall, F-score and class balance accuracy, due to their simplicity and easy interpretation, have been among the most popular metrics to measure the performance of classifiers on multi-class problems. However, on imbalance datasets, some of these metrics...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Knowledge-based systems 2020-12, Vol.210, p.106490, Article 106490
1. Verfasser:	Mortaz, Ebrahim
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Classification accuracy Datasets Imbalance datasets Knowledge discovery Machine learning Model accuracy Multi-class problems
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page	106490
container_title	Knowledge-based systems
container_volume	210
creator	Mortaz, Ebrahim
description	The overall accuracy, macro precision, macro recall, F-score and class balance accuracy, due to their simplicity and easy interpretation, have been among the most popular metrics to measure the performance of classifiers on multi-class problems. However, on imbalance datasets, some of these metrics can be unfairly influenced by heavier classes. Therefore, it is recommended that they are used as a group and not individually. This strategy can unnecessarily complicate the model selection and evaluation in imbalance datasets. In this paper, we introduce a new metric, imbalance accuracy metric (IAM), that can be used as a solo measure for model evaluation and selection. The IAM is built up on top of the existing metrics, is simple to use, and easy to interpret. This metric is meant to be used as a bottom-line measure aiming to eliminate the need for group metric computation and simplify the model selection. •The IAM is proposed as a metric for model selection in multi-class imbalance problems.•The IAM is built up on top of the existing metrics and is simple to use.•The IAM shows how well a classifier does not classify an instance in incorrect classes.•The IAM is to eliminate the need for multiple metric computation in model selection.
doi_str_mv	10.1016/j.knosys.2020.106490
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2470032860</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0950705120306195</els_id><sourcerecordid>2470032860</sourcerecordid><originalsourceid>FETCH-LOGICAL-c334t-14bcc37a61e1346b1cf79c93b546ed3c9cd6b0c5871e2cc0acad68fcdf78f49c3</originalsourceid><addsrcrecordid>eNp9kEtLxDAUhYMoOI7-AxcB1x1vHk3ajSCDLxhwo1tDeptAxj7GpBXm39uZiktXl3s551zOR8g1gxUDpm63q8-uT_u04sAPJyVLOCELVmieaQnlKVlAmUOmIWfn5CKlLQBwzooF-XhpK9vYDh21iGO0uKetG2JA6vtI2752DU2ucTiEvqOho-3YDCHDxqZEw5_5uAcf0B51u9hXjWvTJTnztknu6ncuyfvjw9v6Odu8Pr2s7zcZCiGHjMkKUWirmGNCqoqh1yWWosqlcrXAEmtVAeaFZo4jgkVbq8Jj7XXhZYliSW7m3Onx1-jSYLb9GLvppeFSAwheKJhUclZh7FOKzptdDK2Ne8PAHEiarZlJmgNJM5OcbHezzU0NvoOLJmFwU-s6xImLqfvwf8APpKKArg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2470032860</pqid></control><display><type>article</type><title>Imbalance accuracy metric for model selection in multi-class imbalance classification problems</title><source>Access via ScienceDirect (Elsevier)</source><creator>Mortaz, Ebrahim</creator><creatorcontrib>Mortaz, Ebrahim</creatorcontrib><description>The overall accuracy, macro precision, macro recall, F-score and class balance accuracy, due to their simplicity and easy interpretation, have been among the most popular metrics to measure the performance of classifiers on multi-class problems. However, on imbalance datasets, some of these metrics can be unfairly influenced by heavier classes. Therefore, it is recommended that they are used as a group and not individually. This strategy can unnecessarily complicate the model selection and evaluation in imbalance datasets. In this paper, we introduce a new metric, imbalance accuracy metric (IAM), that can be used as a solo measure for model evaluation and selection. The IAM is built up on top of the existing metrics, is simple to use, and easy to interpret. This metric is meant to be used as a bottom-line measure aiming to eliminate the need for group metric computation and simplify the model selection. •The IAM is proposed as a metric for model selection in multi-class imbalance problems.•The IAM is built up on top of the existing metrics and is simple to use.•The IAM shows how well a classifier does not classify an instance in incorrect classes.•The IAM is to eliminate the need for multiple metric computation in model selection.</description><identifier>ISSN: 0950-7051</identifier><identifier>EISSN: 1872-7409</identifier><identifier>DOI: 10.1016/j.knosys.2020.106490</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Accuracy ; Classification accuracy ; Datasets ; Imbalance datasets ; Knowledge discovery ; Machine learning ; Model accuracy ; Multi-class problems</subject><ispartof>Knowledge-based systems, 2020-12, Vol.210, p.106490, Article 106490</ispartof><rights>2020 Elsevier B.V.</rights><rights>Copyright Elsevier Science Ltd. Dec 27, 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c334t-14bcc37a61e1346b1cf79c93b546ed3c9cd6b0c5871e2cc0acad68fcdf78f49c3</citedby><cites>FETCH-LOGICAL-c334t-14bcc37a61e1346b1cf79c93b546ed3c9cd6b0c5871e2cc0acad68fcdf78f49c3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.knosys.2020.106490$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>315,781,785,3551,27926,27927,45997</link.rule.ids></links><search><creatorcontrib>Mortaz, Ebrahim</creatorcontrib><title>Imbalance accuracy metric for model selection in multi-class imbalance classification problems</title><title>Knowledge-based systems</title><description>The overall accuracy, macro precision, macro recall, F-score and class balance accuracy, due to their simplicity and easy interpretation, have been among the most popular metrics to measure the performance of classifiers on multi-class problems. However, on imbalance datasets, some of these metrics can be unfairly influenced by heavier classes. Therefore, it is recommended that they are used as a group and not individually. This strategy can unnecessarily complicate the model selection and evaluation in imbalance datasets. In this paper, we introduce a new metric, imbalance accuracy metric (IAM), that can be used as a solo measure for model evaluation and selection. The IAM is built up on top of the existing metrics, is simple to use, and easy to interpret. This metric is meant to be used as a bottom-line measure aiming to eliminate the need for group metric computation and simplify the model selection. •The IAM is proposed as a metric for model selection in multi-class imbalance problems.•The IAM is built up on top of the existing metrics and is simple to use.•The IAM shows how well a classifier does not classify an instance in incorrect classes.•The IAM is to eliminate the need for multiple metric computation in model selection.</description><subject>Accuracy</subject><subject>Classification accuracy</subject><subject>Datasets</subject><subject>Imbalance datasets</subject><subject>Knowledge discovery</subject><subject>Machine learning</subject><subject>Model accuracy</subject><subject>Multi-class problems</subject><issn>0950-7051</issn><issn>1872-7409</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kEtLxDAUhYMoOI7-AxcB1x1vHk3ajSCDLxhwo1tDeptAxj7GpBXm39uZiktXl3s551zOR8g1gxUDpm63q8-uT_u04sAPJyVLOCELVmieaQnlKVlAmUOmIWfn5CKlLQBwzooF-XhpK9vYDh21iGO0uKetG2JA6vtI2752DU2ucTiEvqOho-3YDCHDxqZEw5_5uAcf0B51u9hXjWvTJTnztknu6ncuyfvjw9v6Odu8Pr2s7zcZCiGHjMkKUWirmGNCqoqh1yWWosqlcrXAEmtVAeaFZo4jgkVbq8Jj7XXhZYliSW7m3Onx1-jSYLb9GLvppeFSAwheKJhUclZh7FOKzptdDK2Ne8PAHEiarZlJmgNJM5OcbHezzU0NvoOLJmFwU-s6xImLqfvwf8APpKKArg</recordid><startdate>20201227</startdate><enddate>20201227</enddate><creator>Mortaz, Ebrahim</creator><general>Elsevier B.V</general><general>Elsevier Science Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>E3H</scope><scope>F2A</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20201227</creationdate><title>Imbalance accuracy metric for model selection in multi-class imbalance classification problems</title><author>Mortaz, Ebrahim</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c334t-14bcc37a61e1346b1cf79c93b546ed3c9cd6b0c5871e2cc0acad68fcdf78f49c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Accuracy</topic><topic>Classification accuracy</topic><topic>Datasets</topic><topic>Imbalance datasets</topic><topic>Knowledge discovery</topic><topic>Machine learning</topic><topic>Model accuracy</topic><topic>Multi-class problems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mortaz, Ebrahim</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Library & Information Sciences Abstracts (LISA)</collection><collection>Library & Information Science Abstracts (LISA)</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Knowledge-based systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mortaz, Ebrahim</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Imbalance accuracy metric for model selection in multi-class imbalance classification problems</atitle><jtitle>Knowledge-based systems</jtitle><date>2020-12-27</date><risdate>2020</risdate><volume>210</volume><spage>106490</spage><pages>106490-</pages><artnum>106490</artnum><issn>0950-7051</issn><eissn>1872-7409</eissn><abstract>The overall accuracy, macro precision, macro recall, F-score and class balance accuracy, due to their simplicity and easy interpretation, have been among the most popular metrics to measure the performance of classifiers on multi-class problems. However, on imbalance datasets, some of these metrics can be unfairly influenced by heavier classes. Therefore, it is recommended that they are used as a group and not individually. This strategy can unnecessarily complicate the model selection and evaluation in imbalance datasets. In this paper, we introduce a new metric, imbalance accuracy metric (IAM), that can be used as a solo measure for model evaluation and selection. The IAM is built up on top of the existing metrics, is simple to use, and easy to interpret. This metric is meant to be used as a bottom-line measure aiming to eliminate the need for group metric computation and simplify the model selection. •The IAM is proposed as a metric for model selection in multi-class imbalance problems.•The IAM is built up on top of the existing metrics and is simple to use.•The IAM shows how well a classifier does not classify an instance in incorrect classes.•The IAM is to eliminate the need for multiple metric computation in model selection.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.knosys.2020.106490</doi></addata></record>
fulltext	fulltext
identifier	ISSN: 0950-7051
ispartof	Knowledge-based systems, 2020-12, Vol.210, p.106490, Article 106490
issn	0950-7051 1872-7409
language	eng
recordid	cdi_proquest_journals_2470032860
source	Access via ScienceDirect (Elsevier)
subjects	Accuracy Classification accuracy Datasets Imbalance datasets Knowledge discovery Machine learning Model accuracy Multi-class problems
title	Imbalance accuracy metric for model selection in multi-class imbalance classification problems
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-17T20%3A10%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Imbalance%20accuracy%20metric%20for%20model%20selection%20in%20multi-class%20imbalance%20classification%20problems&rft.jtitle=Knowledge-based%20systems&rft.au=Mortaz,%20Ebrahim&rft.date=2020-12-27&rft.volume=210&rft.spage=106490&rft.pages=106490-&rft.artnum=106490&rft.issn=0950-7051&rft.eissn=1872-7409&rft_id=info:doi/10.1016/j.knosys.2020.106490&rft_dat=%3Cproquest_cross%3E2470032860%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2470032860&rft_id=info:pmid/&rft_els_id=S0950705120306195&rfr_iscdi=true