Imbalance accuracy metric for model selection in multi-class imbalance classification problems

The overall accuracy, macro precision, macro recall, F-score and class balance accuracy, due to their simplicity and easy interpretation, have been among the most popular metrics to measure the performance of classifiers on multi-class problems. However, on imbalance datasets, some of these metrics...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Knowledge-based systems 2020-12, Vol.210, p.106490, Article 106490
1. Verfasser: Mortaz, Ebrahim
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 106490
container_title Knowledge-based systems
container_volume 210
creator Mortaz, Ebrahim
description The overall accuracy, macro precision, macro recall, F-score and class balance accuracy, due to their simplicity and easy interpretation, have been among the most popular metrics to measure the performance of classifiers on multi-class problems. However, on imbalance datasets, some of these metrics can be unfairly influenced by heavier classes. Therefore, it is recommended that they are used as a group and not individually. This strategy can unnecessarily complicate the model selection and evaluation in imbalance datasets. In this paper, we introduce a new metric, imbalance accuracy metric (IAM), that can be used as a solo measure for model evaluation and selection. The IAM is built up on top of the existing metrics, is simple to use, and easy to interpret. This metric is meant to be used as a bottom-line measure aiming to eliminate the need for group metric computation and simplify the model selection. •The IAM is proposed as a metric for model selection in multi-class imbalance problems.•The IAM is built up on top of the existing metrics and is simple to use.•The IAM shows how well a classifier does not classify an instance in incorrect classes.•The IAM is to eliminate the need for multiple metric computation in model selection.
doi_str_mv 10.1016/j.knosys.2020.106490
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2470032860</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0950705120306195</els_id><sourcerecordid>2470032860</sourcerecordid><originalsourceid>FETCH-LOGICAL-c334t-14bcc37a61e1346b1cf79c93b546ed3c9cd6b0c5871e2cc0acad68fcdf78f49c3</originalsourceid><addsrcrecordid>eNp9kEtLxDAUhYMoOI7-AxcB1x1vHk3ajSCDLxhwo1tDeptAxj7GpBXm39uZiktXl3s551zOR8g1gxUDpm63q8-uT_u04sAPJyVLOCELVmieaQnlKVlAmUOmIWfn5CKlLQBwzooF-XhpK9vYDh21iGO0uKetG2JA6vtI2752DU2ucTiEvqOho-3YDCHDxqZEw5_5uAcf0B51u9hXjWvTJTnztknu6ncuyfvjw9v6Odu8Pr2s7zcZCiGHjMkKUWirmGNCqoqh1yWWosqlcrXAEmtVAeaFZo4jgkVbq8Jj7XXhZYliSW7m3Onx1-jSYLb9GLvppeFSAwheKJhUclZh7FOKzptdDK2Ne8PAHEiarZlJmgNJM5OcbHezzU0NvoOLJmFwU-s6xImLqfvwf8APpKKArg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2470032860</pqid></control><display><type>article</type><title>Imbalance accuracy metric for model selection in multi-class imbalance classification problems</title><source>Access via ScienceDirect (Elsevier)</source><creator>Mortaz, Ebrahim</creator><creatorcontrib>Mortaz, Ebrahim</creatorcontrib><description>The overall accuracy, macro precision, macro recall, F-score and class balance accuracy, due to their simplicity and easy interpretation, have been among the most popular metrics to measure the performance of classifiers on multi-class problems. However, on imbalance datasets, some of these metrics can be unfairly influenced by heavier classes. Therefore, it is recommended that they are used as a group and not individually. This strategy can unnecessarily complicate the model selection and evaluation in imbalance datasets. In this paper, we introduce a new metric, imbalance accuracy metric (IAM), that can be used as a solo measure for model evaluation and selection. The IAM is built up on top of the existing metrics, is simple to use, and easy to interpret. This metric is meant to be used as a bottom-line measure aiming to eliminate the need for group metric computation and simplify the model selection. •The IAM is proposed as a metric for model selection in multi-class imbalance problems.•The IAM is built up on top of the existing metrics and is simple to use.•The IAM shows how well a classifier does not classify an instance in incorrect classes.•The IAM is to eliminate the need for multiple metric computation in model selection.</description><identifier>ISSN: 0950-7051</identifier><identifier>EISSN: 1872-7409</identifier><identifier>DOI: 10.1016/j.knosys.2020.106490</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Accuracy ; Classification accuracy ; Datasets ; Imbalance datasets ; Knowledge discovery ; Machine learning ; Model accuracy ; Multi-class problems</subject><ispartof>Knowledge-based systems, 2020-12, Vol.210, p.106490, Article 106490</ispartof><rights>2020 Elsevier B.V.</rights><rights>Copyright Elsevier Science Ltd. Dec 27, 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c334t-14bcc37a61e1346b1cf79c93b546ed3c9cd6b0c5871e2cc0acad68fcdf78f49c3</citedby><cites>FETCH-LOGICAL-c334t-14bcc37a61e1346b1cf79c93b546ed3c9cd6b0c5871e2cc0acad68fcdf78f49c3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.knosys.2020.106490$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>315,781,785,3551,27926,27927,45997</link.rule.ids></links><search><creatorcontrib>Mortaz, Ebrahim</creatorcontrib><title>Imbalance accuracy metric for model selection in multi-class imbalance classification problems</title><title>Knowledge-based systems</title><description>The overall accuracy, macro precision, macro recall, F-score and class balance accuracy, due to their simplicity and easy interpretation, have been among the most popular metrics to measure the performance of classifiers on multi-class problems. However, on imbalance datasets, some of these metrics can be unfairly influenced by heavier classes. Therefore, it is recommended that they are used as a group and not individually. This strategy can unnecessarily complicate the model selection and evaluation in imbalance datasets. In this paper, we introduce a new metric, imbalance accuracy metric (IAM), that can be used as a solo measure for model evaluation and selection. The IAM is built up on top of the existing metrics, is simple to use, and easy to interpret. This metric is meant to be used as a bottom-line measure aiming to eliminate the need for group metric computation and simplify the model selection. •The IAM is proposed as a metric for model selection in multi-class imbalance problems.•The IAM is built up on top of the existing metrics and is simple to use.•The IAM shows how well a classifier does not classify an instance in incorrect classes.•The IAM is to eliminate the need for multiple metric computation in model selection.</description><subject>Accuracy</subject><subject>Classification accuracy</subject><subject>Datasets</subject><subject>Imbalance datasets</subject><subject>Knowledge discovery</subject><subject>Machine learning</subject><subject>Model accuracy</subject><subject>Multi-class problems</subject><issn>0950-7051</issn><issn>1872-7409</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kEtLxDAUhYMoOI7-AxcB1x1vHk3ajSCDLxhwo1tDeptAxj7GpBXm39uZiktXl3s551zOR8g1gxUDpm63q8-uT_u04sAPJyVLOCELVmieaQnlKVlAmUOmIWfn5CKlLQBwzooF-XhpK9vYDh21iGO0uKetG2JA6vtI2752DU2ucTiEvqOho-3YDCHDxqZEw5_5uAcf0B51u9hXjWvTJTnztknu6ncuyfvjw9v6Odu8Pr2s7zcZCiGHjMkKUWirmGNCqoqh1yWWosqlcrXAEmtVAeaFZo4jgkVbq8Jj7XXhZYliSW7m3Onx1-jSYLb9GLvppeFSAwheKJhUclZh7FOKzptdDK2Ne8PAHEiarZlJmgNJM5OcbHezzU0NvoOLJmFwU-s6xImLqfvwf8APpKKArg</recordid><startdate>20201227</startdate><enddate>20201227</enddate><creator>Mortaz, Ebrahim</creator><general>Elsevier B.V</general><general>Elsevier Science Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>E3H</scope><scope>F2A</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20201227</creationdate><title>Imbalance accuracy metric for model selection in multi-class imbalance classification problems</title><author>Mortaz, Ebrahim</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c334t-14bcc37a61e1346b1cf79c93b546ed3c9cd6b0c5871e2cc0acad68fcdf78f49c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Accuracy</topic><topic>Classification accuracy</topic><topic>Datasets</topic><topic>Imbalance datasets</topic><topic>Knowledge discovery</topic><topic>Machine learning</topic><topic>Model accuracy</topic><topic>Multi-class problems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mortaz, Ebrahim</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Knowledge-based systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mortaz, Ebrahim</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Imbalance accuracy metric for model selection in multi-class imbalance classification problems</atitle><jtitle>Knowledge-based systems</jtitle><date>2020-12-27</date><risdate>2020</risdate><volume>210</volume><spage>106490</spage><pages>106490-</pages><artnum>106490</artnum><issn>0950-7051</issn><eissn>1872-7409</eissn><abstract>The overall accuracy, macro precision, macro recall, F-score and class balance accuracy, due to their simplicity and easy interpretation, have been among the most popular metrics to measure the performance of classifiers on multi-class problems. However, on imbalance datasets, some of these metrics can be unfairly influenced by heavier classes. Therefore, it is recommended that they are used as a group and not individually. This strategy can unnecessarily complicate the model selection and evaluation in imbalance datasets. In this paper, we introduce a new metric, imbalance accuracy metric (IAM), that can be used as a solo measure for model evaluation and selection. The IAM is built up on top of the existing metrics, is simple to use, and easy to interpret. This metric is meant to be used as a bottom-line measure aiming to eliminate the need for group metric computation and simplify the model selection. •The IAM is proposed as a metric for model selection in multi-class imbalance problems.•The IAM is built up on top of the existing metrics and is simple to use.•The IAM shows how well a classifier does not classify an instance in incorrect classes.•The IAM is to eliminate the need for multiple metric computation in model selection.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.knosys.2020.106490</doi></addata></record>
fulltext fulltext
identifier ISSN: 0950-7051
ispartof Knowledge-based systems, 2020-12, Vol.210, p.106490, Article 106490
issn 0950-7051
1872-7409
language eng
recordid cdi_proquest_journals_2470032860
source Access via ScienceDirect (Elsevier)
subjects Accuracy
Classification accuracy
Datasets
Imbalance datasets
Knowledge discovery
Machine learning
Model accuracy
Multi-class problems
title Imbalance accuracy metric for model selection in multi-class imbalance classification problems
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-17T20%3A10%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Imbalance%20accuracy%20metric%20for%20model%20selection%20in%20multi-class%20imbalance%20classification%20problems&rft.jtitle=Knowledge-based%20systems&rft.au=Mortaz,%20Ebrahim&rft.date=2020-12-27&rft.volume=210&rft.spage=106490&rft.pages=106490-&rft.artnum=106490&rft.issn=0950-7051&rft.eissn=1872-7409&rft_id=info:doi/10.1016/j.knosys.2020.106490&rft_dat=%3Cproquest_cross%3E2470032860%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2470032860&rft_id=info:pmid/&rft_els_id=S0950705120306195&rfr_iscdi=true