Warning: statistical benchmarking is addictive. Kicking the habit in machine learning

Bibliographic details
Published in:	Journal of experimental & theoretical artificial intelligence 2010-03, Vol.22 (1), p.67-80
Main authors:	Drummond, Chris; Japkowicz, Nathalie
Format:	Article
Language:	English
Subjects:	algorithm evaluation; Algorithms; Artificial intelligence; Benchmarking; Benchmarks; Communities; Cures; Expert systems; Machine learning; null hypothesis tests; Performance evaluation; Roads; Statistical methods; Statistical tests
Online access:	Full text

Description
Abstract:	Algorithm performance evaluation is so entrenched in the machine learning community that one could call it an addiction. Like most addictions, it is harmful and very difficult to give up. It is harmful because it has serious limitations. Yet, we have great faith in practicing it in a ritualistic manner: we follow a fixed set of rules telling us the measure, the data sets and the statistical test to use. When we read a paper, even as reviewers, we are not sufficiently critical of results that follow these rules. Here, we will debate what are the limitations and how to best address them. This article may not cure the addiction but hopefully it will be a good first step along that road.
ISSN:	0952-813X 1362-3079
DOI:	10.1080/09528130903010295