A Metrological Perspective on Reproducibility in NLP
Reproducibility has become an increasingly debated topic in NLP and ML over recent years, but so far, no commonly accepted definitions of even basic terms or concepts have emerged. The range of different definitions proposed within NLP/ML not only do not agree with each other, they are also not alig...
Gespeichert in:
Veröffentlicht in: | Computational linguistics - Association for Computational Linguistics 2022-12, Vol.48 (4), p.1125-1135 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Reproducibility has become an increasingly debated topic in NLP and ML over
recent years, but so far, no commonly accepted definitions of even basic terms
or concepts have emerged. The range of different definitions proposed within
NLP/ML not only do not agree with each other, they are also not aligned with
standard scientific definitions. This article examines the standard definitions
of repeatability and reproducibility provided by the meta-science of metrology,
and explores what they imply in terms of how to assess reproducibility, and what
adopting them would mean for reproducibility assessment in NLP/ML. It turns out
the standard definitions lead directly to a method for assessing reproducibility
in quantified terms that renders results from reproduction studies comparable
across multiple reproductions of the same original study, as well as
reproductions of different original studies. The article considers where this
method sits in relation to other aspects of NLP work one might wish to assess in
the context of reproducibility. |
---|---|
ISSN: | 0891-2017 1530-9312 |
DOI: | 10.1162/coli_a_00448 |