Quality Estimation with $k$-nearest Neighbors and Automatic Evaluation for Model-specific Quality Estimation
Providing quality scores along with Machine Translation (MT) output, so-called reference-free Quality Estimation (QE), is crucial to inform users about the reliability of the translation. We propose a model-specific, unsupervised QE approach, termed $k$NN-QE, that extracts information from the MT mo...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Providing quality scores along with Machine Translation (MT) output,
so-called reference-free Quality Estimation (QE), is crucial to inform users
about the reliability of the translation. We propose a model-specific,
unsupervised QE approach, termed $k$NN-QE, that extracts information from the
MT model's training data using $k$-nearest neighbors. Measuring the performance
of model-specific QE is not straightforward, since they provide quality scores
on their own MT output, thus cannot be evaluated using benchmark QE test sets
containing human quality scores on premade MT output. Therefore, we propose an
automatic evaluation method that uses quality scores from reference-based
metrics as gold standard instead of human-generated ones. We are the first to
conduct detailed analyses and conclude that this automatic method is
sufficient, and the reference-based MetricX-23 is best for the task. |
---|---|
DOI: | 10.48550/arxiv.2404.18031 |