An Empirical Study on the Overlapping Problem of Open-Domain Dialogue Datasets
Open-domain dialogue systems aim to converse with humans through text, and dialogue research has heavily relied on benchmark datasets. In this work, we observe the overlapping problem in DailyDialog and OpenSubtitles, two popular open-domain dialogue benchmark datasets. Our systematic analysis then...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Open-domain dialogue systems aim to converse with humans through text, and
dialogue research has heavily relied on benchmark datasets. In this work, we
observe the overlapping problem in DailyDialog and OpenSubtitles, two popular
open-domain dialogue benchmark datasets. Our systematic analysis then shows
that such overlapping can be exploited to obtain fake state-of-the-art
performance. Finally, we address this issue by cleaning these datasets and
setting up a proper data processing procedure for future research. |
---|---|
DOI: | 10.48550/arxiv.2201.06219 |