Adapting Coreference Resolution Models through Active Learning
Neural coreference resolution models trained on one dataset may not transfer to new, low-resource domains. Active learning mitigates this problem by sampling a small subset of data for annotators to label. While active learning is well-defined for classification tasks, its application to coreference...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Neural coreference resolution models trained on one dataset may not transfer
to new, low-resource domains. Active learning mitigates this problem by
sampling a small subset of data for annotators to label. While active learning
is well-defined for classification tasks, its application to coreference
resolution is neither well-defined nor fully understood. This paper explores
how to actively label coreference, examining sources of model uncertainty and
document reading costs. We compare uncertainty sampling strategies and their
advantages through thorough error analysis. In both synthetic and human
experiments, labeling spans within the same document is more effective than
annotating spans across documents. The findings contribute to a more realistic
development of coreference resolution models. |
---|---|
DOI: | 10.48550/arxiv.2104.07611 |