Unsupervised Topic Segmentation of Meetings with BERT Embeddings
Topic segmentation of meetings is the task of dividing multi-person meeting transcripts into topic blocks. Supervised approaches to the problem have proven intractable due to the difficulties in collecting and accurately annotating large datasets. In this paper we show how previous unsupervised topi...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Topic segmentation of meetings is the task of dividing multi-person meeting
transcripts into topic blocks. Supervised approaches to the problem have proven
intractable due to the difficulties in collecting and accurately annotating
large datasets. In this paper we show how previous unsupervised topic
segmentation methods can be improved using pre-trained neural architectures. We
introduce an unsupervised approach based on BERT embeddings that achieves a
15.5% reduction in error rate over existing unsupervised approaches applied to
two popular datasets for meeting transcripts. |
---|---|
DOI: | 10.48550/arxiv.2106.12978 |