Improving the classification of call center service dialogue with key utterences

In the field of customer service management, classifying service dialogues to different business labels is beneficial for managers to improve their service quality. However, the size of labeled service dialogue dataset in real scenarios is usually small due to the expensive labeling cost, which make...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Wireless networks 2021-07, Vol.27 (5), p.3395-3406
Hauptverfasser: Liu, Yuqi, Cao, Bin, Ma, Kui, Fan, Jing
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In the field of customer service management, classifying service dialogues to different business labels is beneficial for managers to improve their service quality. However, the size of labeled service dialogue dataset in real scenarios is usually small due to the expensive labeling cost, which makes it difficult to fully train the supervised classification models. Moreover, the service dialogue usually contains chitchat which can be regarded as the noise affecting the classification performance. Existing text classification methods fail to address above two issues simultaneously. Hence, in this paper, we propose a dialogue classification algorithm that strengthens the influence of the business-related utterances in the dialogue and use them as the key utterances to improve the classification. Firstly, we propose key utterance labels that can indicate which utterances in the dialogue are key utterances. Then, we propose the dialogue classification model that is based on the key utterance labels and logistic regression, namely KU-LR. The KU-LR can learn the key utterance patterns and increase the importance of key utterances in the dialogue, and then the KU-LR makes more accurate decisions for dialogue classification. The experimental results on real-world dataset show that the KU-LR method outperforms other baselines when the training dataset is small.
ISSN:1022-0038
1572-8196
DOI:10.1007/s11276-021-02573-7