A novel abstractive summarization model based on topic-aware and contrastive learning

The majority of abstractive summarization models are designed based on the Sequence-to-Sequence(Seq2Seq) architecture. These models are able to capture syntactic and contextual information between words. However, Seq2Seq-based summarization models tend to overlook global semantic information. Moreov...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	International journal of machine learning and cybernetics 2024-12, Vol.15 (12), p.5563-5577
Hauptverfasser:	Tang, Huanling, Li, Ruiquan, Duan, Wenhao, Dou, Quansheng, Lu, Mingyu
Format:	Artikel
Sprache:	eng
Schlagworte:	Artificial Intelligence Complex Systems Computational Intelligence Control Data mining Deep learning Engineering Learning Machine learning Mechatronics Neural networks Original Article Pattern Recognition Robotics Semantics Systems Biology
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The majority of abstractive summarization models are designed based on the Sequence-to-Sequence(Seq2Seq) architecture. These models are able to capture syntactic and contextual information between words. However, Seq2Seq-based summarization models tend to overlook global semantic information. Moreover, there exist inconsistency between the objective function and evaluation metrics of this model. To address these limitations, a novel model named ASTCL is proposed in this paper. It integrates the neural topic model into the Seq2Seq framework innovatively, aiming to capture the text’s global semantic information and guide the summary generation. Additionally, it incorporates contrastive learning techniques to mitigate the discrepancy between the objective loss and the evaluation metrics through scoring multiple candidate summaries. On CNN/DM XSum and NYT datasets, the experimental results demonstrate that the ASTCL model outperforms the other generic models in summarization task.
ISSN:	1868-8071 1868-808X
DOI:	10.1007/s13042-024-02263-8