InterCLIP-MEP: Interactive CLIP and Memory-Enhanced Predictor for Multi-modal Sarcasm Detection
The prevalence of sarcasm in social media, conveyed through text-image combinations, presents significant challenges for sentiment analysis and intention mining. Existing multi-modal sarcasm detection methods have been proven to overestimate performance, as they struggle to effectively capture the i...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The prevalence of sarcasm in social media, conveyed through text-image
combinations, presents significant challenges for sentiment analysis and
intention mining. Existing multi-modal sarcasm detection methods have been
proven to overestimate performance, as they struggle to effectively capture the
intricate sarcastic cues that arise from the interaction between an image and
text. To address these issues, we propose InterCLIP-MEP, a novel framework for
multi-modal sarcasm detection. Specifically, we introduce an Interactive CLIP
(InterCLIP) as the backbone to extract text-image representations, enhancing
them by embedding cross-modality information directly within each encoder,
thereby improving the representations to capture text-image interactions
better. Furthermore, an efficient training strategy is designed to adapt
InterCLIP for our proposed Memory-Enhanced Predictor (MEP). MEP uses a dynamic,
fixed-length dual-channel memory to store historical knowledge of valuable test
samples during inference. It then leverages this memory as a non-parametric
classifier to derive the final prediction, offering a more robust recognition
of multi-modal sarcasm. Experiments demonstrate that InterCLIP-MEP achieves
state-of-the-art performance on the MMSD2.0 benchmark, with an accuracy
improvement of 1.08% and an F1 score improvement of 1.51% over the previous
best method. |
---|---|
DOI: | 10.48550/arxiv.2406.16464 |