TARDiS : Text Augmentation for Refining Diversity and Separability
Text augmentation (TA) is a critical technique for text classification, especially in few-shot settings. This paper introduces a novel LLM-based TA method, TARDiS, to address challenges inherent in the generation and alignment stages of two-stage TA methods. For the generation stage, we propose two...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Text augmentation (TA) is a critical technique for text classification,
especially in few-shot settings. This paper introduces a novel LLM-based TA
method, TARDiS, to address challenges inherent in the generation and alignment
stages of two-stage TA methods. For the generation stage, we propose two
generation processes, SEG and CEG, incorporating multiple class-specific
prompts to enhance diversity and separability. For the alignment stage, we
introduce a class adaptation (CA) method to ensure that generated examples
align with their target classes through verification and modification.
Experimental results demonstrate TARDiS's effectiveness, outperforming
state-of-the-art LLM-based TA methods in various few-shot text classification
tasks. An in-depth analysis confirms the detailed behaviors at each stage. |
---|---|
DOI: | 10.48550/arxiv.2501.02739 |