Active Few-Shot Fine-Tuning
We study the question: How can we select the right data for fine-tuning to a specific task? We call this data selection problem active fine-tuning and show that it is an instance of transductive active learning, a novel generalization of classical active learning. We propose ITL, short for informati...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | We study the question: How can we select the right data for fine-tuning to a
specific task? We call this data selection problem active fine-tuning and show
that it is an instance of transductive active learning, a novel generalization
of classical active learning. We propose ITL, short for information-based
transductive learning, an approach which samples adaptively to maximize
information gained about the specified task. We are the first to show, under
general regularity assumptions, that such decision rules converge uniformly to
the smallest possible uncertainty obtainable from the accessible data. We apply
ITL to the few-shot fine-tuning of large neural networks and show that
fine-tuning with ITL learns the task with significantly fewer examples than the
state-of-the-art. |
---|---|
DOI: | 10.48550/arxiv.2402.15441 |