Gradual Domain Adaptation via Manifold-Constrained Distributionally Robust Optimization
The aim of this paper is to address the challenge of gradual domain adaptation within a class of manifold-constrained data distributions. In particular, we consider a sequence of $T\ge2$ data distributions $P_1,\ldots,P_T$ undergoing a gradual shift, where each pair of consecutive measures $P_i,P_{i...
Gespeichert in:
Hauptverfasser: | , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The aim of this paper is to address the challenge of gradual domain
adaptation within a class of manifold-constrained data distributions. In
particular, we consider a sequence of $T\ge2$ data distributions
$P_1,\ldots,P_T$ undergoing a gradual shift, where each pair of consecutive
measures $P_i,P_{i+1}$ are close to each other in Wasserstein distance. We have
a supervised dataset of size $n$ sampled from $P_0$, while for the subsequent
distributions in the sequence, only unlabeled i.i.d. samples are available.
Moreover, we assume that all distributions exhibit a known favorable attribute,
such as (but not limited to) having intra-class soft/hard margins. In this
context, we propose a methodology rooted in Distributionally Robust
Optimization (DRO) with an adaptive Wasserstein radius. We theoretically show
that this method guarantees the classification error across all $P_i$s can be
suitably bounded. Our bounds rely on a newly introduced {\it {compatibility}}
measure, which fully characterizes the error propagation dynamics along the
sequence. Specifically, for inadequately constrained distributions, the error
can exponentially escalate as we progress through the gradual shifts.
Conversely, for appropriately constrained distributions, the error can be
demonstrated to be linear or even entirely eradicated. We have substantiated
our theoretical findings through several experimental results. |
---|---|
DOI: | 10.48550/arxiv.2410.14061 |