SLYKLatent: A Learning Framework for Gaze Estimation Using Deep Facial Feature Learning
Format: Article
Language: English
Abstract: In this research, we present SLYKLatent, a novel approach for enhancing gaze estimation that addresses appearance-instability challenges in datasets arising from aleatoric uncertainties, covariate shift, and test-domain generalization. SLYKLatent uses self-supervised learning for initial training on facial expression datasets, followed by refinement with a patch-based tri-branch network and an inverse explained-variance-weighted training loss function. In our evaluation on benchmark datasets, SLYKLatent achieves a 10.9% improvement on Gaze360, surpasses the best reported MPIIFaceGaze results by 3.8%, and leads on a subset of ETH-XGaze by 11.6%, exceeding existing methods by significant margins. Adaptability tests on RAF-DB and AffectNet yield accuracies of 86.4% and 60.9%, respectively. Ablation studies confirm the effectiveness of SLYKLatent's novel components.
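The abstract does not specify how the inverse explained-variance-weighted loss is computed. The following is a minimal, hypothetical PyTorch sketch of one plausible reading, in which each gaze output dimension's error is weighted by the inverse of its explained variance, so dimensions the model explains poorly contribute more to the total loss. The function name `inverse_ev_weighted_loss`, the clamping constant, and the per-dimension weighting scheme are illustrative assumptions, not the paper's actual formulation.

```python
import torch

def inverse_ev_weighted_loss(preds: torch.Tensor, targets: torch.Tensor) -> torch.Tensor:
    """Hypothetical sketch: weight each output dimension's squared error
    by the inverse of its explained variance.

    preds, targets: (batch, dims) tensors, e.g. predicted vs. ground-truth
    (yaw, pitch) gaze angles.
    """
    # Explained variance per output dimension: 1 - Var(error) / Var(target)
    resid_var = torch.var(targets - preds, dim=0)
    total_var = torch.var(targets, dim=0) + 1e-8
    explained_var = 1.0 - resid_var / total_var
    # Inverse-EV weights; clamp keeps them finite when explained variance
    # is near zero (or negative on a bad batch).
    weights = 1.0 / explained_var.clamp(min=1e-3)
    # Detach so the weights act as fixed per-dimension scales and do not
    # create a gradient path that rewards inflating residual variance.
    weights = weights.detach()
    # Weighted mean-squared error, summed over output dimensions.
    per_dim_mse = ((preds - targets) ** 2).mean(dim=0)
    return (weights * per_dim_mse).sum()

# Toy usage on random gaze-angle predictions (shapes and values illustrative).
preds = torch.randn(32, 2, requires_grad=True)
targets = torch.randn(32, 2)
loss = inverse_ev_weighted_loss(preds, targets)
loss.backward()
```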
DOI: 10.48550/arxiv.2402.01555