Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling
Most of the previous 3D human pose estimation work relied on the powerful memory capability of the network to obtain suitable 2D-3D mappings from the training data. Few works have studied the modeling of human posture deformation in motion. In this paper, we propose a new modeling method for human p...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Most of the previous 3D human pose estimation work relied on the powerful
memory capability of the network to obtain suitable 2D-3D mappings from the
training data. Few works have studied the modeling of human posture deformation
in motion. In this paper, we propose a new modeling method for human pose
deformations and design an accompanying diffusion-based motion prior. Inspired
by the field of non-rigid structure-from-motion, we divide the task of
reconstructing 3D human skeletons in motion into the estimation of a 3D
reference skeleton, and a frame-by-frame skeleton deformation. A mixed
spatial-temporal NRSfMformer is used to simultaneously estimate the 3D
reference skeleton and the skeleton deformation of each frame from 2D
observations sequence, and then sum them to obtain the pose of each frame.
Subsequently, a loss term based on the diffusion model is used to ensure that
the pipeline learns the correct prior motion knowledge. Finally, we have
evaluated our proposed method on mainstream datasets and obtained superior
results outperforming the state-of-the-art. |
---|---|
DOI: | 10.48550/arxiv.2308.10705 |