VCD-Texture: Variance Alignment based 3D-2D Co-Denoising for Text-Guided Texturing
Recent research on texture synthesis for 3D shapes benefits a lot from dramatically developed 2D text-to-image diffusion models, including inpainting-based and optimization-based approaches. However, these methods ignore the modal gap between the 2D diffusion model and 3D objects, which primarily re...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Recent research on texture synthesis for 3D shapes benefits a lot from
dramatically developed 2D text-to-image diffusion models, including
inpainting-based and optimization-based approaches. However, these methods
ignore the modal gap between the 2D diffusion model and 3D objects, which
primarily render 3D objects into 2D images and texture each image separately.
In this paper, we revisit the texture synthesis and propose a Variance
alignment based 3D-2D Collaborative Denoising framework, dubbed VCD-Texture, to
address these issues. Formally, we first unify both 2D and 3D latent feature
learning in diffusion self-attention modules with re-projected 3D attention
receptive fields. Subsequently, the denoised multi-view 2D latent features are
aggregated into 3D space and then rasterized back to formulate more consistent
2D predictions. However, the rasterization process suffers from an intractable
variance bias, which is theoretically addressed by the proposed variance
alignment, achieving high-fidelity texture synthesis. Moreover, we present an
inpainting refinement to further improve the details with conflicting regions.
Notably, there is not a publicly available benchmark to evaluate texture
synthesis, which hinders its development. Thus we construct a new evaluation
set built upon three open-source 3D datasets and propose to use four metrics to
thoroughly validate the texturing performance. Comprehensive experiments
demonstrate that VCD-Texture achieves superior performance against other
counterparts. |
---|---|
DOI: | 10.48550/arxiv.2407.04461 |