Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis Paper • 2412.15322 • Published 6 days ago • 16
Learning from Massive Human Videos for Universal Humanoid Pose Control Paper • 2412.14172 • Published 7 days ago • 10
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes Paper • 2412.11457 • Published 10 days ago • 5
Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion Paper • 2412.09593 • Published 13 days ago • 17
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale Paper • 2412.06699 • Published 16 days ago • 11
PanoDreamer: 3D Panorama Synthesis from a Single Image Paper • 2412.04827 • Published 20 days ago • 10
MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation Paper • 2412.03558 • Published 21 days ago • 14
LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting Paper • 2412.00177 • Published 26 days ago • 7
Structured 3D Latents for Scalable and Versatile 3D Generation Paper • 2412.01506 • Published 24 days ago • 46
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters Paper • 2412.00174 • Published 26 days ago • 22
Learning 3D Representations from Procedural 3D Programs Paper • 2411.17467 • Published about 1 month ago • 8
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE Paper • 2411.16856 • Published about 1 month ago • 11
Material Anything: Generating Materials for Any 3D Object via Diffusion Paper • 2411.15138 • Published Nov 22 • 42
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation Paper • 2411.14384 • Published Nov 21 • 9
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations Paper • 2411.10818 • Published Nov 16 • 24
GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation Paper • 2411.08033 • Published Nov 12 • 22
Acoustic Volume Rendering for Neural Impulse Response Fields Paper • 2411.06307 • Published Nov 9 • 5