-
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models
Paper • 2409.07452 • Published • 21 -
Generating 3D-Consistent Videos from Unposed Internet Photos
Paper • 2411.13549 • Published -
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Paper • 2411.04928 • Published • 57 -
CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models
Paper • 2412.12093 • Published
Collections
Discover the best community collections!
Collections including paper arxiv:2409.13648
-
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Paper • 2407.11398 • Published • 10 -
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Paper • 2407.12781 • Published • 13 -
V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians
Paper • 2409.13648 • Published • 12 -
DressRecon: Freeform 4D Human Reconstruction from Monocular Video
Paper • 2409.20563 • Published • 9
-
Imagine yourself: Tuning-Free Personalized Image Generation
Paper • 2409.13346 • Published • 70 -
YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models
Paper • 2409.13592 • Published • 51 -
V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians
Paper • 2409.13648 • Published • 12
-
Controllable Text Generation for Large Language Models: A Survey
Paper • 2408.12599 • Published • 65 -
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations
Paper • 2408.12590 • Published • 36 -
Real-Time Video Generation with Pyramid Attention Broadcast
Paper • 2408.12588 • Published • 17 -
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
Paper • 2408.11039 • Published • 63
-
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models
Paper • 2409.07452 • Published • 21 -
Generating 3D-Consistent Videos from Unposed Internet Photos
Paper • 2411.13549 • Published -
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
Paper • 2411.04928 • Published • 57 -
CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models
Paper • 2412.12093 • Published
-
Imagine yourself: Tuning-Free Personalized Image Generation
Paper • 2409.13346 • Published • 70 -
YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models
Paper • 2409.13592 • Published • 51 -
V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians
Paper • 2409.13648 • Published • 12
-
Controllable Text Generation for Large Language Models: A Survey
Paper • 2408.12599 • Published • 65 -
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations
Paper • 2408.12590 • Published • 36 -
Real-Time Video Generation with Pyramid Attention Broadcast
Paper • 2408.12588 • Published • 17 -
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
Paper • 2408.11039 • Published • 63
-
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Paper • 2407.11398 • Published • 10 -
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Paper • 2407.12781 • Published • 13 -
V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians
Paper • 2409.13648 • Published • 12 -
DressRecon: Freeform 4D Human Reconstruction from Monocular Video
Paper • 2409.20563 • Published • 9