Video4Spatial: Towards Visuospatial Intelligence with Context-Guided Video Generation Paper • 2512.03040 • Published 4 days ago • 5
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds Paper • 2511.08892 • Published 25 days ago • 194
StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling Paper • 2507.05240 • Published Jul 7 • 47
SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training Paper • 2506.05301 • Published Jun 5 • 56
WORLDMEM: Long-term Consistent World Simulation with Memory Paper • 2504.12369 • Published Apr 16 • 35
Trajectory Attention for Fine-grained Video Motion Control Paper • 2411.19324 • Published Nov 28, 2024 • 13