MAI-UI Technical Report: Real-World Centric Foundation GUI Agents Paper β’ 2512.22047 β’ Published 8 days ago β’ 25
InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion Paper β’ 2512.17504 β’ Published 15 days ago β’ 94
Cosmos-Reason1 Collection Multimodal world understanding through reasoning β’ 8 items β’ Updated 11 days ago β’ 38