Light-X: Generative 4D Video Rendering with Camera and Illumination Control Paper • 2512.05115 • Published 2 days ago • 5
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe Paper • 2511.16334 • Published 17 days ago • 91
Scaling Spatial Intelligence with Multimodal Foundation Models Paper • 2511.13719 • Published 19 days ago • 44
PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image Paper • 2511.13648 • Published 19 days ago • 52
Simulating the Visual World with Artificial Intelligence: A Roadmap Paper • 2511.08585 • Published 25 days ago • 29
The Quest for Generalizable Motion Generation: Data, Model, and Evaluation Paper • 2510.26794 • Published Oct 30 • 26
From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors Paper • 2510.17439 • Published Oct 20 • 26
IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction Paper • 2510.22706 • Published Oct 26 • 39