World Simulation with Video Foundation Models for Physical AI Paper • 2511.00062 • Published Oct 28 • 40
Bridging Supervised Learning and Reinforcement Learning in Math Reasoning Paper • 2505.18116 • Published May 23 • 4
Cosmos-Reason1 Collection Multimodal world understanding through reasoning • 8 items • Updated 2 days ago • 37
Article NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning May 19 • 26
Describe Anything Collection Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 2 days ago • 60
Describe Anything: Detailed Localized Image and Video Captioning Paper • 2504.16072 • Published Apr 22 • 63
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control Paper • 2503.14492 • Published Mar 18 • 20
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published Mar 18 • 50
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models Paper • 2411.07126 • Published Nov 11, 2024 • 30
Cosmos-Tokenizer Collection A suite of image and video tokenizers • 13 items • Updated 2 days ago • 42