Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action
Which summary reads better? Pick one — models revealed after.Both summaries are AI-generated.
NVIDIA has released Cosmos 3, an open-source unified artificial intelligence model designed for physical AI applications including robotics, autonomous vehicles, and smart spaces. Built on a Mixture-of-Transformers architecture, Cosmos 3 combines world generation, physical reasoning, and action generation into a single omni-model, eliminating the need to juggle separate models for different tasks. The model is now available on Hugging Face and can process multiple modalities—text, image, video, audio, and action—to simulate and understand the physical world.
NVIDIA has released Cosmos 3, described as the first open omni-model for physical AI, now available on Hugging Face. Built on a Mixture-of-Transformers architecture, it unifies world generation, physical reasoning, and action generation into a single model, processing text, image, video, audio, and action modalities in one forward pass. The model is aimed at applications such as robotics, autonomous vehicles, and smart spaces, enabling simulation and understanding of motion, causality, and physics.