NIM for Cosmos WFM

Type: Microservice Tags: NVIDIA, NIM, Cosmos, world foundation models, physical AI, video generation, synthetic data, robotics, autonomous vehicles Related: NVIDIA-Cosmos, NIM-for-Cosmos-Reason, NIM-for-Cosmos-Embed1, NIM-for-Vision-Language-Models, NIM-for-Visual-Generative-AI, NVIDIA-NIM, NVIDIA-Cosmos-Curator-LHA, NVIDIA-Omniverse, NVIDIA-Isaac, NVIDIA-Isaac-Sim, NVIDIA-Isaac-GR00T, NVIDIA-DRIVE-Sim, NVIDIA-NGC-Catalog, TensorRT, TensorRT-LLM, Triton-Inference-Server Sources: https://docs.nvidia.com/nim/cosmos/latest/introduction.html, https://docs.nvidia.com/nim/cosmos/latest/index.html, https://docs.nvidia.com/nim/cosmos/latest/support-matrix.html Last Updated: 2026-04-29

Summary

NVIDIA NIM for Cosmos WFM is the self-hosted inference microservice surface for NVIDIA Cosmos world foundation models. Current NVIDIA docs describe it as covering Cosmos-Predict1 and Cosmos-Transfer2.5, with text-to-world, image/video-to-world, and video-to-video transfer workflows for physical AI data generation.

Detail

Purpose

Cosmos WFM NIM gives IT and DevOps teams a production deployment path for Cosmos models while giving developers standard API access to advanced physical AI generation workflows. It sits between the broader NVIDIA-Cosmos model/platform stack and production NIM operations.

Current scope

  • Cosmos-Predict1-7B Text2World for generating world/video output from text prompts.
  • Cosmos-Predict1-7B Video2World for image-to-world and video-to-world generation.
  • Cosmos-Transfer2.5-2B for controlled video-to-video transfer with edge, depth, visual/blur, and segmentation guidance.
  • Triton-backed model serving, health checks, metadata, observability, and REST/gRPC access.
  • NGC container distribution, model download/cache behavior, and container security scan reports.
  • Guardrail pipeline components for text, video, and face-blur safety handling.

NVIDIA context

Cosmos WFM NIM is the deployable microservice counterpart to NVIDIA-Cosmos. It connects world model generation to NVIDIA-Omniverse, NVIDIA-Isaac, and NVIDIA-DRIVE-Sim workflows that need synthetic video, scenario generation, post-training data, and physical AI validation.

Connections

Source Excerpts

  • Current NVIDIA docs say NIM for Cosmos WFM includes Cosmos-Predict1 and Cosmos-Transfer2.5.
  • The docs describe Cosmos WFM NIM as a self-hosting path with standard APIs, NGC containers, Triton serving, and safety guardrails.

Resources