NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 coding
openrouter
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k
0.10- -0.40
NVIDIA: Nemotron 3 Nano 30B A3B chat
openrouter
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price262k
0.05- -0.20
NVIDIA: Nemotron 3 Nano 30B A3B (free) chat
openrouter
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price256k
-- --
NVIDIA: Nemotron 3 Nano Omni (free) reasoning
openrouter
NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price256k
-- --
NVIDIA: Nemotron 3 Super chat
openrouter
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price262k
0.09- -0.45
NVIDIA: Nemotron 3 Super (free) chat
openrouter
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price262k
-- --
NVIDIA: Nemotron Nano 12B 2 VL (free) reasoning
openrouter
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k
-- --
NVIDIA: Nemotron Nano 9B V2 reasoning
openrouter
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k
0.04- -0.16
NVIDIA: Nemotron Nano 9B V2 (free) reasoning
openrouter
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...
Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k
-- --