NVIDIA: Nemotron 3 Ultra (free) reasoning
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
Capabilities
Context Window 1M tokens
Max Output 65k tokens
Inputs
Outputs
Pricing (per 1M tokens)
Input $-
Output $-
Cache Read -
Cache Write -
Supported Parameters
include_reasoningmax_tokensreasoningseedtemperaturetool_choicetoolstop_p