NVIDIA: Nemotron Nano 9B V2 reasoning

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

Capabilities

Context Window 131k tokens

Max Output 16k tokens

Inputs

Outputs

Pricing (per 1M tokens)

Input $0.04

Output $0.16

Cache Read -

Cache Write -

Supported Parameters

frequency_penaltyinclude_reasoninglogit_biasmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyresponse_formatseedstoptemperaturetool_choicetoolstop_ktop_p