Mercury Coder coding

openrouter
Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like Claude 3.5 Haiku and GPT-4o Mini while matching their performance. Mercury Coder's speed means that developers can stay in the flow while coding, enjoying rapid chat-based iteration and responsive code completion suggestions. On Copilot Arena, Mercury Coder ranks 1st in speed and ties for 2nd in quality. Read more in the blog post here.

Capabilities

Context Window 128k tokens
Max Output 32k tokens
Inputs
Outputs

Pricing (per 1M tokens)

Input $0.25
Output $0.75
Cache Read $0.02
Cache Write -

Supported Parameters

max_tokensresponse_formatstopstructured_outputstemperaturetool_choicetools