GPT-4o Audio chat

openrouter
openai
The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input and $80 per million output audio tokens.

Capabilities

Context Window 128k tokens
Max Output 16k tokens
Inputs
Outputs

Pricing (per 1M tokens)

Input $2.50
Output $10.00
Cache Read -
Cache Write -

Supported Parameters

frequency_penaltylogit_biaslogprobsmax_tokenspresence_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_logprobstop_p