GPT Audio coding

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

2.50- -10.00

GPT Audio Mini coding

A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Input is priced at $0.60 per million...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

0.60- -2.40

GPT Chat Latest chat

GPT Chat Latest points to OpenAI's stable API alias `chat-latest` that always resolves to the latest Instant chat model used in ChatGPT. As OpenAI rolls out new Instant model updates...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

5.000.50 -30.00

GPT-3.5 Turbo coding

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price16k

0.50- -1.50

GPT-3.5 Turbo (older v0613) coding

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price4k

1.00- -2.00

GPT-3.5 Turbo 16k chat

This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price16k

3.00- -4.00

GPT-3.5 Turbo Instruct chat

This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Sep 2021.

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price4k

1.50- -2.00

GPT-4 reasoning

OpenAI's flagship model, GPT-4 is a large-scale multimodal language model capable of solving difficult problems with greater accuracy than previous models due to its broader general knowledge and advanced reasoning...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price8k

30.00- -60.00

GPT-4 Turbo chat

The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

10.00- -30.00

GPT-4 Turbo Preview chat

The preview GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Dec 2023. Note: heavily rate limited by OpenAI while...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

10.00- -30.00

GPT-4.1 reasoning

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price1M

2.000.50 -8.00

GPT-4.1 Mini chat

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price1M

0.400.10 -1.60

GPT-4.1 Nano chat

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price1M

0.100.02 -0.40

GPT-4o chat

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

2.50- -10.00

GPT-4o (2024-05-13) chat

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

5.00- -15.00

GPT-4o (2024-08-06) chat

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more here. GPT-4o ("o" for "omni") is...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

2.501.25 -10.00

GPT-4o (2024-11-20) chat

The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural, engaging, and tailored writing to improve relevance & readability. It’s also better at working with uploaded...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

2.501.25 -10.00

GPT-4o Search Preview chat

GPT-4o Search Previewis a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

2.50- -10.00

GPT-4o-mini chat

GPT-4o mini is OpenAI's newest model after GPT-4 Omni, supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

0.150.07 -0.60

GPT-4o-mini (2024-07-18) chat

GPT-4o mini is OpenAI's newest model after GPT-4 Omni, supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

0.150.07 -0.60

GPT-4o-mini Search Preview chat

GPT-4o mini Search Preview is a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

0.15- -0.60

GPT-5 coding

GPT-5 is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

1.250.12 -10.00

GPT-5 Chat chat

GPT-5 Chat is designed for advanced, natural, multimodal, and context-aware conversations for enterprise applications.

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

1.250.12 -10.00

GPT-5 Codex coding

GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks....

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

1.250.12 -10.00

GPT-5 Image coding

GPT-5 Image combines OpenAI's GPT-5 model with state-of-the-art image generation capabilities. It offers major improvements in reasoning, code quality, and user experience while incorporating GPT Image 1's superior instruction following,...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

10.001.25 -10.00

GPT-5 Image Mini chat

GPT-5 Image Mini combines OpenAI's advanced language capabilities, powered by GPT-5 Mini, with GPT Image 1 Mini for efficient image generation. This natively multimodal model features superior instruction following, text...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

2.500.25 -2.00

GPT-5 Mini reasoning

GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning tasks. It provides the same instruction-following and safety-tuning benefits as GPT-5, but with reduced latency and cost....

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

0.250.02 -2.00

GPT-5 Nano reasoning

GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

0.050.01 -0.40

GPT-5 Pro coding

GPT-5 Pro is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

15.00- -120.00

GPT-5.1 reasoning

GPT-5.1 is the latest frontier-grade model in the GPT-5 series, offering stronger general-purpose reasoning, improved instruction adherence, and a more natural conversational style compared to GPT-5. It uses adaptive reasoning...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

1.250.13 -10.00

GPT-5.1 Chat reasoning

GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

1.250.13 -10.00

GPT-5.1-Codex coding

GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks....

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

1.250.13 -10.00

GPT-5.1-Codex-Max coding

GPT-5.1-Codex-Max is OpenAI’s latest agentic coding model, designed for long-running, high-context software development tasks. It is based on an updated version of the 5.1 reasoning stack and trained on agentic...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

1.250.12 -10.00

GPT-5.1-Codex-Mini coding

GPT-5.1-Codex-Mini is a smaller and faster version of GPT-5.1-Codex

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

0.250.02 -2.00

GPT-5.2 reasoning

GPT-5.2 is the latest frontier-grade model in the GPT-5 series, offering stronger agentic and long context perfomance compared to GPT-5.1. It uses adaptive reasoning to allocate computation dynamically, responding quickly...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

1.750.17 -14.00

GPT-5.2 Chat reasoning

GPT-5.2 Chat (AKA Instant) is the fast, lightweight member of the 5.2 family, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

1.750.17 -14.00

GPT-5.2 Pro reasoning

GPT-5.2 Pro is OpenAI’s most advanced model, offering major improvements in agentic coding and long context performance over GPT-5 Pro. It is optimized for complex tasks that require step-by-step reasoning,...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

21.00- -168.00

GPT-5.2-Codex coding

GPT-5.2-Codex is an upgraded version of GPT-5.1-Codex optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks....

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

1.750.17 -14.00

GPT-5.3 Chat chat

GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations smoother, more useful, and more directly helpful. It delivers more accurate answers with better contextualization and significantly...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price128k

1.750.17 -14.00

GPT-5.3-Codex coding

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It achieves state-of-the-art results...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

1.750.17 -14.00

GPT-5.4 coding

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price1M

2.500.25 -15.00

GPT-5.4 Image 2 reasoning

GPT-5.4 Image 2 combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2. It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price272k

8.002.00 -15.00

GPT-5.4 Mini reasoning

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

0.750.07 -4.50

GPT-5.4 Nano chat

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price400k

0.200.02 -1.25

GPT-5.4 Pro reasoning

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price1M

30.00- -180.00

GPT-5.5 reasoning

GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It features a 1M+ token...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price1M

5.000.50 -30.00

GPT-5.5 Pro reasoning

GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a 1M+ token context window (922K input, 128K output) with support for...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price1M

30.00- -180.00

gpt-oss-120b reasoning

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k

0.04- -0.18

gpt-oss-120b (free) reasoning

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k

-- --

gpt-oss-20b chat

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k

0.03- -0.14

gpt-oss-20b (free) chat

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k

-- --

gpt-oss-safeguard-20b reasoning

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price131k

0.070.04 -0.30

o1 reasoning

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price200k

15.007.50 -60.00

o1-pro reasoning

The o1 series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o1-pro model uses more compute to think harder and provide...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price200k

150.00- -600.00

o3 reasoning

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following....

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price200k

2.000.50 -8.00

o3 Deep Research chat

o3-deep-research is OpenAI's advanced model for deep research, designed to tackle complex, multi-step research tasks. Note: This model always uses the 'web_search' tool which adds additional cost.

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price200k

10.002.50 -40.00

o3 Mini reasoning

OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. This model supports the `reasoning_effort` parameter, which can be set to...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price200k

1.100.55 -4.40

o3 Mini High reasoning

OpenAI o3-mini-high is the same model as o3-mini with reasoning_effort set to high. o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price200k

1.100.55 -4.40

o3 Pro reasoning

The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o3-pro model uses more compute to think harder and provide consistently...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price200k

20.00- -80.00

o4 Mini reasoning

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price200k

1.100.28 -4.40

o4 Mini Deep Research chat

o4-mini-deep-research is OpenAI's faster, more affordable deep research model—ideal for tackling complex, multi-step research tasks. Note: This model always uses the 'web_search' tool which adds additional cost.

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price200k

2.000.50 -8.00

o4 Mini High reasoning

OpenAI o4-mini-high is the same model as o4-mini with reasoning_effort set to high. OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining...

Context Inputs Outputs Input Price Cache Read Price Cache Write Price Output Price200k

1.100.28 -4.40