openai

OpenAI: GPT-4o (2024-11-20)

GPT-4o (2024-11-20) is a multimodal model from OpenAI that accepts text, images, and files as input, with a 128,000-token context window and a maximum of 16,384 output tokens per completion. It supports tool use, making it suitable for agentic workflows and function-calling tasks. Reasoning mode and structured output are not confirmed in the available specifications. At $2.50 per million input tokens and $10.00 per million output tokens, this sits in the mid-to-upper tier of general-purpose model pricing. Its blended benchmark score of 17.3 is drawn from only one benchmark (aider_polyglot at 18.2), so broad performance comparisons should be treated with caution. Teams that need reliable multimodal input handling and tool integration, and are already within the OpenAI ecosystem, are the most natural fit; buyers prioritizing cost efficiency or well-rounded benchmark coverage may want to weigh alternatives before committing.

Query via API → View on openai → Estimate cost

Quality Score

89/100

price + capability + benchmarks

Input Price

$2.50

per 1M tokens

Output Price

$10.00

per 1M tokens

Context Window

128,000

tokens

Model ID: openai/gpt-4o-2024-11-20
Vendor: openai
Tokenizer: GPT
Input Modalities: text, image, file
Output Modalities: text
Max Output: 16,384 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: ✓ accepts images
Audio: no
Moderated: yes

Similar models

openai

OpenAI: GPT-4o (2024-11-20)

Similar models

OpenAI: GPT-4o (2024-08-06)

OpenAI: GPT-4o

OpenAI: GPT-5 Image

OpenAI: GPT Audio Mini

OpenAI: GPT-5.1 Chat

OpenAI: GPT-5.4 Image 2