openai

OpenAI: GPT-4o (2024-05-13)

GPT-4o (2024-05-13) is OpenAI's multimodal release, accepting text, images, and files as input within a 128,000-token context window. It supports tool use, which makes it usable in agentic pipelines, but it does not include a built-in reasoning mode. Maximum output is capped at 4,096 tokens per completion, so tasks requiring very long generated responses will hit that ceiling quickly. At $5.00 per million input tokens and $15.00 per million output tokens, it sits in the mid-to-upper pricing tier among comparable models. Its blended benchmark score of 20.4 covers only 2 benchmarks, including a coding-specific result of 40.0, so its measured performance profile is narrow and should be treated as preliminary rather than comprehensive. Teams doing multimodal work with tool integrations may find it a practical shortlist candidate, but buyers prioritizing cost efficiency or well-validated benchmark coverage should compare it carefully against alternatives before committing.

Query via API → View on openai → Estimate cost

Quality Score

86/100

price + capability + benchmarks

Input Price

$5.00

per 1M tokens

Output Price

$15.00

per 1M tokens

Context Window

128,000

tokens

Model ID: openai/gpt-4o-2024-05-13
Vendor: openai
Tokenizer: GPT
Input Modalities: text, image, file
Output Modalities: text
Max Output: 4,096 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: ✓ accepts images
Audio: no
Moderated: no

Similar models

openai

OpenAI: GPT-4o (2024-05-13)

Similar models

OpenAI: GPT-5.5 Pro

OpenAI: GPT-5.4 Pro

OpenAI: GPT-5.2 Pro

OpenAI: o3 Deep Research

OpenAI: GPT-5 Pro

OpenAI: o3 Pro