openai

OpenAI: GPT-4o (2024-05-13)

GPT-4o (2024-05-13) is OpenAI's multimodal release, accepting text, images, and files as input within a 128,000-token context window. It supports tool use, which makes it usable in agentic pipelines, but it does not include a built-in reasoning mode. Maximum output is capped at 4,096 tokens per completion, so tasks requiring very long generated responses will hit that ceiling quickly. At $5.00 per million input tokens and $15.00 per million output tokens, it sits in the mid-to-upper pricing tier among comparable models. Its blended benchmark score of 20.4 covers only 2 benchmarks, including a coding-specific result of 40.0, so its measured performance profile is narrow and should be treated as preliminary rather than comprehensive. Teams doing multimodal work with tool integrations may find it a practical shortlist candidate, but buyers prioritizing cost efficiency or well-validated benchmark coverage should compare it carefully against alternatives before committing.

Quality Score
86/100
price + capability + benchmarks
Input Price
$5.00
per 1M tokens
Output Price
$15.00
per 1M tokens
Context Window
128,000
tokens
Model ID
openai/gpt-4o-2024-05-13
Vendor
openai
Tokenizer
GPT
Input Modalities
text, image, file
Output Modalities
text
Max Output
4,096 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
not supported
Vision
✓ accepts images
Audio
no
Moderated
no

Similar models