openai

OpenAI: GPT-4o-mini (2024-07-18)

GPT-4o-mini is a multimodal model from OpenAI that accepts text, images, and files as input and returns text. It supports a 128,000-token context window and can return up to 16,384 tokens per response. Tool calling and structured output are both supported, which makes it viable for agentic workflows, though the model does not include a dedicated reasoning mode. At $0.15 per million input tokens and $0.60 per million output tokens, the model sits at the budget end of capable multimodal models, making it a reasonable candidate for high-volume applications where cost control matters. The tradeoff is that independent benchmark coverage is essentially absent: the only score on record is an aider_polyglot result of 3.6, so performance relative to competing models at this price tier remains unproven. Buyers who need confirmed quality benchmarks before committing should treat this as a gap worth investigating.
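To make the pricing concrete, the following minimal sketch turns the listed per-million-token rates into a per-request and per-batch cost estimate. The function name `estimate_cost` and the example token counts are illustrative assumptions, not part of any vendor tooling, and actual billing may differ (for example, through cached-input or batch discounts not listed here).

```python
# Listed prices from this page, in USD per 1,000,000 tokens.
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.60


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost for the given token counts.

    Hypothetical helper: it applies the listed rates linearly and
    ignores any discounts or surcharges.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Assumed workload: 1,000 input and 200 output tokens per request.
    per_request = estimate_cost(1_000, 200)
    print(f"per request: ${per_request:.6f}")
    print(f"per 1M requests: ${per_request * 1_000_000:,.2f}")
```

Because output tokens cost four times as much as input tokens at these rates, capping response length tends to matter more for the bill than trimming prompts of the same size.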


Quality Score: 95/100 (price + capability + benchmarks)
Input Price: $0.15 per 1M tokens
Output Price: $0.60 per 1M tokens
Context Window: 128,000 tokens

Model ID: openai/gpt-4o-mini-2024-07-18
Vendor: openai
Tokenizer: GPT
Input Modalities: text, image, file
Output Modalities: text
Max Output: 16,384 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: ✓ accepts images
Audio: no
Moderated: yes
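
The context-window and max-output limits listed above can be checked before a request is sent. The sketch below is a minimal example that assumes the 128,000-token window is shared between the prompt and the response, which is a common convention but is not stated on this page. The helper names `fits_budget` and `max_prompt_tokens` are hypothetical, and the token counts are assumed to come from whatever tokenizer the caller already uses.

```python
# Limits as listed on this page.
CONTEXT_WINDOW = 128_000  # tokens
MAX_OUTPUT = 16_384       # tokens per response


def fits_budget(prompt_tokens: int, requested_output: int) -> bool:
    """Check a request against the listed limits.

    Assumption: prompt and response tokens share the context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    within_output_cap = requested_output <= MAX_OUTPUT
    within_window = prompt_tokens + requested_output <= CONTEXT_WINDOW
    return within_output_cap and within_window


def max_prompt_tokens(requested_output: int) -> int:
    """Largest prompt that still leaves room for the requested output."""
    if not 0 <= requested_output <= MAX_OUTPUT:
        raise ValueError("requested output exceeds the listed maximum")
    return CONTEXT_WINDOW - requested_output


if __name__ == "__main__":
    print(fits_budget(100_000, 16_384))   # 116,384 total tokens
    print(fits_budget(120_000, 16_384))   # 136,384 total tokens
    print(max_prompt_tokens(MAX_OUTPUT))  # room left for the prompt
```

Under this assumption, reserving the full 16,384-token output leaves 111,616 tokens for the prompt, including any images or files after they are converted to tokens.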

Similar models

OpenAI: GPT-4o-mini

OpenAI: GPT-5.2-Codex

OpenAI: GPT-5 Image Mini

OpenAI: GPT-5.5

OpenAI: GPT-5.1-Codex-Max

OpenAI: GPT-5.1-Codex