OpenAI: GPT-4o-mini (2024-07-18)
GPT-4o-mini is a multimodal model from OpenAI that accepts text, images, and files as input. It supports a 128,000-token context window and can return up to 16,384 tokens per response. Tool use is supported, which makes it viable for agentic workflows, though the model does not include a dedicated reasoning mode and structured output support is unconfirmed based on available data. At $0.15 per million input tokens and $0.60 per million output tokens, this sits at the budget end of capable multimodal models, making it a reasonable candidate for high-volume applications where cost control matters. The tradeoff is that independent benchmark coverage is essentially absent, with only a single aider_polyglot score of 3.6 on record, so performance relative to competing models at this price tier remains unproven. Buyers who need confirmed quality benchmarks before committing should treat this as a gap worth investigating.
- Model ID
- openai/gpt-4o-mini-2024-07-18
- Vendor
- openai
- Tokenizer
- GPT
- Input Modalities
- text, image, file
- Output Modalities
- text
- Max Output
- 16,384 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- not supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- yes