google

Google: Gemini 2.5 Pro Preview 06-05

Gemini 2.5 Pro Preview 06-05 is a multimodal model from Google that accepts text, images, files, and audio as input. It supports tool use and reasoning, giving it a broader operational range than text-only alternatives. Its context window of 1,048,576 tokens is among the larger options available, making it worth considering for tasks that involve long documents or extended multi-turn sessions. Output is capped at 65,536 tokens per completion. At $1.25 per million input tokens and $10.00 per million output tokens, it sits in the mid-to-upper range on price. The critical caveat for comparison shoppers is that it currently has no independent benchmark coverage, so its quality relative to similarly priced competitors is unverified by third-party testing. Teams with high-volume, long-context, or multimodal workloads may find the feature set worth evaluating, but buyers who rely on benchmark data before committing should wait for independent results to emerge.

Quality Score
100/100
price + capability + benchmarks
Input Price
$1.25
per 1M tokens
Output Price
$10.00
per 1M tokens
Context Window
1,048,576
tokens
Model ID
google/gemini-2.5-pro-preview
Vendor
google
Tokenizer
Gemini
Input Modalities
file, image, text, audio
Output Modalities
text
Max Output
65,536 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
✓ supported
Vision
✓ accepts images
Audio
✓ accepts audio
Moderated
no

Category rankings

Where Google: Gemini 2.5 Pro Preview 06-05 places across the 3 categories it ranks in. How we rank →

#CategoryScore
#15 TranscriptionVoice · of 19 ranked 115
#15 Audio SummarizationVoice · of 19 ranked 139
#15 TTS ReplacementVoice · of 19 ranked 115

Similar models