This model has been deprecated. Please transition to the production model google/gemini-2.5-flash

Google: Gemini 2.5 Flash Preview 04-17

google/gemini-2.5-flash-preview

Created Apr 17, 20251,048,576 context

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling.

Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens.

To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing.

Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).

Recent activity on Gemini 2.5 Flash Preview 04-17

Total usage per day on OpenRouter

May 31Jun 3Jun 6Jun 9Jun 12Jun 15Jun 18Jun 21Jun 24Jun 27Jun 30Jul 3Jul 6Jul 9Jul 12Jul 157.5B15B22.5B30B

More models from Google

    Google: Gemini 2.5 Flash Preview 04-17 – Recent Activity | OpenRouter