Google shared pre-release access for the new Gemini 2.5 Flash & Flash-Lite Preview 09-2025 models. We've independently benchmarked gains in intelligence (particularly for Flash-Lite), output speed and token efficiency compared to their predecessors.

Key takeaways from our intelligence and performance benchmarking:

➤ 🧠 Gemini 2.5 Flash Preview 09-2025 scores 54 in reasoning mode on the Artificial Analysis Intelligence Index and 47 in non-reasoning mode, representing 3-point and 8-point jumps respectively compared to Gemini 2.5 Flash released in May 2025

➤ 🧠 Gemini 2.5 Flash-Lite Preview 09-2025 scores 48 in reasoning mode on the Artificial Analysis Intelligence Index, representing an 8-point uplift compared to Gemini 2.5 Flash-Lite (Reasoning) released in June 2025. In non-reasoning mode, Gemini 2.5 Flash-Lite Preview 09-2025 scores 42, a 12-point uplift compared to the July version

➤ 💲 In reasoning mode, Gemini 2.5 Flash and Flash-Lite Preview 09-2025 are more token-efficient, using fewer output tokens than their predecessors to run the Artificial Analysis Intelligence Index. Gemini 2.5 Flash-Lite Preview 09-2025 uses 50% fewer output tokens than its predecessor, while Gemini 2.5 Flash Preview 09-2025 uses 24% fewer output tokens

➤ ⚡ Gemini 2.5 Flash-Lite Preview 09-2025 (Reasoning) is ~40% faster than the prior July release, delivering ~887 output tokens/s on Google AI Studio in our API endpoint performance benchmarking. This makes the new Gemini 2.5 Flash-Lite the fastest proprietary model we have benchmarked on the Artificial Analysis website

Key model information:

➤ Hybrid reasoning/non-reasoning modes with variable thinking budget

➤ Tool support (e.g. Google Search, code execution)

➤ 1M token context window

➤ Multimodal input (text, audio, image and video) and text-only output

➤ Gemini 2.5 Flash-Lite 09-2025 is priced at $0.10/$0.40 per 1M input/output tokens and Gemini 2.5 Flash 09-2025 is priced at $0.30/$2.50 per 1M input/output tokens
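
To make the per-1M-token pricing concrete, here is a minimal Python sketch that estimates the cost of a workload from the prices quoted in this post. The dictionary keys are illustrative labels rather than official API model IDs, and the `estimate_cost` helper and the token counts in the usage lines are hypothetical.

```python
# Prices in USD per 1M tokens (input, output), as quoted in the post.
# The keys are illustrative labels, not official API model identifiers.
PRICES_PER_1M = {
    "flash-lite-09-2025": (0.10, 0.40),
    "flash-09-2025": (0.30, 2.50),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    input_price, output_price = PRICES_PER_1M[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 2M input tokens and 500k output tokens.
for model in PRICES_PER_1M:
    print(model, round(estimate_cost(model, 2_000_000, 500_000), 2))
```

Because reasoning output is billed as output tokens, a model that uses fewer output tokens for the same task lowers the output term of this estimate proportionally, assuming the task and prices are otherwise unchanged.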

Sep 25, 2025 · 5:59 PM UTC
