Google shared pre-release access for the new Gemini 2.5 Flash & Flash-Lite Preview 09-2025 models. We’ve independently benchmarked gains in intelligence (particularly for Flash-Lite), output speed and token efficiency compared to predecessors
Key takeaways from our intelligence and performance benchmarking:
➤ 🧠 Gemini 2.5 Flash Preview 09-2025 scores 54 in reasoning mode on the Artificial Analysis Intelligence Index, and 47 in the non-reasoning mode, representing a 3 point and 8 point jump respectively compared to Gemini 2.5 Flash released in May 2025
➤ 🧠 Gemini 2.5 Flash-Lite Preview 09-2025 scores 48 in reasoning mode on the Artificial Analysis Intelligence Index, representing a 8 point uplift compared to Gemini 2.5 Flash-Lite (Reasoning) released in June 2025. In non-reasoning, Gemini 2.5 Flash-Lite Preview 09-2025 scores 42, a 12 point uplift compared to the July version.
➤💲 In reasoning mode, Gemini 2.5 Flash and Flash-Lite Preview 09-2025 are more token-efficient, using fewer output tokens than their predecessors to run the Artificial Analysis Intelligence Index. Gemini 2.5 Flash-Lite Preview 09-2025 uses 50% fewer output tokens than its predecessor, while Gemini 2.5 Flash Preview 09-2025 uses 24% fewer output tokens.
➤⚡ Google Gemini 2.5 Flash-Lite Preview 09-2025 (Reasoning) is ~40% faster than the prior July release, delivering ~887 output tokens/s on Google AI Studio in our API endpoint performance benchmarking. This makes the new Gemini 2.5 Flash-Lite the fastest proprietary model we have benchmarked on the Artificial Analysis website
Key model information:
➤ Hybrid reasoning/non-reasoning modes with variable thinking budget
➤ Tool support (e.g. Google Search, code execution)
➤ 1M token context window
➤ Multimodal input (text, audio, image and video) and text only output
➤ Gemini 2.5 Flash-Lite 09-2025 is priced at $0.1/$0.4 per 1M input/output tokens and Gemini 2.5 Flash 09-2025 is priced at $0.3/$2.5 per 1M input/output tokens
Sep 25, 2025 · 5:59 PM UTC
24
37
496

