Google releases Gemini 3.5 Flash – 4x faster than other leading models

ref · May 20, 2026, 1:16am

On May 19, Google unveiled the Gemini 3.5 series at Google I/O 2026; the initial model, 3.5 Flash, is now available worldwide. According to Google, it outperforms Gemini 3.1 Pro in agentic tasks and code generation. It has set new records across key benchmarks such as Terminal-Bench 2.1 (76.2%) and GDPval-AA (1,656 Elo), as well as in the multimodal reasoning test CharXiv Reasoning (84.2%). Its token output speed is four times faster than other leading models, while operational costs for long-term agentic tasks amount to less than half those of competitors. On the Artificial Analysis index, it occupies the upper-right quadrant labeled “cutting-edge intelligence × ultra-fast responsiveness.” The 3.5 Flash model is already integrated into the Gemini app, Google Search’s AI Mode, Google AI Studio, Android Studio, and the enterprise-focused Gemini Enterprise Agent Platform. Meanwhile, the larger flagship model, Gemini 3.5 Pro, is currently being used internally at Google and is slated for public release in June.

Alongside the launch of 3.5 Flash, Google introduced two new infrastructure tools. First is Google Antigravity, an agent-first development platform enabling parallel deployment of multiple sub-agents to handle large-scale, long-duration workflows. Google demonstrated how two collaborating agents managed to read an AlphaZero research paper and develop a playable game within just six hours. Second is Gemini Spark, a personal AI agent built on top of 3.5 Flash; it operates around the clock to perform digital tasks on users’ behalf. Access to Spark is currently limited to trusted testers, with a beta version scheduled to reach U.S.-based subscribers of Google AI Ultra next week. In practical applications, Shopify leverages parallel sub-agents to forecast growth trends among global merchants, Macquarie Bank utilizes the model to analyze documents spanning over 100 pages to expedite customer onboarding, Salesforce integrates it into its Agentforce ecosystem, and Ramp employs it for multimodal OCR-based invoice recognition. Google notes that the entire Gemini 3.5 series was developed under its Frontier Safety Framework, which includes dedicated explainability tools designed to scrutinize internal model reasoning prior to generating responses.

Google Blog

Topic	Replies	Views
谷歌 AI Mode 月活破 10 亿，搜索框 25 年最大升级常规 google , google-io , 搜索 , ai-mode , agent	3	May 20, 2026
Google Antigravity adds Gemini 3.5 Flash Low tier, cuts token usage by 45% for simple tasks 常规 antigravity , gemini , google , token-optimization	5	May 25, 2026
谷歌 I/O 发布 Gemini Omni，任意输入生成并对话编辑视频常规 ai , gemini , google , google-io , 视频生成	4	May 20, 2026
Google Antigravity 紧急将全付费档 Gemini 配额提升 3 倍并重置本周用量常规限速 , google , gemini , antigravity , ide	3	May 21, 2026
谷歌 Gemini CLI 将于 6 月 18 日停服，Antigravity CLI 今日发布常规 gemini , google , 开发者工具 , antigravity , cli	5	May 20, 2026

Google releases Gemini 3.5 Flash – 4x faster than other leading models

Related topics