0
Google I/O 2026
FIG. 028σ 90
FIELD REPORT · AI

Google I/O 2026

What actually shipped, what nobody covered, and what it means

Isaiah Steinfeld
Listen to this article
0:00/0:00

THE SCALE NUMBERS

3.2Q
Tokens / month
(up from 9.7T two years ago)
19B
Tokens / minute
via Gemini API
900M
Gemini app MAU
(doubled in 12 months)
~$185B
2026 Capex
(6x 2022 spend)
8.5M
Developers building
on Google models monthly
2.5B
AI Overviews
users

These aren’t vanity metrics. They describe the throughput floor of an infrastructure layer that everything else at I/O was built on top of.

MODELS: FLASH GETS FAST, OMNI GETS PHYSICAL

Gemini 3.5 Flash is live as of May 19. It outperforms 3.1 Pro across nearly all benchmarks, with a notable jump on GDPVal (economically valuable real-world tasks). It runs 4x faster on output tokens per second than comparable frontier models — 12x faster inside Antigravity specifically — at less than half the cost.

Cost MathCompanies processing 1 trillion tokens/day could save over $1B annually by shifting 80% of workloads to Flash.

Gemini 3.5 Pro is in internal testing, slated for June. Described as combining frontier intelligence with stronger agentic capabilities. No benchmarks shared.

Gemini Omni Flash is the more structurally interesting model. Demis Hassabis introduced it as capable of generating output in any modality from any input. The key distinction: it was specifically trained on physics simulation. Understanding of gravity, kinetic energy, and fluid dynamics is a stated training objective — not a feature bolted on afterward.

Omni Flash creates custom AI avatars that look and sound like you, live today in the Gemini app.
AGI Timeline — Stated From StageDemis Hassabis said directly that AGI is “just a few years away.” The most explicit timeline declaration Google has ever made in a keynote setting.

TPU 8: THE ARCHITECTURE SPLITS

For the first time, Google has divided TPU into two distinct chips. TPU 8t is optimized for training: JAX and Pathways now distribute training across multiple data centers simultaneously, scaling to over 1 million TPUs globally. TPU 8i is optimized purely for inference latency. Both deliver up to 2x better performance-per-watt versus prior generation.

Why This MattersThe split acknowledges that training and inference are fundamentally different workloads with different cost structures, and optimizing one chip for both is leaving performance on the table.

GEMINI SPARK: A PERSISTENT AGENT OUTSIDE YOUR SESSION

Spark is Google’s personal AI agent. The mechanics matter more than the pitch: it runs on dedicated virtual machines on Google Cloud, operates 24/7 without a laptop open, is powered by Gemini 3.5 Flash and the Antigravity harness for long-horizon background tasks, and is accessible via the Gemini app, email, and chat — meaning Spark is an addressable agent endpoint, not just a UI element.

Users will be able to create custom sub-agents, composing their own agent pipelines inside Spark. Payment authorization is coming — specify a budget and merchant list and Spark executes purchases autonomously. Chrome integration as an agentic browser arrives this summer. Android Halo (persistent status bar showing live agent activity) comes later this year.

Beta is live now for AI Ultra US subscribers, rolling out to Gmail, Chat, then Chrome.

SEARCH: THE QUIET TRANSFORMATION

The new search box accepts text, images, files, video, audio, PDFs, and your entire Chrome tab as input. It adapts contextually. Rolling out today in all countries and languages where AI Mode is available.

AI Overviews and AI Mode are merging into one unified surface. The underlying architecture is being consolidated so a single model handles both.

Information Agents are persistent background watchers for topics you define: stock changes, sneaker drops, website changes, job listings. They send alerts. Rolling out this summer.

Generative UI dynamically builds layouts, tables, and interactive widgets on the fly inside search results. Free for everyone.

Search will call businesses on your behalf for select service categories — home repair, beauty, pet care — to get information, pricing, or book appointments. Duplex-level behavior built natively into Search. Rolling out to everyone in the US this summer.

Personal Intelligence in AI Mode is expanding to nearly 200 countries across 98 languages — free, no subscription required.

UNIVERSAL CART AND THE TWO PROTOCOLS NOBODY DISTINGUISHED

The Shopping blog post describes two architecturally distinct systems that every summary conflated.

UCP vs AP2

Universal Commerce Protocol (UCP) is a shared checkout language across merchants — it allows the cart to read product data, pricing, inventory, and compatibility from any participating retailer.

Agent Payments Protocol (AP2) is separate: a protocol for agent-executed purchases using privacy-preserving technology, creating a tamper-proof mandate with a permanent digital paper trail. AP2 hits Gemini Spark first; UCP is the merchant-side protocol.

The cart is built on Google Wallet — Wallet’s payment intelligence (perks, loyalty, merchant offers) is the cart’s decision-making layer. This appeared in no coverage.

Launching US this summer. Compatible retailers at launch: Target, Walmart, Sephora, Wayfair, Nike, and Shopify merchants.

CHROME: THE 15 UPDATES THAT MATTER

The Chrome changes are more architecturally significant than anything in the consumer keynote.

FeatureWhat It DoesStatus
WebMCPOpen web standard exposing JS functions as machine-readable tools for browser agentsChrome 149 origin trial
Modern Web GuidanceExpert-vetted agent skills covering 100+ web dev use casesLive
DevTools for AgentsConsole logs, network traffic, accessibility trees for 20+ coding toolsLive today
HTML-in-Canvas APIReal DOM elements in WebGL/WebGPU canvasOrigin trial
Prompt APIStable multimodal inputs, structured output, reliable JSONChrome 148 stable
Gemma 197MUltra-efficient in-browser model shared across tabs on low-end devicesLive
Soft Navigations APICore Web Vitals for SPAs (Next.js, React, Nuxt)Live
Skills in ChromeSave and reuse multi-tab AI workflows as one-click macrosComing

GEMINI FOR SCIENCE: PEER-REVIEWED, NOT JUST DEMOED

Three experimental tools live in Google Labs: Hypothesis Generation (Co-Scientist), Computational Discovery (AlphaEvolve + ERA), and Literature Insights (NotebookLM).

Named enterprise production partnerships: BASF (AlphaEvolve for supply chain), Klarna (AlphaEvolve enhancing ML models), Daiichi Sankyo and Bayer Crop Science (Co-Scientist), and U.S. DOE National Labs under the Genesis Mission.

ERA and Co-Scientist research papers were published in Nature on May 19 — the same day as I/O. Peer-reviewed validation, not just a demo.

WHAT TO WATCH

01
Agents Are Becoming Addressable Endpoints
Spark lives on a VM, accepts email and chat, runs 24/7, and will soon compose sub-agents. This is not “AI assistant in an app.” This is infrastructure.
02
Commerce Is Getting an Agent Protocol Layer
AP2 creates tamper-proof purchase mandates. UCP standardizes merchant data. Wallet’s intelligence is the cart. These are the primitives of agent-mediated commerce.
03
The Browser Is Becoming an Agent Platform
WebMCP, DevTools for Agents, Skills in Chrome, Gemma 197M shared across tabs — Chrome is being rebuilt as the surface where agents operate.
04
Science Tools Shipped With Peer Review
Two Nature papers on the same day as a keynote. A credibility move that matters more for long-term positioning than any consumer demo.
05
The Coverage Gap Is the Signal
The volume of things that received zero press coverage suggests the gap between what Google shipped and what the ecosystem is equipped to parse is widening. That gap is where the signal lives.