Google leans on full-stack AI edge, says Cloud customers could save $1B+ a year by shifting 80% of workloads to Gemini 3.5 Flash

May 29, 2026 | Latest E-commerce News & Updates

Google is pitching cost and speed as its competitive edge in AI. CEO Sundar Pichai said top Google Cloud customers could save more than $1B a year by shifting 80% of their AI workloads to a mix of Gemini 3.5 Flash and other frontier models. Pichai said companies are “already blowing through their annual token budgets and it's only May,” as monthly usage of Google's AI products has jumped sevenfold since last year, to 3.2 quadrillion tokens.

Google's edge comes from owning the full stack: chips, data centers, cloud, models, and applications. William Blair analysts estimated earlier this month that Google pays around 50% less, and possibly up to 75% less, than rivals for internal AI compute because it uses its own TPU chips. OpenAI, by contrast, pays Microsoft, Oracle, and other cloud giants a margin on every ChatGPT and Codex request.

Paul Drecksler is the founder and editor of Shopifreaks, covering the most important stories in e-commerce.

Companies: Google
