Google launches Gemini Omni Flash video model that lets users create and edit videos through natural language conversation

by | May 20, 2026 | Latest E-commerce News & Updates

Google announced Gemini Omni, a new family of multimodal AI models starting with Gemini Omni Flash that can create high-quality videos from any combination of image, audio, video, and text inputs, with users able to edit videos through natural language conversation while characters, physics, and scene context remain consistent across edits. The model includes features like accurate physics simulation, the ability to blend multiple reference inputs into a single cohesive output, and Avatars (a digital version of the user that can be used to generate videos featuring their own voice), with all videos including Google's imperceptible SynthID digital watermark for content transparency. Gemini Omni Flash is rolling out today to all Google AI Plus, Pro, and Ultra subscribers globally through the Gemini app and Google Flow, and rolling out at no cost to users on YouTube Shorts and YouTube Create App starting this week, with future plans to support image and audio output modalities and to extend the model to developers and enterprise customers via APIs.

Paul Drecksler is the founder and editor of Shopifreaks, covering the most important stories in e-commerce.

Companies: Google

Never miss important e-commerce news

Our weekly newsletter is read religiously by 20,000+ e-commerce professionals.

Loading...