AI video just got another upgrade.
Veo 3.1 is now available inside Pencil - a major leap in quality, continuity, and control for creators who want cinematic precision without leaving their workflow.
Built on Google’s newest model, Veo 3.1 produces higher coherence, sharper motion, and stronger adherence to your prompts. It's perfect for brand storytelling, ads, or narrative concepts that rely on visual consistency.
If you want to start using Veo 3.1 in your ads today, get in touch with our team.
At its core, Veo 3.1 is about clarity and coherence. Each frame connects more naturally, delivering richer textures and more accurate movement.
It’s also now the most advanced video model available in Pencil: ideal for branded storytelling, product demos, and cinematic scenes.
Each generation costs 2 Generation Credits per second of video, keeping quality improvements simple and scalable across projects.
For the first time, creators can guide video generation using up to three reference images, giving you director-level control over characters, art direction, or layout consistency.
The feature is available now through the Video Generation Agent (just attach images as you would normally).
An easy in-app upload interface for the Video Generation Tool is also on the way.
Great for:
Veo 3.1 also expands the way you can connect clips together.
The ‘End Frame’ feature, which uses the final frame of one video as the starting point for the next, now works across Veo 2, Veo 3, and Veo 3.1.
That means smoother transitions, better story continuity, and the ability to create multi-shot scenes that feel cohesive from start to finish.
Perfect for step-by-step demos, product sequences, or short narrative ads.
This new model for AI video takes the leap from generation to direction.
The addition of Reference Images gives creative teams a way to lock in identity and style, while End Frame ensures those visuals stay consistent across scenes.
The result is cinematic quality paired with creative control — helping marketers, designers, and storytellers shape AI-generated content that looks deliberate, cohesive, and production-ready.