Google
Google

Veo 3.1

2026-01-01

Veo 3.1 is Google's state-of-the-art video generation model, producing high-fidelity 8-second clips at up to 4K resolution with natively generated audio — dialogue, sound effects, and background music synchronized to the visuals. It supports text-to-video, image-to-video with up to three reference images, portrait and landscape framing (9:16, 16:9), frame-specific generation, and scene extension for creating longer videos exceeding one minute. The model offers greater narrative control with improved understanding of cinematic styles and richer native audio.

Google AI UltraAPI|Vision|Proprietary Model
Knowledge Cutoff
2026-01-01
Input → Output Format
Context Memory
N/A
API Cost
$0.4 ~ $0.6/ sec
Calculate Cost