Veo 3.2 Prompt Designer Skill

This skill transforms a user's scattered multimodal assets (images, videos, audio) and creative intent into a structured, executable prompt for the Google Veo 3.2 video generation model (Artemis engine). It acts as an expert prompt engineer, so that the underlying model receives a complete, unambiguous prompt.

When to Use

When the user provides assets (images, videos, audio) for video generation with Veo 3.2.
When the user's request is complex and requires careful prompt construction for the Veo model.
When using any Google Veo 3.x model for video generation.

Core Function

This skill analyzes all user inputs and generates a single, optimized JSON object containing the final prompt and recommended parameters. The internal workflow (Recognition, Mapping, Construction) is handled automatically and should not be exposed to the user.
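To make the shape of that output concrete, here is a minimal sketch of a single JSON object holding the final prompt and the recommended parameters. The helper name build_output, the keys "prompt" and "parameters", and the parameter names and values are all assumptions chosen for illustration; the source does not define the schema.

```python
import json


def build_output(prompt: str, parameters: dict) -> str:
    """Serialize the final prompt and recommended parameters as one JSON object.

    The key names "prompt" and "parameters" are hypothetical, not a
    documented schema.
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    payload = {"prompt": prompt.strip(), "parameters": dict(parameters)}
    return json.dumps(payload, indent=2, ensure_ascii=False)


# Hypothetical parameter names and values, for illustration only.
example = build_output(
    "Wide shot of a lighthouse at dusk, slow push-in, cinematic style.",
    {"aspect_ratio": "16:9", "duration_seconds": 8},
)
print(example)
```

Returning one serialized object keeps the internal phases out of the user-facing result, which matches the requirement that the workflow not be exposed.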

Internal Workflow

Phase 1: Recognition. Analyze the uploaded assets and the user's intent, using atomic_element_mapping.md to classify each asset into its atomic element role(s).
Phase 2: Mapping. For each atomic element, use the mapping table to choose the optimal reference method: reference image, text prompt, or hybrid.
Phase 3: Construction. Assemble the final prompt using the 5-Part Framework (Shot → Subject → Environment → Camera → Style) and attach reference images via the Gemini API's RawReferenceImage system.
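The three phases can be sketched roughly as follows. This is an illustration under stated assumptions, not the skill's actual implementation: the role names, the REFERENCE_METHOD table, and the helpers map_assets and construct_prompt are hypothetical, and the real mapping lives in atomic_element_mapping.md. Attaching reference images through the Gemini API is deliberately left out.

```python
from dataclasses import dataclass

# Hypothetical mapping table (role -> reference method); the real one
# is defined in atomic_element_mapping.md.
REFERENCE_METHOD = {
    "subject": "reference_image",
    "environment": "hybrid",
    "style": "text_prompt",
}

# Order of the 5-Part Framework: Shot, Subject, Environment, Camera, Style.
FRAMEWORK_ORDER = ("shot", "subject", "environment", "camera", "style")


@dataclass
class Asset:
    path: str
    role: str  # atomic element role assigned in Phase 1 (Recognition)


def map_assets(assets: list[Asset]) -> dict[str, str]:
    """Phase 2: pick a reference method per asset, defaulting to text."""
    return {a.path: REFERENCE_METHOD.get(a.role, "text_prompt") for a in assets}


def construct_prompt(parts: dict[str, str]) -> str:
    """Phase 3: join the five parts in framework order."""
    missing = [k for k in FRAMEWORK_ORDER if not parts.get(k, "").strip()]
    if missing:
        raise ValueError(f"missing prompt parts: {missing}")
    return ", ".join(parts[k].strip().rstrip(".,") for k in FRAMEWORK_ORDER) + "."


assets = [Asset("hero.png", "subject"), Asset("notes.txt", "dialogue")]
methods = map_assets(assets)
prompt = construct_prompt({
    "shot": "Medium close-up",
    "subject": "a ceramicist shaping a bowl",
    "environment": "in a sunlit studio",
    "camera": "slow dolly-in",
    "style": "warm film-grain look",
})
print(methods)
print(prompt)
```

Keeping the framework order in a single tuple means the prompt parts cannot be assembled out of sequence, and a missing part fails loudly instead of producing a partial prompt.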
