Product explainer
Clear, confident voice for feature walkthroughs and onboarding flows.
Prompting Guide · Gemini TTS
Build director‑style prompts for Gemini TTS that control not just what is said, but how it's delivered. Inspired by Google's official prompting guide, adapted for modern SaaS teams.
Gemini Native Audio TTS isn't a simple text reader. It uses a large language model that understands scenes, characters and performance notes.
Think of yourself as a director: define the audio profile, the scene, and the director's notes so the virtual voice talent knows exactly how to perform your script.
Define the director notes once, then reuse the generated prompt in your Gemini TTS API calls. This mirrors Google's recommended structure: audio profile, scene, director's notes and transcript.
Higher quality favors clean, broadcast‑ready reads over wild variation.
More creativity lets the model improvise pacing and emphasis while staying on‑brief.
# AUDIO PROFILE Style: Warm conversational with a excited emotional tone. ## SCENE The voice is recorded in a soft studio environment, as if speaking directly to a modern SaaS audience. ### DIRECTOR'S NOTES Quality: Aim for a production‑ready read around 85% polish (clean, broadcast‑ready audio). Creativity: Around 60% creative freedom – enough to feel human, but still on‑brief. Keep consonants clear, avoid harsh sibilance, and land confidently on the final call‑to‑action. #### TRANSCRIPT Generate a short launch message for our new Gemini TTS powered voice feature.
Start from production‑ready patterns inspired by Google's Gemini TTS prompting guide, then adjust the persona, accent and pacing for your product.
Clear, confident voice for feature walkthroughs and onboarding flows.
Looser, social‑first read for YouTube shorts, TikTok and promo clips.
Relaxed, intimate host voice ideal for intros, ads and narration.
01
Pick a style, mood and environment. This becomes your audio profile and scene, just like Google’s guide suggests.
02
Dial in pacing, quality and creativity to control how Gemini TTS should sound, not just what it should say.
03
Paste the generated prompt into your GenerateContent request along with the transcript your app produces.
Compare a vague prompt with a structured, director‑style prompt adapted from the Gemini TTS documentation. The right side is what you'll actually send from Gemini TTS.
Gemini TTS, please read this feature list for our product launch in an engaging way.
Read this like a high‑energy SaaS launch announcer for a global audience. Keep a bright vocal smile, crisp consonants and zero dead air. Land confidently on the final call‑to‑action: {{script}}
Generate consistent, on‑brand voiceovers for launch videos, feature tours and site hero sections.
Turn help center articles into friendly, guided audio walk‑throughs for new users.
Give creators one‑click presets for UGC, sponsorship reads and podcast bumpers, all powered by Gemini TTS.
Reuse the same Audio Profile while swapping script language and accent instructions per market.