Speaker Mode (Lip Sync)
Create videos where your Meedi character speaks the words with lip-synced animation.
1
Start a Video
Free
Setup Phase
Klaus
Ja! Was für ein kind of video are we making today?
Choose wisely, mein friend! 🎬
Action: Send "make a video", "animate meedi", or tap /animate in chat. Then tap Animation.
▶ Behind the Scenes
Klaus understands 21+ natural language triggers: "make a video", "create a video", "new video", "animate meedi", "meedi video", "let's make a video", "new animation", and more. No slash commands needed.
Cost: Free
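The trigger matching described above can be sketched roughly as follows. This is a minimal illustration, not Klaus's actual implementation: the name `is_video_trigger` and the normalization rules are assumptions, and only the seven phrases quoted above are included because the full list of 21+ is not documented here.

```python
import re

# Only the trigger phrases quoted in this guide; the full list is not documented here.
VIDEO_TRIGGERS = {
    "make a video",
    "create a video",
    "new video",
    "animate meedi",
    "meedi video",
    "let's make a video",
    "new animation",
}


def is_video_trigger(message: str) -> bool:
    """Return True if the message contains one of the known trigger phrases."""
    # Assumed normalization: lowercase, straighten curly apostrophes, collapse whitespace.
    text = message.lower().replace("\u2019", "'")
    text = re.sub(r"\s+", " ", text).strip()
    return any(phrase in text for phrase in VIDEO_TRIGGERS)
```

Substring matching keeps the sketch short; a real bot would likely use intent classification to avoid false positives.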
Step 1 of 16 — Setup Phase
2
Choose Speaker Mode
Free
Setup Phase
Klaus
Ja! Zeit for some animation magic! 🎨
Vhat style of video shall ve make?
Action: Tap Speaker. This is the lip-sync pipeline where Meedi speaks your words.
▶ Behind the Scenes
Speaker = Hedra Character-3 lip sync. The character's mouth moves to match the audio.
Narration = Kling animation only. Character reacts to voiceover without speaking.
Step 2 of 16 — Setup Phase
3
Pick a Format
Free
Setup Phase
Klaus
Ooh, Speaker mode! Das ist very fancy!
Now, what format shall zis masterpiece be? 📐
Action: Choose your aspect ratio. 9:16 for short-form vertical video, 16:9 for YouTube/presentations, 1:1 for Instagram feed.
▶ Behind the Scenes
| Format | Best for |
|---|---|
| 9:16 Portrait | TikTok, Reels, Shorts |
| 16:9 Landscape | YouTube, presentations |
| 1:1 Square | Instagram feed, LinkedIn |
Step 3 of 16 — Setup Phase
4
Choose Your Mascot
Free
Setup Phase
Klaus
Format: 9:16 Portrait
Model: Hedra Character-3
Who is ze star of zis show? 🎭
Action: Pick your character. Each mascot has a unique AI voice.
▶ Behind the Scenes
| Mascot | ID | Voice |
|---|---|---|
| Mr Meedi | `mr_meedi` | Meedi (ElevenLabs) |
| Mrs Meedi | `mrs_meedi` | Mrs.Meedi (ElevenLabs) |
Step 4 of 16 — Setup Phase
5
Pick a Scene Style
Free
Setup Phase
Klaus
💜 Mrs Meedi it is!
Real world or Pixar magic? 🎬
Action: Choose the visual rendering style for backgrounds.
▶ Behind the Scenes
Pixar 3D: Taxi Ride, The Office, Park Walk, Living Room, Busy Club, Urban Cafe, Teen Bedroom, School Hallway
Photorealistic: Driving a Car, Car UGC, On a Train, On a Plane, High Street
Step 5 of 16 — Setup Phase
6
Audio Source
<$0.01
Script Phase
Klaus
Scene style: Pixar 3D
How shall we create ze voiceover?
I can generate it, or you can bring your own! 🎤
Action: Choose how the voiceover is created. Generate = Klaus writes and voices it. Upload = your own audio file. URL = paste a link.
▶ Behind the Scenes
Generate uses ElevenLabs TTS (~$0.00003/char). Upload accepts MP3, WAV, M4A, OGG, FLAC, WEBM. URL supports direct audio links and YouTube URLs (extracted via yt-dlp).
Cost: ~$0.01 for 80 words (Generate). Free for Upload/URL.
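The per-character rate above makes the Generate cost easy to estimate. The following is a minimal sketch under that quoted rate; the function name `estimate_audio_cost` and the source labels `"generate"`, `"upload"`, and `"url"` are hypothetical, chosen only for illustration.

```python
TTS_USD_PER_CHAR = 0.00003  # approximate ElevenLabs rate quoted in this guide


def estimate_audio_cost(source: str, script: str = "") -> float:
    """Estimate the voiceover cost in USD for one of the three audio sources."""
    if source == "generate":
        # Billed per character of the script that is voiced.
        return len(script) * TTS_USD_PER_CHAR
    if source in ("upload", "url"):
        # Bring-your-own audio has no TTS charge.
        return 0.0
    raise ValueError(f"unknown audio source: {source!r}")
```

An 80-word script is roughly 480 characters including spaces, which works out to about $0.014, in line with the ~$0.01 figure above.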
Step 6 of 16 — Script Phase
7
Enter Your Topic or Script
Free
Script Phase
Klaus
Wunderbar! I vill write ze script and do ze voice!
Want me to make de nue musikvideo for 99 Luftballons in 2026? Or something else? 📝
Action: Type your topic (e.g. "Why breakfast is important") or paste a full script. Klaus handles both.
▶ Behind the Scenes
Accepts topics, full scripts, or rough drafts. Word count estimates: ~15s ≈ 40 words, ~30s ≈ 80 words, ~60s ≈ 150 words.
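The word count estimates above can be turned into a rough word budget for any target length. This sketch interpolates linearly between the three quoted anchor points; the interpolation itself and the name `words_for_duration` are assumptions, not how Klaus is known to compute it.

```python
# (seconds, words) anchor points quoted in this guide.
ANCHORS = [(15, 40), (30, 80), (60, 150)]


def words_for_duration(seconds: float) -> int:
    """Estimate how many words fit in a voiceover of the given length."""
    if seconds <= 0:
        return 0
    first_s, first_w = ANCHORS[0]
    last_s, last_w = ANCHORS[-1]
    if seconds <= first_s:
        # Below the first anchor, scale at the first anchor's speaking rate.
        return round(seconds * first_w / first_s)
    if seconds >= last_s:
        # Beyond the last anchor, scale at the last anchor's speaking rate.
        return round(seconds * last_w / last_s)
    for (s0, w0), (s1, w1) in zip(ANCHORS, ANCHORS[1:]):
        if s0 <= seconds <= s1:
            return round(w0 + (w1 - w0) * (seconds - s0) / (s1 - s0))
    return round(seconds * last_w / last_s)  # not reached for valid input
```

For example, a 45-second target falls between the 30s and 60s anchors and comes out at about 115 words.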
Step 7 of 16 — Script Phase
8
Script Mode
<$0.01
Script Phase
Klaus
Got it! (30 words)
How should I use this?
Action: "Use my words" keeps your exact text. "Generate from topic" lets Klaus write a fresh script based on your input.
▶ Behind the Scenes
"Generate from topic" uses Gemini Flash (~$0.001/call) to write a polished script from your topic or rough text.
Step 8 of 16 — Script Phase
9
Review the Script
<$0.01
Script Phase
Klaus
📝 Bold Pop subtitles!
━━━━━━━━━━
[MEEDI] Did you know a single makeover can change your whole vibe?...
━━━━━━━━━━
Duration: ~12s
Is zis ze vibe? 🤔
Action: Review your script and approve, or use the rewrite buttons to adjust tone, length, or edit directly.
▶ Behind the Scenes
Rewrites use Gemini Flash (~$0.001/call). Each rewrite option gives specific direction: "More Casual" = conversational tone, "Shorter" = condense, "Longer" = expand with examples.
Step 9 of 16 — Script Phase
10
Subtitle Style
Free
Script Phase
Klaus
What style for ze subtitles? 📝
Action: Choose how subtitles appear. Tap Preview All to see samples before committing.
▶ Behind the Scenes
| Style | Font | Size | Highlight Color | Notes |
|---|---|---|---|---|
| Bold Pop | Montserrat | 116pt | Yellow | 8px black outline |
| Hormozi | Montserrat | 108pt | Gold | 6px shadow, no outline |
| Boxed | Roboto | 92pt | Cyan | Semi-transparent black box |
| Minimal | Roboto | 72pt | Light gray | 2px black outline, 2px shadow |
| Neon Karaoke | Montserrat | 104pt | Neon magenta | 5px black outline, glow effect |
Step 10 of 16 — Script Phase
11
Scene Count
Free
Scenes Phase
Klaus
📊 Your video is 10.6 seconds long.
How many scenes?
Action: Choose how many scenes. Klaus shows the approximate duration per scene. Maximum 8 scenes.
▶ Behind the Scenes
More scenes = more visual variety but higher cost. Each lip-sync scene costs ~$0.17 (Hedra). B-Roll scenes are free. The available scene counts depend on your audio length.
Step 11 of 16 — Scenes Phase
12
Per-Scene Setup
Free
Scenes Phase
Klaus
Scene 1 of 3
Choose how this scene looks:
🎤 Lip Sync — avatar speaks with lip sync
🎬 Animation — animated clip from still image
📹 B-Roll — stock footage while audio plays
⏱ ~3s
"Did you know a single makeover can change your whole vibe?"
For each scene, you configure 4 things in sequence:
Step 1: Type
Step 2: Mood
Step 3: Background
Step 4: Transition (except last scene)
Action: Configure each scene's type, mood, background, and transition. An Edit Script button is available on every sub-step.
▶ Behind the Scenes
| Scene Type | Cost | Engine |
|---|---|---|
| Lip Sync | ~$0.17/scene | Hedra Character-3 |
| Animation | ~$0.21-0.28/scene | Kling v2.1/v2.5 |
| B-Roll | Free | Pexels stock footage |
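The table above is enough to estimate what a mix of scene types will cost before generating anything. A minimal sketch, assuming the approximate prices in the table; the keys `lip_sync`, `animation`, and `b_roll` and the function name are hypothetical labels for this example.

```python
# Approximate per-scene costs in USD from the table above, as (low, high) ranges.
SCENE_COST_USD = {
    "lip_sync": (0.17, 0.17),   # Hedra Character-3
    "animation": (0.21, 0.28),  # Kling v2.1/v2.5
    "b_roll": (0.0, 0.0),       # Pexels stock footage
}


def estimate_plan_cost(scene_types: list[str]) -> tuple[float, float]:
    """Return the (low, high) generation cost estimate for a list of scene types."""
    low = sum(SCENE_COST_USD[t][0] for t in scene_types)
    high = sum(SCENE_COST_USD[t][1] for t in scene_types)
    return round(low, 2), round(high, 2)
```

Three lip-sync scenes come to about $0.51, while swapping one of them for B-Roll drops the estimate to about $0.34. Regenerations are not included.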
Step 12 of 16 — Scenes Phase
13
Master Plan Review
Free
Scenes Phase
Klaus
Production Plan (3 scenes, 10.6s)
Scene 1 — Hook (3s) 🎥 Lip Sync
🎨 Taxi Ride
"Did you know a single makeover..."
↓ fade
---
Scene 2 — Buildup (3s) 🎥 Lip Sync
🎨 warm office with bookshelves
"I'm Mrs Meedi and Pamela..."
↓ fade
---
Scene 3 — Closer (3s) 🎥 Lip Sync
🎨 warm office with bookshelves
"This is just the beginning..."
Model: Hedra | Style: Fun & Colourful | Format: 9:16
Action: Review the complete production plan. In Speaker mode, this often auto-proceeds after scene setup.
Step 13 of 16 — Scenes Phase
14
Audio Generation
<$0.01
Scenes Phase
Klaus
Generating audio... 🎤
You're free to chat or start another video while I work!
Action: Klaus generates the TTS audio and analyzes timing. This takes ~10 seconds. You're free to chat during this.
▶ Behind the Scenes
ElevenLabs TTS generates the voiceover. Whisper transcribes it for subtitles. The audio is analyzed for timing to calculate scene durations.
Cost: ~$0.01 (TTS) + ~$0.003 (Whisper)
Step 14 of 16 — Scenes Phase
15
Generation + Scene Gates
$0.17+/scene
Generation Phase
Klaus
🎬 Starting animation pipeline...
⏳ Initializing...
Klaus
AI Director — Scene 1/3 [Character]
Emotion: curious
Direction: Mrs. Meedi leans forward slightly, gesturing expansively...
Technique: CUT
Klaus (after scene generates)
[Scene 1 video clip]
Action: Each scene generates in sequence. After each, you see the result and can Approve, Regenerate (costs another generation), or Cancel. Takes 30-90 seconds per scene.
▶ Behind the Scenes
For each scene: (1) AI Director plans emotion and camera direction, (2) Image generated from background preset, (3) Hedra lip-syncs the audio onto the character image, (4) You review the result.
The Cinematographer enforces environment consistency — all scenes use the same background environment as Scene 1 to maintain visual coherence.
Cost: ~$0.17/scene (Hedra lip sync). Regeneration costs another $0.17.
Step 15 of 16 — Generation Phase
16
Edit Menu
Free
Post-Production Phase
Klaus
Your video is ready! Here's the Edit Menu. All changes are free!
Action: Make free post-production edits (music, subtitles, volume, transitions, effects) or tap Done to name and save your video. See the Post-Production tab for details on each option.
▶ Behind the Scenes
ALL edit menu changes are free — they use FFmpeg locally with no API costs. The only paid action is "Change Scene" in the Scene Carousel, which regenerates a scene.
Step 16 of 16 — Post-Production Phase
Narration Mode
Create videos where your Meedi character reacts to a voiceover with AI-directed emotions and movement.
1
Start a Video (same as Speaker)
Free
Setup Phase
Action: Send "make a video" or "animate meedi" and tap Animation. Same as Speaker mode.
Step 1 of 11 — Setup Phase
2
Choose Narration Mode
Free
Setup Phase
Klaus
Vhat style of video shall ve make?
Action: Tap Narration. The character will react to audio rather than speak.
Step 2 of 11 — Setup Phase
3
Pick a Format (same as Speaker)
Free
Action: Choose 9:16, 16:9, or 1:1. Same options as Speaker mode.
Step 3 of 11 — Setup Phase
4
Choose Your Mascot (same as Speaker)
Free
Setup Phase
Klaus
Who is ze star of zis show?
Action: Pick your mascot. Same choice as Speaker mode.
▶ Behind the Scenes
Visual style (Fun & Colourful), image model (Gemini), and animation model (Kling v2.5 Turbo) are auto-selected — no picker shown. This keeps the narration flow streamlined.
Step 4 of 11 — Setup Phase
5
Audio Source (same as Speaker)
<$0.01
Action: Generate, Upload, or From URL. Same as Speaker mode.
Step 5 of 11 — Script Phase
6
Topic → Script Mode → Review (same as Speaker)
<$0.01
Action: Enter topic, choose script mode, review and approve. Same flow as Speaker mode steps 7-9.
Step 6 of 11 — Script Phase
7
Duration
Free
Scenes Phase
Klaus
How long should ze video be?
Action: Choose how long the video should be. This step is only shown for the "Generate from topic" path; uploaded audio uses its actual duration.
Step 7 of 11 — Scenes Phase
8
B-Roll Percentage
Free
Scenes Phase
Klaus
How much stock footage to mix in?
Action: Choose how much of the video uses B-Roll. B-Roll scenes use free Pexels footage, which reduces cost; 50-75% works well for budget-friendly videos. This step is only shown if the audio is longer than 10s.
Step 8 of 11 — Scenes Phase
9
Script Review / Make it!
Free
Scenes Phase
Klaus
Perfekt! Here is ze master plan:
Mascot: Mr Meedi
Style: Fun & Colourful
Image Model: Gemini
Format: 9:16 Portrait
Animation: Kling v2.5 Turbo
B-roll: 25%
Script: Ready!
Action: Review your configuration and script summary. Tap Make it! to start generation.
Step 9 of 11 — Scenes Phase
10
Generation + Scene Gates
$0.21+/scene
Generation Phase
Klaus
For each scene in sequence:
1. Image generation — you see the image and can approve/regenerate/stop
2. Kling animation — the image is animated
3. Video gate — you see the clip and can continue/stop
Action: Scene gates auto-approve after a timeout (30s for first image, 10-20s for subsequent). Tap Stop & Fix to intervene.
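The auto-approve behavior can be pictured as a wait with a timeout. The sketch below uses Python's `asyncio.wait_for`; it is an illustration only, since the guide does not say how the gates are implemented. The function name, the `"approve"` return value, and the choice of 20s for later gates (the guide gives a 10-20s range) are assumptions.

```python
import asyncio

FIRST_IMAGE_TIMEOUT_S = 30.0  # quoted in this guide
LATER_GATE_TIMEOUT_S = 20.0   # guide says 10-20s; the upper bound is assumed here


async def wait_for_gate(decision: asyncio.Future, timeout_s: float) -> str:
    """Wait for the user's decision; fall back to auto-approve on timeout."""
    try:
        return await asyncio.wait_for(decision, timeout=timeout_s)
    except asyncio.TimeoutError:
        # Nobody tapped a button in time, so the pipeline continues.
        return "approve"
```

A button handler would resolve the future with something like `"stop"` when you tap Stop & Fix; if nothing arrives before the timeout, the gate continues on its own.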
▶ Behind the Scenes
Cost per scene: Image (~$0.04 Gemini) + Kling v2.5 Turbo ($0.21) = ~$0.25 per scene. B-Roll scenes = free.
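Combining the per-scene figure above with a B-Roll percentage gives a quick budget estimate. This is a minimal sketch: how the app rounds the B-Roll share into a whole number of scenes is not documented, so the `round()` call and the function name `narration_cost` are assumptions.

```python
IMAGE_USD = 0.04      # Gemini image, approximate
ANIMATION_USD = 0.21  # Kling v2.5 Turbo


def narration_cost(total_scenes: int, b_roll_percent: float) -> tuple[int, int, float]:
    """Return (animated_scenes, b_roll_scenes, estimated_cost_usd)."""
    # Assumed rounding rule for turning a percentage into whole scenes.
    b_roll = round(total_scenes * b_roll_percent / 100)
    animated = total_scenes - b_roll
    # Only animated scenes pay for an image plus a Kling animation.
    cost = round(animated * (IMAGE_USD + ANIMATION_USD), 2)
    return animated, b_roll, cost
```

With 6 scenes at 33% B-Roll, this gives 4 animated scenes, 2 B-Roll scenes, and roughly $1.00 in generation cost, before script and TTS charges.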
Step 10 of 11 — Generation Phase
11
Edit Menu (same as Speaker)
Free
Action: Same Edit Menu as Speaker mode. All changes free (FFmpeg). See the Post-Production tab for full details.
Step 11 of 11 — Post-Production Phase
?
Speaker vs Narration — Key Differences
| Feature | Speaker | Narration |
|---|---|---|
| Character behavior | Speaks (lip sync) | Reacts to voiceover |
| Character selection | Mr or Mrs Meedi | Mr or Mrs Meedi (same) |
| Backgrounds | Preset backgrounds | AI-generated per scene |
| Visual style | Per-scene mood | Auto-set (Fun & Colourful) |
| Scene types | Lip Sync, Animation, B-Roll | Animation and B-Roll only |
| Back navigation | Back buttons on every step | No Back buttons (linear) |
| Video engine | Hedra + Kling | Kling v2.5 Turbo |
| Cost per scene | ~$0.17-0.26 | ~$0.21 (Turbo) |
Library & Repurposing
Browse finished videos, reuse assets, and re-lipsync individual scenes.
1
Open the Library
Free
You
my videos
Action: Type /library, or say it naturally: "my videos", "show library", "what have i made", "browse content", etc. 21+ natural language triggers are supported.
Step 1 of 7
2
Character Filter
Free
Klaus
Your Library
Active Projects: [status shown if any in progress]
Action: Tap a character to browse their videos. Active projects are shown at the top with generation status.
Step 2 of 7
3
Video Grid
Free
Klaus
[Thumbnail collage: numbered grid of video thumbnails, newest first. Up to 9 per page with pagination.]
Action: Tap a number to open that video's detail view. Use Next/Prev for pagination.
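The grid behavior described above (newest first, up to 9 per page, numbered thumbnails) can be sketched as a simple pagination helper. The `created` field, the per-page numbering from 1, and the function name are assumptions made for this example; the guide does not describe the underlying data model.

```python
PAGE_SIZE = 9  # up to 9 thumbnails per page


def library_page(videos: list[dict], page: int) -> list[tuple[int, dict]]:
    """Return (grid_number, video) pairs for a 1-based page, newest first."""
    # 'created' is a hypothetical sortable timestamp field.
    ordered = sorted(videos, key=lambda v: v["created"], reverse=True)
    start = (page - 1) * PAGE_SIZE
    # Number the thumbnails 1..9 within the page (assumed).
    return list(enumerate(ordered[start:start + PAGE_SIZE], start=1))
```

With 12 videos, page 1 holds the 9 newest and page 2 holds the remaining 3.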
Step 3 of 7
4
Video Detail
Free
Klaus
"My Breakfast Video"
Created: Feb 24, 2026
Duration: 32s | 4 scenes
Action: Edit Scenes reopens the Edit Menu. Rewatch re-sends the video. Download sends it as a document. Delete removes it (after confirmation).
Step 4 of 7
5
Reuse Assets
Free
Klaus
Reuse assets from this video:
Action: Start a new video using existing assets. Fastest way to create — skip writing/recording entirely. Great for A/B testing different visual styles.
Step 5 of 7
6
Re-lipsync
$0.17-0.60
Re-lipsync
Klaus
Pick a scene to re-lipsync, then choose a provider:
Action: Non-destructive — the original scene is always preserved. Pick a scene from the visual collage, choose a provider, upload new audio.
▶ Behind the Scenes
Image-based (still image + audio): Hedra ($0.17, best value), Kling Avatar ($0.26, full body), OmniHuman ($0.60, premium expressions).
Video-based (existing video + audio): SyncLabs ($0.25, preserves visuals), Kling Lip-Sync ($0.28, high quality re-sync).
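The provider options above fit naturally into a small lookup table. This sketch only restates the names, input types, and prices quoted in this guide; the `Provider` class and `providers_for` helper are hypothetical and do not reflect any real API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provider:
    name: str
    input_kind: str  # "image" (still image + audio) or "video" (existing video + audio)
    cost_usd: float
    note: str


PROVIDERS = [
    Provider("Hedra", "image", 0.17, "best value"),
    Provider("Kling Avatar", "image", 0.26, "full body"),
    Provider("OmniHuman", "image", 0.60, "premium expressions"),
    Provider("SyncLabs", "video", 0.25, "preserves visuals"),
    Provider("Kling Lip-Sync", "video", 0.28, "high quality re-sync"),
]


def providers_for(input_kind: str) -> list[Provider]:
    """List providers for an input type, cheapest first."""
    matches = (p for p in PROVIDERS if p.input_kind == input_kind)
    return sorted(matches, key=lambda p: p.cost_usd)
```

Sorting by price puts Hedra first for image-based re-lipsync and SyncLabs first for video-based re-lipsync.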
Step 6 of 7
7
Search
Free
You
/search breakfast
Action: Use /search keyword to find videos by title. Deep links also work: "Mrs Meedi videos" jumps straight to that section.
Step 7 of 7
Post-Production (Edit Menu)
All edit menu changes are free — powered by FFmpeg with no API costs.
1
Edit Menu Overview
Free
Klaus
Your video is ready!
Action: This is your free post-production toolkit. Every change uses FFmpeg locally — no API calls, no costs. Experiment freely!
| Button | What It Does |
|---|---|
| Scenes | Scene Carousel — per-scene editing |
| Music | Add/change/remove background music |
| Volume | Adjust voiceover volume (0-200%) |
| Subtitles | Change subtitle style |
| Effects | Toggle film grain and effects |
| Transitions | Overview of all scene transitions |
| Done | Name and save to Library |
| Delete | Discard video (with confirmation) |
Step 1 of 10
2
Scene Carousel
Free*
Klaus
[Scene 2 of 4 video clip]
Action: Navigate scenes, edit subtitle text, change transitions, or regenerate a scene. *Change Scene costs money (re-generates).
Step 2 of 10
3
Transitions
Free
Klaus
S1→S2: Fade
S2→S3: Cut
S3→S4: Zoom In
Action: See all transitions at once. Tap any to cycle through: Cut → Fade → Dissolve → Zoom In → Circle Open → Fade Black.
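The tap-to-cycle behavior can be sketched as stepping through an ordered list. The order is taken from the sequence above; wrapping from Fade Black back to Cut is an assumption, as is the function name.

```python
# Transition order as listed in this guide.
TRANSITIONS = ["Cut", "Fade", "Dissolve", "Zoom In", "Circle Open", "Fade Black"]


def next_transition(current: str) -> str:
    """Return the transition that follows the current one in the cycle."""
    index = TRANSITIONS.index(current)
    # Wrap around to the start after the last entry (assumed behavior).
    return TRANSITIONS[(index + 1) % len(TRANSITIONS)]
```

Tapping a transition set to Fade would move it to Dissolve, and so on through the list.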
Step 3 of 10
4
Add Music
Free
Klaus
Music options:
Action: Add background music from file or URL. Music is mixed with voiceover. Tip: 30-40% music volume works best under speech.
Step 4 of 10
5
Volume & Music Volume
Free
Action: Adjust the audio levels. Volume sets the voiceover volume from 0-200% (100% = original). Music Vol is a separate control for background music and appears after you add music.
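Since the edits run through FFmpeg, a percentage like this most plausibly maps onto a linear gain for FFmpeg's `volume` audio filter. The sketch below shows that mapping under the 0-200% range above; whether the app builds its filters this way is an assumption, and `volume_filter` is a hypothetical helper.

```python
def volume_filter(percent: float) -> str:
    """Build an FFmpeg volume filter expression from a 0-200% setting."""
    # Clamp to the 0-200% range offered in the Edit Menu.
    clamped = max(0.0, min(200.0, percent))
    # FFmpeg's volume filter takes a linear gain, where 1.0 means unchanged.
    return f"volume={clamped / 100:.2f}"
```

A setting of 100% yields `volume=1.00` (unchanged), and a 35% music level yields `volume=0.35`.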
Step 5 of 10
6
Change Subtitles
Free
Klaus
Pick a new subtitle style:
Action: Re-renders video with new subtitle style. Free (FFmpeg). You can change styles as many times as you like.
Step 6 of 10
7
Fix Text
Free
Action: Correct words the speech-to-text (Whisper) got wrong. Names, technical terms, and unusual words are common culprits. Type the correction and the video re-renders.
Step 7 of 10
8
Effects
Free
Action: Toggle film grain (on by default) and other post-processing effects. Film grain adds a cinematic feel — it's subtle and free.
Step 8 of 10
9
Finish (Done)
Free
Klaus
Name your video:
Action: Tap Done, type a title, and your video is archived to the Library with all assets (scenes, audio, script). You can reopen the Edit Menu any time from the Library.
Step 9 of 10
10
Delete
Free
Action: Tap Delete to discard. Klaus asks for confirmation first. Library assets (scenes, audio, script) are preserved separately, so deleting the final video doesn't destroy the building blocks.
Step 10 of 10
$
Cost Summary
30s Speaker Mode (3 scenes)
| Step | Service | Cost |
|---|---|---|
| Script | Gemini Flash | $0.001 |
| TTS (~80 words) | ElevenLabs | $0.01 |
| Transcription | Whisper | $0.003 |
| 3x lip sync | Hedra | $0.51 |
| Assembly + edits | FFmpeg | Free |
| Total | | ~$0.52 |
60s Narration Mode (6 scenes, 33% B-Roll)
| Step | Service | Cost |
|---|---|---|
| Script + Director | Gemini + Claude | $0.03 |
| TTS (~150 words) | ElevenLabs | $0.02 |
| 3x CUT images | Gemini Image | $0.12 |
| 4x Kling animations | Kling v2.5 Turbo | $0.84 |
| 2x B-Roll | Pexels | Free |
| Total | | ~$1.01 |