Veo 3 & Grok Imagine — now live

Create Voice, Video &
Music with AI

Text-to-speech with 10,500+ voices, Veo 3 and Grok Imagine video, 10 AI image models, AI music creation, smart translation in 50+ languages, and a developer API with REST + webhooks — all in one platform.

Powering content creation worldwide

50+AI Voices
2Video Models
10Image Models
50+Languages
6AI Tools
REST + WebhooksAPI Access

The Creative Toolkit

Six powerful AI tools in one platform — voice, video, music, translation, creative studio, and developer API.

AI Text-to-Speech

10500+ premium voices with 3 providers: ElevenLabs, Minimax & Edge TTS. 50+ languages, adjustable speed & style. Download as MP3.

Cinema Video Gen

Generate videos from text prompts with multiple AI models including Veo 3. Choose duration and style with real-time progress tracking.

Creative Studio

Multiple AI models including Nano Banana, GPT Image, Grok Image, and Flux 2 Pro. Various aspect ratios, quality settings, and reference image support.

AI Music Lab

Create original music with AI. Add custom lyrics with verse, chorus, bridge structure and define style, genre and mood.

Dub Translate

Cultural, context-aware translation in 50+ languages. Bulk file translation, YouTube transcript import, and custom instructions.

Enterprise API

REST API + webhooks for TTS, video, image, translation, and voice listing. Secure API keys, full documentation, and easy integration.

15 AI Models & Modes

One platform. Every model.

Pick the model that fits your creative needs — TTS, image, video, music. Switch providers with one click; pay only for what you generate.

TTS

🎙️ElevenLabs

10,500+ premium voices

TTS

🎵Minimax

30+ languages, HD models

TTS

Edge TTS

Microsoft — 90% off

TTS

🌸Kokoro TTS

NEW — 50% off

IMAGE

🍌Nano Banana

IMAGE

🍌Nano Banana 2

IMAGE

🍌Nano Banana Pro

IMAGE

🖼️GPT Image 2

IMAGE

🎨FLUX 2 Pro

IMAGE

🤖Grok Image

VEO 3

Fast

3,200 cr · 16:9

VEO 3

💨Lite

1,000 cr · 9:16 supported

VEO 3

👑Quality

10,000 cr · HQ

GROK

🎬Grok Imagine

Video generation

MUSIC

🎵Suno V5

AI music generation

More models added every month. Voice cloning, dubbing, translation, and speech-to-text included.

Get started in 3 simple steps

From sign-up to AI-generated content in minutes

STEP 01

Create Free Account

Sign up in seconds. No credit card required. Get free credits to explore all AI tools.

STEP 02

Add Credits

Buy credit packages starting at $4.99. No subscriptions — pay once, use anytime. Credits never expire.

STEP 03

Create with AI

Generate voices, videos, images, music, translations, and content — all from one dashboard.

6 AI tools, one platform

Text-to-Speech
AI Video
AI Images
AI Music
Translation
Content Writer
Get Started Free
New · Windows desktop

Dubvoice.exe Automation Studio

Drop in your API key and automate your DubVoice workflows end-to-end — bulk TTS, dubbing pipelines, scheduled video renders. Open source, runs locally on Windows.

Bulk & scheduled jobs Local-only API key MIT open source
Dubvoice.exe
Workflow: youtube-narrator

✓ Loaded 12 scripts from /input

✓ Voice: ElevenLabs · clone_42

Kokoro TTS fallback enabled (50% off)

↻ Synthesising 7 / 12 ...

58% · est. 4m 12s remaining

Simple Credit-Based Pricing

No subscriptions. Buy credits and use them whenever you need. Credits never expire.

Starter

$4.99
250,000 credits

Perfect for getting started

Get Started

Standard

$11.99
1,000,000 credits

Great for regular use

Get Started
Best Value

Pro

$24.99
3,000,000 credits

Best value for creators

Get Started

Business

$34.99
10,000,000 credits

For power users & teams

Get Started

Enterprise

$64.99
20,000,000 credits

Maximum value at scale

Get Started

All packages include

10500+ AI Voices
50+ Languages
AI Voice Cloning
Veo 3 Video Generation
AI Images & Music
Speech-to-Text & Voice Tools
AI Translation
AI Music Generation (Suno)
Commercial License

Pay-as-you-go. No subscriptions. Credits never expire. Commercial use license included.

Frequently Asked Questions

DubVoice.ai is an all-in-one AI content platform — text-to-speech with 10,500+ voices, voice cloning, AI video (Veo 3 + Grok), 9 AI image models, AI music (Suno V5), speech-to-text, voice changer, auto dubbing into 50+ languages, stock video, translation in 36+ languages, and a full REST API. One credit balance covers everything.

Paste your text (or upload TXT, DOCX, PDF, SRT files), pick from 10,500+ ElevenLabs / Minimax / Edge TTS / Kokoro voices, tune stability and speed, and download MP3 in seconds. Edge TTS is 90% cheaper than ElevenLabs and Kokoro is 50% cheaper — great for bulk work.

Record or upload a 10-second clean voice sample, give it a name, and the cloned voice is ready to use in TTS, dialogue, and dubbing. Clones live in your account, never expire, and the commercial license is included.

Pick Veo 3 Fast / Lite / Quality (Google) or Grok Imagine (xAI), describe the scene in 10–2,000 characters, optionally attach reference images. We render 8-second 720p / 1080p clips in 16:9 (or 9:16 on Veo 3.1 Lite for Reels / Shorts / TikTok). Pricing 1,000–10,000 credits depending on the variant.

Choose from 9 image models — Nano Banana family, GPT Image 1.5/2, FLUX 2 Pro, Grok Image, Seedream 4.5 and Seedream 5 Lite — pick an aspect ratio, optionally attach up to 4 reference images for image-to-image, and download. 500 credits (Nano Banana 2) up to 15,000 credits (GPT Image 1.5 high quality).

Suno V5 — give it a style and a mood, optionally write lyrics or let the AI do it, and you get 2 full-length tracks (~4 minutes each). 25,000 credits per generation, royalty-free, commercial license included.

Speech-to-Text (up to 200MB → JSON + SRT, 1,000 credits/minute), Voice Changer (transform audio into any of 4,900+ target voices, 2,000 credits/minute), Auto Dubbing (50+ languages with synced SRT, 30,000 credits/task), and Text-to-Dialogue for multi-speaker conversations.

50+ languages for text-to-speech and auto dubbing (English, Spanish, French, German, Italian, Portuguese, Turkish, Japanese, Chinese, Korean, Arabic, Hindi, and many more), 36+ for AI translation.

Credit packages: Starter $4.99 (250K), Standard $11.99 (1M), Pro $24.99 (3M), Business $34.99 (10M), Enterprise $64.99 (20M). No subscriptions — credits never expire. TTS 1 credit/character (Kokoro 0.5, Edge 0.1), translation 1 credit per 10 characters, video 1,000–10,000 credits, image 500–15,000 credits, music 25,000 credits per generation.

Yes — every plan includes a full commercial license. Use generated audio, video, images and music for YouTube, podcasts, ads, e-learning, marketing or any other commercial project.

Yes. The REST API at /api/v1 covers TTS, voice cloning, STT, voice changer, dubbing, dialogue, video (Veo 3 + Grok), image generation, music, translation and stock video. Authenticate with an sk_ key from the dashboard. Reference at /api-docs and an AI-readable mirror at /llms.txt.

Translation adapts idioms, expressions and cultural references — not just word-for-word. Bulk upload up to 10 files at once, import a YouTube transcript by URL, add custom instructions, download individually or as a ZIP.

Yes. Authentication runs on Supabase, payments through Stripe and Cryptomus (card details never touch our servers), assets live in a private bucket and are auto-deleted after 48 hours. GDPR-aware; your content is not used to train third-party models.

Ready to create with AI?

Join thousands of creators and businesses. No subscription required.