Frequently Asked Questions

DubVoice.ai is an all-in-one AI content platform — text-to-speech with 10,500+ voices, voice cloning, AI video (Veo 3 + Grok), 9 AI image models, AI music (Suno V5), speech-to-text, voice changer, auto dubbing into 50+ languages, stock video, translation in 36+ languages, and a full REST API. One credit balance covers everything.

Paste your text (or upload TXT, DOCX, PDF, SRT files), pick from 10,500+ ElevenLabs / Minimax / Edge TTS / Kokoro voices, tune stability and speed, and download MP3 in seconds. Edge TTS is 90% cheaper than ElevenLabs and Kokoro is 50% cheaper — great for bulk work.

Record or upload a 10-second clean voice sample, give it a name, and the cloned voice is ready to use in TTS, dialogue, and dubbing. Clones live in your account, never expire, and the commercial license is included.

Pick Veo 3 Fast / Lite / Quality (Google) or Grok Imagine (xAI), describe the scene in 10–2,000 characters, optionally attach reference images. We render 8-second 720p / 1080p clips in 16:9 (or 9:16 on Veo 3.1 Lite for Reels / Shorts / TikTok). Pricing 1,000–10,000 credits depending on the variant.

Choose from 9 image models — Nano Banana family, GPT Image 1.5/2, FLUX 2 Pro, Grok Image, Seedream 4.5 and Seedream 5 Lite — pick an aspect ratio, optionally attach up to 4 reference images for image-to-image, and download. 500 credits (Nano Banana 2) up to 15,000 credits (GPT Image 1.5 high quality).

Suno V5 — give it a style and a mood, optionally write lyrics or let the AI do it, and you get 2 full-length tracks (~4 minutes each). 25,000 credits per generation, royalty-free, commercial license included.

Speech-to-Text (up to 200MB → JSON + SRT, 1,000 credits/minute), Voice Changer (transform audio into any of 4,900+ target voices, 2,000 credits/minute), Auto Dubbing (50+ languages with synced SRT, 30,000 credits/task), and Text-to-Dialogue for multi-speaker conversations.

50+ languages for text-to-speech and auto dubbing (English, Spanish, French, German, Italian, Portuguese, Turkish, Japanese, Chinese, Korean, Arabic, Hindi, and many more), 36+ for AI translation.

Credit packages: Starter $4.99 (250K), Standard $11.99 (1M), Pro $24.99 (3M), Business $34.99 (10M), Enterprise $64.99 (20M). No subscriptions — credits never expire. TTS 1 credit/character (Kokoro 0.5, Edge 0.1), translation 1 credit per 10 characters, video 1,000–10,000 credits, image 500–15,000 credits, music 25,000 credits per generation.

Yes — every plan includes a full commercial license. Use generated audio, video, images and music for YouTube, podcasts, ads, e-learning, marketing or any other commercial project.

Yes. The REST API at /api/v1 covers TTS, voice cloning, STT, voice changer, dubbing, dialogue, video (Veo 3 + Grok), image generation, music, translation and stock video. Authenticate with an sk_ key from the dashboard. Reference at /api-docs and an AI-readable mirror at /llms.txt.

Translation adapts idioms, expressions and cultural references — not just word-for-word. Bulk upload up to 10 files at once, import a YouTube transcript by URL, add custom instructions, download individually or as a ZIP.

Yes. Authentication runs on Supabase, payments through Stripe and Cryptomus (card details never touch our servers), assets live in a private bucket and are auto-deleted after 48 hours. GDPR-aware; your content is not used to train third-party models.