Question 1

What can I do with DubVoice.ai?

Accepted Answer

DubVoice.ai is an all-in-one AI content platform. You can convert text to speech with 10,500+ voices, clone any voice from a 10-second sample, generate AI videos (Veo 3, Grok), create images with 9 different AI models, produce original music with Suno V5, transcribe audio to text and SRT, change voices, dub audio into 50+ languages, browse a stock video library, translate content in 36+ languages, and integrate everything via our developer REST API.

Question 2

How does text-to-speech work?

Accepted Answer

Paste your text (or upload TXT, DOCX, PDF, SRT files), pick a voice from 10,500+ ElevenLabs / Minimax / Edge TTS voices, tweak stability and speed, and download high-quality MP3 in seconds. Edge TTS is 90% cheaper than ElevenLabs — useful for bulk work.

Question 3

How does AI voice cloning work?

Accepted Answer

Upload a 10-second clean recording of any voice, give it a name, and the cloned voice is ready to use in TTS, dialogue, and dubbing. All your clones live in your account and never expire. Commercial license is included.

Question 4

How does AI video generation work?

Accepted Answer

Pick a model (Veo 3 Fast / Lite / Quality from Google, or Grok Imagine from xAI), describe what you want in 10–2,000 characters, and optionally attach reference images. We render 8-second 720p / 1080p clips in 16:9 (or 9:16 on Veo 3.1 Lite for Reels / Shorts / TikTok). Costs range from 1,000 credits (Veo 3.1 Lite) to 10,000 credits (Veo 3.1 Quality).

Question 5

How does AI image generation work?

Accepted Answer

Choose from 7 image models — Nano Banana 2 Lite, Nano Banana 2, Grok Image, Meta AI, Nano Banana Pro, FLUX 2 Pro and GPT Image 2 — pick an aspect ratio, optionally attach reference images for image-to-image, and download. Costs run from 500 credits (Nano Banana 2 Lite) to 15,000 credits (GPT Image 2).

Question 6

How does AI music generation work?

Accepted Answer

DubVoice.ai runs Suno V5 — give it a style and a mood, optionally write your own lyrics or let the AI write them, and you get 2 full-length tracks (~4 minutes each) per generation. 25,000 credits per generation, royalty-free, commercial license included.

Question 7

What audio tools are included?

Accepted Answer

Speech-to-Text (transcribe up to 200MB MP3/WAV/M4A/FLAC into JSON + SRT, 1,000 credits/minute), Voice Changer (transform any audio into any of 4,900+ target voices, 2,000 credits/minute), Auto Dubbing (translate audio into 50+ languages with synced SRT, 30,000 credits per task), and Text-to-Dialogue (multi-speaker conversations with 2-6 voices).

Question 8

What is the Stock Video feature?

Accepted Answer

Provide a topic + narration text and DubVoice.ai stitches matching footage from a stock library, optionally time-aligned to your own narration audio. 500 credits per 1,000 characters.

Question 9

What languages are supported?

Accepted Answer

50+ languages for text-to-speech and auto dubbing (English, Spanish, French, German, Italian, Portuguese, Turkish, Japanese, Chinese, Korean, Arabic, Hindi and many more) and 36+ for AI translation.

Question 10

How much does it cost?

Accepted Answer

Credit packages: Starter $4.99 (250K credits), Standard $11.99 (1M), Pro $24.99 (3M), Business $34.99 (10M), Enterprise $64.99 (20M). No subscriptions — buy once, credits never expire. TTS costs 1 credit per character (Edge = 0.1), translation 1 credit per 10 characters, Veo 3 video 1,000–10,000 credits, image generation 500–15,000 credits per image depending on model, music 25,000 credits per generation. Crypto payments accepted via Cryptomus.

Question 11

Can I use the generated content commercially?

Accepted Answer

Yes — every plan includes a full commercial use license. Use generated audio, video, images and music for YouTube, podcasts, ads, e-learning, marketing or any other commercial project.

Question 12

Do you offer an API?

Accepted Answer

Yes. Our REST API at /api/v1 covers text-to-speech, voice cloning, speech-to-text, voice changer, auto dubbing, dialogue, video (Veo 3 + Grok), image generation, music, translation and stock video. Authenticate with an sk_ key from your dashboard. Full reference at /api-docs and an AI-readable mirror at /llms.txt.

Question 13

How does AI translation work?

Accepted Answer

Our translation goes beyond word-for-word — it adapts idioms, expressions and cultural references naturally. Upload files in bulk (up to 10 at once), import a YouTube transcript by URL, add custom instructions, and download translations individually or as a ZIP.

Question 14

Is my data secure?

Accepted Answer

Yes. Authentication runs on Supabase, payments go through Stripe and Cryptomus (card details never touch our servers), assets are stored in a private Supabase bucket and automatically deleted after 48 hours. We are GDPR-aware and your content is not used to train third-party models.

Frequently Asked Questions