Frequently Asked Questions
DubVoice.ai is an all-in-one AI content platform. You can convert text to speech with 500+ premium voices, clone any voice with AI, transcribe audio to text (Speech-to-Text with JSON & SRT output), isolate vocals from audio, change voices with AI, generate AI videos with Veo 3, create images with 9 AI models including Seedream 4.5 & 5 Lite, generate original music with AI, translate content in 36+ languages, write long-form content with AI, and integrate everything via our developer API.
Paste your text (or upload TXT, DOCX, PDF, SRT files), choose from 500+ AI voices powered by 3 TTS providers, customize voice settings like speed, stability, and style, and generate high-quality MP3 or WAV audio in seconds. Supports 50+ languages.
Upload or record just 5-15 seconds of audio and our AI will clone the voice. You can then use your cloned voice in the built-in TTS playground to generate speech from any text. Each user's cloned voices are private and only visible to them.
For videos, use Veo 3 to generate videos from text prompts with real-time progress tracking and instant download. For images, choose from 9 AI models including Seedream 4.5, Seedream 5 Lite, GPT Image 1.5, Nano Banana 2, Flux 2 Pro and more. Set your aspect ratio and optionally upload reference images for style transfer.
We support 36+ languages for translation and 50+ for text-to-speech, including English, Spanish, French, German, Italian, Portuguese, Turkish, Japanese, Chinese, Korean, Arabic, Hindi, and many more.
Credit packages start at $4.99 for 250K credits. No subscriptions — buy once, use anytime. TTS costs 1 credit per character, translation costs 1 credit per 2 characters. AI video costs 600-1,000 credits. Image generation costs 1,000-10,000 credits depending on the model. Speech-to-Text costs 1,000 credits per minute. Voice Changer costs 2,000 credits per minute. Voice Isolation costs 10,000 credits. AI music costs 10,000 credits per track.
Yes! All plans include a commercial use license. Use generated audio, video, images, and text for YouTube, podcasts, ads, e-learning, marketing, and any commercial project.
Yes! Our REST API supports text-to-speech, translation, voice listing, and account management. Create secure API keys from your dashboard, with full documentation and code examples available.
Describe your topic and choose a content length (1K to 50K characters). The AI generates professional scripts, articles, guides, or marketing copy. You can also paste a YouTube URL to extract the topic and generate content from it.
Our translation goes beyond word-for-word — it adapts idioms, expressions, and cultural references naturally. Upload files in bulk (up to 10 at once), import YouTube transcripts, add custom instructions, and download translations individually or as a ZIP.
Upload any audio file (MP3, WAV, FLAC, OGG, AAC, and more — up to 200MB) and our AI will transcribe it to text. You get both a JSON transcript and SRT subtitle file. Costs 1,000 credits per minute of audio. Credits are refunded if processing fails.
Voice Isolation removes background noise and music from any audio recording, leaving only the clean voice. Voice Changer transforms a voice recording into any of 4,900+ target voices using Speech-to-Speech AI while preserving the original speech patterns and timing. Voice Isolation costs 10,000 credits per use, Voice Changer costs 2,000 credits per minute.
Absolutely. Your content is processed securely and we use industry-standard encryption. We are GDPR compliant and your data is never shared with third parties.