Back to Blog
Announcement
AnnouncementApril 21, 202615 min read

Everything You Can Do with DubVoice.ai — Full Platform Feature Guide (2026)

DubVoice.ai is more than a text-to-speech tool — it's a complete AI content creation platform. Whether you're a YouTuber, marketer, developer, or business owner, DubVoice.ai gives you everything you need to create professional voice, video, image, music, and written content at a fraction of traditional costs.

Here's a complete breakdown of every feature available on the platform in 2026.

AI Text-to-Speech (TTS)

Our flagship feature turns any text into natural, human-sounding speech in seconds.

What You Get

  • 500+ Premium AI Voices — Male, female, and diverse voices from 3 TTS providers (ElevenLabs, Minimax, Edge TTS)
  • 50+ Languages — English, Spanish, French, German, Japanese, Chinese, Arabic, Hindi, Turkish, and many more
  • Multiple AI Models — Eleven Multilingual v2 (recommended), Turbo v2.5 (fastest), Flash v2.5 (ultra-fast), v3 (alpha, 70+ languages)
  • Full Voice Customization — Adjust stability, clarity, speed, speaker boost, and style exaggeration
  • Large Text Support — Process up to 100,000 characters with automatic smart chunking
  • File Upload — Drop in TXT, DOCX, PDF, RTF, or SRT files and convert them directly to speech
  • Download as MP3 or WAV — Industry-standard formats ready for any project
  • Save Your Favorite Voices — Bookmark voices for instant access

Cost

1 character = 1 credit (Edge TTS: 2 characters = 1 credit)

AI Voice Cloning

Clone any voice in seconds with just a short audio sample.

What You Get

  • Quick Cloning — Upload or record 5-15 seconds of audio to clone any voice
  • Browser Recording — Record audio directly in your browser
  • TTS Playground — Test your cloned voice instantly by generating speech
  • Shared Voice Library — All cloned voices available for every user
  • Background Processing — Long texts processed with progress tracking
  • Multiple Formats — WAV, MP3, WebM (up to 16MB), up to 50,000 characters

Cost

1 character = 1 credit (voice cloning itself is free)

AI Video Generation — 6 Providers

Create videos from text prompts using the latest AI models — no editing software required.

Veo 3 (Google DeepMind)

Google's flagship video AI. Generate cinematic videos from text prompts with natural motion, lighting, and physics.

  • Veo 3.1 HD — Full quality, 1080p (10,000 credits)
  • Veo 3.1 Fast — Faster generation, 720p (2,000 credits)
  • Veo 3.1 Lite — Budget-friendly option (1,800 credits)
  • Video Extend — Extend any generated video (2,000 credits per extension)

Grok Imagine (xAI)

xAI's video model with fun, creative, and spicy generation modes.

  • Text-to-Video — 6-30 second videos, 480p/720p
  • Image-to-Video — Animate any image into video
  • 3 Creative Modes — Fun, Normal, Spicy
  • Cost: 1,000 credits per video

Sora 2 (OpenAI)

OpenAI's next-generation video model for high-quality cinematic output.

  • Standard Quality — 10-15 second videos
  • Cost: 100,000 credits per video

Sora 2 Pro (OpenAI)

Premium quality with extended capabilities.

  • Standard & High modes — Maximum visual fidelity
  • Cost: 150,000-200,000 credits per video

Sora 2 Storyboard (OpenAI)

Multi-scene storyboard generation for complex video projects.

  • Multi-Scene Support — Create connected sequences
  • Cost: 150,000-200,000 credits per video

Seedance 2 (ByteDance)

ByteDance's video model with first-frame reference support.

  • 480p/720p resolution options
  • First Frame Support — Start from a specific image
  • Cost: 1,000 credits per video

AI Image Generation — 10 Models

Generate stunning AI images for thumbnails, social media, marketing, and creative projects.

Available Models

  • Nano Banana — Google Gemini, text + edit (2,000 credits)
  • Nano Banana 2 — Gemini 3 Flash Image Preview via GeminiGen.AI, image-to-image up to 8 references (3,000 / 4,000 / 10,000 credits at 1K/2K/4K)
  • Nano Banana Pro — Gemini 3 Pro Image Preview via GeminiGen.AI, image-to-image up to 8 references (3,500 / 6,000 / 15,000 credits at 1K/2K/4K)
  • Grok Image — xAI Grok Imagen via GeminiGen.AI, optional reference uploads (3,000 credits)
  • GPT Image 1.5 — Photorealistic images powered by OpenAI (5,000 credits)
  • Seedream 4 — ByteDance high-quality with text rendering (4,000 credits)
  • Seedream 4.5 — ByteDance 4K ultra-high-quality (5,000 credits)
  • Seedream 5 Lite — ByteDance latest fast & efficient (3,500 credits)
  • FLUX 2 Pro — Black Forest Labs professional quality (5,000 credits)

Features

  • 8 Aspect Ratios — 1:1, 4:3, 3:4, 16:9, 9:16, 21:9, 2:3, 3:2
  • Reference Image Upload — Upload up to 5 images for style-guided generation
  • Quality Settings — Control output quality per model

AI Music Generation (Suno AI)

Create original music tracks with custom lyrics, genre, and mood.

What You Get

  • Custom Lyrics — Write verse, chorus, bridge with structure tags
  • Style & Genre Control — Pop, rock, electronic, jazz, classical, and more
  • Mood Selection — Happy, sad, energetic, calm, epic, and more
  • Full Audio Output — Download complete tracks

Cost

10,000 credits per track

Speech to Text (AI Transcription)

Convert any audio file into text with AI-powered transcription.

What You Get

  • 10+ Audio Formats — MP3, AAC, AIFF, OGG, OPUS, WAV, WEBM, FLAC, M4A (up to 200MB)
  • JSON Transcript — Word-level timestamps
  • SRT Subtitles — Automatic subtitle generation for video editing
  • Copy & Download — Copy text or download JSON/SRT files

Cost

1,000 credits per minute of audio

Voice Tools

Professional audio tools for voice isolation, transformation, and dubbing.

Voice Isolation

Remove background noise, music, and ambient sounds — leaving only clean voice. Perfect for podcasts, interviews, and noisy footage.

Cost: 10,000 credits per use

Voice Changer (Speech-to-Speech)

Transform any voice recording into 4,900+ target voices. The AI preserves speech patterns, emotions, and timing while changing the voice identity.

Cost: 2,000 credits per minute of audio

Audio Dubbing

Dub audio files into different languages while preserving the speaker's voice characteristics and timing.

Cost: 30,000 credits per dubbing task

Dub Translate — AI-Powered Translation

Cultural, context-aware translation in 36+ languages. Not word-for-word — the AI adapts idioms, expressions, and cultural references.

What You Get

  • 36+ Languages with auto-detect
  • File Upload — TXT, DOCX, SRT files
  • Bulk Translation — Up to 10 files at once
  • YouTube Transcript Import — Paste URL, extract, translate
  • Custom Instructions — Save reusable guidelines
  • Download as ZIP — All translations in one file

Cost

1 character = 1 credit

AI Content Writer (FREE)

Generate long-form content — scripts, articles, guides, marketing copy.

What You Get

  • YouTube Import — Extract topics from YouTube URLs
  • Flexible Length — 1K to 50K characters
  • No Credit Cost — Completely free to use

YouTube Channel Management

Full AI-powered YouTube channel creation and management.

  • Channel Creation — Set up and manage YouTube channels
  • Topic Generation — AI-generated topic ideas for your niche
  • Video Creation — End-to-end video production with AI
  • Channel Analytics — Track performance and optimize

Developer API

Available Endpoints

  • POST /api/v1/tts — Text to speech
  • GET /api/v1/tts/:task_id — Check TTS job status
  • GET /api/v1/voices — List all voices
  • POST /api/v1/translate — Translate content
  • POST /api/v1/video — Generate video
  • GET /api/v1/me — Account info & balance

API Features

  • Secure API key authentication (sk_ prefixed)
  • Multiple API keys with custom names
  • Full documentation with code examples
  • All TTS parameters supported
  • Webhook callbacks for async operations

Flexible Credit-Based Pricing

No subscriptions. Buy credits and use them whenever you want.

Credit Packages

  • Starter — $4.99 for 250,000 credits
  • Standard — $11.99 for 1,000,000 credits
  • Pro — $24.99 for 3,000,000 credits (Best Value)
  • Business — $34.99 for 10,000,000 credits
  • Enterprise — $64.99 for 20,000,000 credits

How Credits Work

  • TTS: 1 character = 1 credit
  • Voice Clone TTS: 1 character = 1 credit
  • Translation: 1 character = 1 credit
  • Video: 1,000 - 200,000 credits (Veo 3: 1,800-10,000; Grok/Seedance: 1,000; Sora 2: 100-200K)
  • Image: 100 - 10,000 credits (varies by model)
  • Music: 10,000 credits per track
  • Speech-to-Text: 1,000 credits per minute
  • Content Writer: FREE

Affiliate Program

Earn 10% lifetime commission on every purchase your referrals make. Track referrals, balance, and earnings in real time from the Affiliate dashboard.

Get Started Today

DubVoice.ai combines 20+ AI tools into one platform:

  • AI Text-to-Speech — 500+ voices, 50+ languages
  • AI Voice Cloning — Clone any voice in seconds
  • AI Video — Veo 3, Grok, Sora 2, Seedance
  • AI Images — 10 models, full control
  • AI Music — Original tracks with custom lyrics
  • Speech-to-Text — Transcribe with JSON & SRT
  • Voice Tools — Isolation, changer, dubbing
  • AI Translation — 36+ languages, cultural adaptation
  • AI Content Writer — Scripts, articles, guides (FREE)
  • YouTube Tools — Channel management & automation
  • Developer API — Build voice & video into your products

No subscription required. Sign up, buy credits, and start creating.

Try DubVoice.ai Today

500+ AI voices, 6 video providers, 10 image models, AI music, translation & more — all in one platform. No subscription required.