Question 1

What is Multi-Voice Dialogue mode?

Accepted Answer

Multi-Voice Dialogue mode is a VoisLabs feature that converts a conversation script into audio spoken by two or more distinct AI voices. You write or paste a script with speaker labels, assign a voice from the 13-voice catalog to each speaker, and VoisLabs generates a single stitched audio file with each speaker sounding genuinely different. The output is a finished audio file you can download and publish.

Question 2

Which voices create the best contrast for a two-host podcast?

Accepted Answer

Strong two-host pairs include: Amit (Sadachbia, deep authoritative) + Priya (Sulafat, warm storyteller) — great for Hindi and English; Deepak (Charon, smooth mid-low) + Isha (Kore, clear high-energy) — works well for interview formats; Arjun (Enceladus, sharp investigative) + Naina (Vindemiatrix, calm elegant) — documentary and explainer tone. Avoid pairing two voices in the same pitch register — listeners will struggle to distinguish speakers.

Question 3

Does Multi-Voice Dialogue work in Hindi and Indian languages?

Accepted Answer

Yes. Dialogue mode works natively in Hindi (Devanagari), Tamil, Telugu, Malayalam, Kannada, Bengali, Marathi, Punjabi, Assamese, and Indian English. Write each speaker turn in the native script for that language. The AI generates with correct native pronunciation and prosody — no transliteration workarounds needed.

Question 4

Can I use Multi-Voice Dialogue for audiobooks with character voices?

Accepted Answer

Yes. The standard approach for AI-generated audiobooks uses a stable narrator voice for prose and scene-setting, then switches to distinct character voices for spoken dialogue. Assign the narrator role to a consistent voice (Priya or Naina work well), then assign contrasting voices to recurring characters. The output is a seamless audio file with natural voice transitions at each speaker change.

Question 5

How long does it take to generate a multi-voice episode?

Accepted Answer

Generation time scales with total audio duration, not with the number of speaker turns. A 10-minute two-host episode typically generates in 15–30 seconds. A 30-minute audiobook chapter generates in under 2 minutes. All turns are processed in a single pass — you do not need to generate each speaker separately and then stitch manually.

Question 6

Is the output commercial-ready for Spotify and Apple Podcasts?

Accepted Answer

Yes. VoisLabs paid plans generate broadcast-quality WAV output and include a commercial license covering podcast distribution revenue, YouTube monetization, audiobook sales, and sponsorship income. Normalize your WAV to -16 LUFS for podcast platforms and -14 LUFS for YouTube before uploading.

Multi-Voice Dialogue Mode

13 Acoustically Distinct Voices

Works in All Indian Languages

Per-Speaker Tone Control

Seamless Stitching

HD WAV Download

Podcast, Interview, Audiobook Formats

How It Works

Write your dialogue script

Assign voices to speakers

Generate and download

Frequently Asked Questions

Two voices. One seamless audio file.

Explore More Features