Neural Text-to-Speech
Qwen3-TTS and Kokoro voices on MLX or CUDA backends. Streaming audio with Preston-Blair visemes.
Reported metrics: real-time factor, cache hit (no values shown).
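For readers unfamiliar with the real-time factor metric, here is a minimal sketch of how it is commonly computed: synthesis time divided by the duration of the audio produced, so values below 1.0 mean faster than real time. The function name and its parameters are illustrative assumptions, not taken from this page's implementation.

```python
def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """Return synthesis time divided by audio duration (hypothetical helper).

    A value below 1.0 means the audio was generated faster than it plays back.
    """
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


# Example: 0.5 s of compute for 2.0 s of audio.
rtf = real_time_factor(0.5, 2.0)
print(f"Real-time factor: {rtf:.2f}")
```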
Qwen3-TTS
ddx_adam: en · male · en-US
ddx_bella: en · female · en-US
ddx_heart: en · female · en-US
Text · Live waveform
Active viseme: sil
A/V sync: Audio @0.00s · Visemes @0.00s · Δ+0 ms
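The A/V sync readout pairs an audio timestamp with a viseme timestamp and shows their difference in milliseconds. A rough sketch of that calculation follows. The sign convention (viseme time minus audio time) and both helper names are assumptions; the page does not say how Δ is defined.

```python
def av_offset_ms(audio_seconds: float, viseme_seconds: float) -> int:
    """Offset of the viseme track relative to audio, in whole milliseconds.

    Assumed convention: positive means visemes are ahead of the audio clock.
    """
    return round((viseme_seconds - audio_seconds) * 1000)


def format_offset(offset_ms: int) -> str:
    """Render the offset with an explicit sign, as in the readout above."""
    return f"Δ{offset_ms:+d} ms"


# Both tracks at 0.00 s, as in the readout on this page.
print(format_offset(av_offset_ms(0.0, 0.0)))
```

With both clocks at 0.00 s this prints `Δ+0 ms`, matching the readout.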