TTS Client

Request queue: Text sentences enqueued by the session manager
Worker thread: Dequeues text, calls the TTS API, produces audio chunks
Response queue: Audio chunks ready for resampling and WebSocket delivery

File: main_logic/tts_client.py (~2300 lines)

The TTS client handles text-to-speech synthesis for multiple providers behind a single queue-based interface.

Factory function

```python
from main_logic.tts_client import get_tts_worker

# config must already hold the active provider and voice settings
worker = get_tts_worker(config)
```

get_tts_worker creates a TTS worker configured for the active provider and its voice settings.

| Provider | Module | Features |
|----------|--------|----------|
| DashScope CosyVoice | Cloud | High quality, voice cloning, streaming |
| DashScope TTS V2 | Cloud | Lower latency variant |
| GPT-SoVITS | Local | Fully offline, customizable |
| Custom | HTTP | Any OpenAI-compatible TTS endpoint |
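One way a factory can map a provider name to a worker class is a simple registry lookup. The sketch below is illustrative only: TTSConfig, BaseWorker, the two worker classes, the provider keys, and make_worker are all hypothetical names, and nothing here is taken from the actual get_tts_worker implementation.

```python
from dataclasses import dataclass
from typing import Dict, Type


@dataclass
class TTSConfig:
    """Hypothetical config object; the real one is not shown in this document."""
    provider: str
    voice: str = "default"


class BaseWorker:
    """Hypothetical base class that stores the voice settings."""

    def __init__(self, config: TTSConfig) -> None:
        self.voice = config.voice


class CosyVoiceWorker(BaseWorker):
    """Placeholder for a cloud-backed worker."""


class GPTSoVITSWorker(BaseWorker):
    """Placeholder for a locally hosted worker."""


# Registry from provider key to worker class (keys are assumptions).
_WORKERS: Dict[str, Type[BaseWorker]] = {
    "cosyvoice": CosyVoiceWorker,
    "gpt_sovits": GPTSoVITSWorker,
}


def make_worker(config: TTSConfig) -> BaseWorker:
    """Return a worker instance for config.provider, or raise ValueError."""
    try:
        worker_cls = _WORKERS[config.provider]
    except KeyError:
        raise ValueError(f"unknown TTS provider: {config.provider!r}") from None
    return worker_cls(config)
```

With a registry like this, supporting another provider means adding one entry rather than another branch in the factory.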

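The TTS client uses a producer-consumer pattern: the session manager produces text sentences on the request queue, and the worker thread consumes them, calls the TTS API, and places the resulting audio chunks on the response queue.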
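A minimal sketch of this request queue, worker thread, and response queue arrangement, using only the standard library. It is not the code in tts_client.py: fake_synthesize stands in for the real TTS API call, and the _STOP sentinel, tts_worker, and run_demo are names assumed for illustration.

```python
import queue
import threading
from typing import List

_STOP = object()  # sentinel that tells the worker to exit (assumed convention)


def fake_synthesize(text: str) -> bytes:
    """Stand-in for the TTS API call; returns placeholder bytes, not audio."""
    return text.encode("utf-8")


def tts_worker(request_q: queue.Queue, response_q: queue.Queue) -> None:
    """Consume sentences until the sentinel arrives, emitting one chunk each."""
    while True:
        item = request_q.get()
        if item is _STOP:
            response_q.put(_STOP)  # let the reader know no more chunks follow
            break
        response_q.put(fake_synthesize(item))


def run_demo(sentences: List[str]) -> List[bytes]:
    """Play the session manager: enqueue text, then collect the chunks."""
    request_q: queue.Queue = queue.Queue()
    response_q: queue.Queue = queue.Queue()
    worker = threading.Thread(
        target=tts_worker, args=(request_q, response_q), daemon=True
    )
    worker.start()

    for sentence in sentences:  # producer side
        request_q.put(sentence)
    request_q.put(_STOP)

    chunks: List[bytes] = []
    while True:  # downstream side (resampling and delivery would happen here)
        chunk = response_q.get()
        if chunk is _STOP:
            break
        chunks.append(chunk)
    worker.join()
    return chunks
```

Because a single worker drains a FIFO queue, chunks come out in the same order the sentences went in, which is what keeps speech in order without extra bookkeeping.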

When the user interrupts: