{ "messages": [ { "role": "system", "content": "You are a helpful AI assistant with access to tools. Use tools when they help answer the user's question. Be concise and accurate." }, { "role": "user", "content": "Can you speak out loud? What TTS do you have?" }, { "role": "assistant", "content": "\nTTS tools — text-to-speech via the madcat-tts daemon. GPU-accelerated, multiple engines and voices.\n\n\nYes. The TTS system is madcat-tts — a GPU-accelerated speech synthesis daemon. Five tools:\n\n- **tts_speak** — synthesize and play speech locally. Accepts text, optional voice ID or cart tag, language override, and format (wav/mp3/flac). Can wait for playback to finish or fire-and-forget.\n- **tts_list** — list all available carts and voices with their engines, languages, and tags.\n- **tts_active** — show the currently active cart and voice.\n- **tts_switch_voice** — change the active voice by ID or cart tag.\n- **tts_health** — check daemon health, engine availability, and active cart.\n\nThe system supports multiple TTS engines:\n- **Chatterbox** — voice cloning engine, high quality\n- **Piper** — fast CPU-based synthesis, low latency\n- **XTTS** — remote high-quality synthesis\n\nVoices are organized into carts (e.g. `bt7274` cart has `bt7274-en` and `bt7274-pl` voices). Each cart can have voices in different languages and engines. The voice parameter accepts a voice ID (e.g. `bt7274-pl`), a cart tag (e.g. `bt7274` to use its default), or omit for the currently active voice." } ] }