A Docker-based OpenAI-compatible Text-to-Speech API server powered by Kyutai's TTS models with GPU acceleration support.
-
Updated
Jul 12, 2025 - Python
A Docker-based OpenAI-compatible Text-to-Speech API server powered by Kyutai's TTS models with GPU acceleration support.
An automated installation script for deploying Kyutai's Moshi STT server on macOS Apple Silicon.
Kotai is a fully local, zero-cost voice assistant that combines the power of Kyutai TTS/STT, LiveKit, and local LLMs to create natural conversational experiences.
Demo repository for Kyutai Labs' STT-1B model: Real-time speech-to-text transcription with streaming inference, built-in VAD, and Jupyter notebook examples for audio processing and simulation.
LiveKit TTS plugin with Kyutai streaming implementation
Golang bindings to Kyutai Delayed Streams Modeling Rust productions servers
A FastAPI-based Speech-to-Text service that provides OpenAI Whisper API compatibility using Kyutai's powerful STT models. This allows you to use any OpenAI Whisper client with Kyutai's models as a drop-in replacement.
A high-performance, GPU-optimized real-time speech-to-text (STT) streaming server built with WebSocket support for multiple concurrent clients. This project leverages the Kyutai STT model and is optimized for NVIDIA RTX 4090 GPUs, providing low-latency transcription for audio streams.
Working integration with Kyutai and the Omi app.
Add a description, image, and links to the kyutai topic page so that developers can more easily learn about it.
To associate your repository with the kyutai topic, visit your repo's landing page and select "manage topics."