Building a better karaoke experience - open-source tools for singers, KJs, and karaoke businesses.
Website • Generator • Song Finder • YouTube
We're creating tools to generate high-quality karaoke videos fully automatically in under 30 minutes - professional results that rival commercial karaoke tracks.
Our technology stack handles the entire pipeline:
- Audio separation - Isolate vocals from instrumentals using state-of-the-art AI models
- Lyrics transcription - Word-level synchronized lyrics with automatic correction
- Video rendering - Professional 4K karaoke videos with multiple output formats (CDG, MP4, LRC, and more)
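
Of the output formats listed above, LRC is the simplest to show: a plain-text file in which each lyric line is prefixed with a `[mm:ss.xx]` timestamp. Below is a minimal sketch of line-level LRC serialization using only the Python standard library. The `LyricLine` class and both function names are hypothetical, made up for this illustration; they are not the API of python-lyrics-transcriber or karaoke-gen.

```python
from dataclasses import dataclass


@dataclass
class LyricLine:
    """Hypothetical container for one lyric line and its start time."""
    start_seconds: float
    text: str


def format_lrc_timestamp(seconds: float) -> str:
    """Format a time in seconds as an LRC tag of the form [mm:ss.xx]."""
    total_centis = round(seconds * 100)
    minutes, remainder = divmod(total_centis, 6000)
    secs, centis = divmod(remainder, 100)
    return f"[{minutes:02d}:{secs:02d}.{centis:02d}]"


def to_lrc(lines: list[LyricLine]) -> str:
    """Serialize lyric lines as LRC text, ordered by start time."""
    ordered = sorted(lines, key=lambda line: line.start_seconds)
    return "\n".join(
        f"{format_lrc_timestamp(line.start_seconds)}{line.text}" for line in ordered
    )


if __name__ == "__main__":
    demo = [
        LyricLine(75.5, "Second line of the song"),
        LyricLine(12.34, "First line of the song"),
    ]
    print(to_lrc(demo))
```

Running the script prints the two lines in time order, tagged `[00:12.34]` and `[01:15.50]`. Word-level timing, which karaoke highlighting needs, requires a richer format such as ASS; this sketch only covers the line-level case.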
Reliably syncing lyrics to the right timestamps for any song is genuinely difficult, and no existing tool does it consistently well. We've gotten close by combining:
- Advanced audio separation models
- Proprietary and open-source transcription
- LLM-powered correction workflows
This complexity is why we offer both open-source tools (use and modify freely) and a hosted service (for those who want reliable results without the infrastructure overhead).
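
To give a feel for the correction step, here's a deliberately simple sketch that swaps misheard words for reference lyrics while keeping the transcribed timestamps. It uses `difflib.SequenceMatcher` from the Python standard library as a stand-in; it is not our LLM-powered workflow, and `correct_words` and its input shapes are assumptions made for this example.

```python
from difflib import SequenceMatcher


def correct_words(
    transcribed: list[tuple[str, float]],
    reference_words: list[str],
) -> list[tuple[str, float]]:
    """Replace misheard words with reference words, keeping timestamps.

    transcribed: (word, start_seconds) pairs from a speech-to-text model.
    reference_words: known-good lyrics split into words (hypothetical input).
    """
    heard = [word.lower() for word, _ in transcribed]
    reference = [word.lower() for word in reference_words]
    matcher = SequenceMatcher(a=heard, b=reference, autojunk=False)

    corrected: list[tuple[str, float]] = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "replace" and (i2 - i1) == (j2 - j1):
            # Same number of words on both sides: swap the text one-for-one
            # and reuse the timestamp of the word it replaces.
            for offset in range(i2 - i1):
                start = transcribed[i1 + offset][1]
                corrected.append((reference_words[j1 + offset], start))
        elif tag in ("equal", "replace", "delete"):
            # Matching words, uneven replacements, and extra transcribed
            # words are kept as heard, since no safe mapping exists.
            corrected.extend(transcribed[i1:i2])
        # "insert": reference words with no timing information are skipped.
    return corrected


if __name__ == "__main__":
    heard_words = [
        ("hold", 0.0), ("me", 0.4), ("closer", 0.8), ("tony", 1.3), ("danza", 1.7),
    ]
    print(correct_words(heard_words, "hold me closer tiny dancer".split()))
```

In this example the two misheard words are replaced and every timestamp is preserved. Real songs are much messier (repeated choruses, ad-libs, missing lines), which is where a purely mechanical alignment like this one falls short.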
| Project | Description |
|---|---|
| python-audio-separator | Easy-to-use stem separation (vocals/instrumentals) from CLI or Python, using pre-trained UVR models |
| python-lyrics-transcriber | Create synchronized lyrics files (ASS/LRC) with word-level timestamps using Whisper + LLM correction |
| karaoke-gen | Full karaoke video generation pipeline - download, separate, sync, render, upload |

| Project | Description |
|---|---|
| karaokehunt-app | Build the ideal karaoke playlist based on your music taste and friends |
| lyrics-from-genius | Fetch lyrics from Genius for songs or entire artists |
| karaoke-lyrics-processor | Process lyrics for karaoke production (line splitting, formatting) |
| flacfetch | CLI tool to fetch high-quality audio from various sources |
Everything we build is open source. We believe in community and sharing what we create.
That said, building reliable fully-automated karaoke generation requires significant compute resources (GPU for separation, LLM tokens for correction, CPU for rendering). Each track costs several dollars to generate at production quality.
So we offer both paths:
- Self-host: Run everything yourself with our open-source tools
- Hosted service: Pay a few dollars per track at gen.nomadkaraoke.com for hassle-free generation
If you can make our open-source tools work reliably without expensive APIs or LLMs, we'd especially welcome that contribution!
Contributions are very welcome! We're a small team and appreciate help from the community.
How to contribute:
- Fork the repository you want to work on
- Make your changes
- Submit a pull request
We'll review and merge promptly. All our projects use the MIT License - free for anyone to use and modify.
Ways to help:
- Improve model accuracy or performance
- Add new output formats or integrations
- Fix bugs and improve documentation
- Share your karaoke creations!
If the maintenance workload ever becomes too much, we'll ask for volunteers to share maintainership - though we don't expect that to happen.
- Questions or feedback? Open an issue on the relevant repo
- Want to discuss ideas? Book a call with Andrew
- Want to stay updated on releases? Sign up for updates
- Business inquiries? Email [email protected]