Upscale compressed audio like Sony DSEE or Samsung UHQ using AI
- AI-Powered Upscaling - Restores lost high frequencies (MP3 → Lossless-like)
- Cross-Platform - Works on Windows/macOS/Linux
- Two Interfaces
- CLI for power users.
- Streamlit GUI because it's prettier.
- Python 3.8+
- FFmpeg (for MP3 support)
sudo apt install ffmpegbrew install ffmpegchoco install ffmpeggit clone https://github.com/yourusername/audio_upscaler.git
cd audio_upscalerpip install -r requirements.txtpython -c "from demucs.pretrained import get_model; get_model(name='htdemucs')"# Single file
python main.py input.mp3 output.wavstreamlit run app.pyOpens http://localhost:8501 in your browser.
Fix: Add Python scripts folder to PATH
echo 'export PATH="$HOME/.local/bin:$PATH"' >> ~/.bashrc && source ~/.bashrc C:Users\<YOU>\AppData\Roaming\Python\Python3XX\Scripts
Edit enhance.py:
device = torch.device("cpu") # Change from "cuda"- Convert stereo files to mono first using Audacity/FFmpeg
- Trim files to <10 minutes if processing fails
audio_upscaler/
├── input/ # Default input folder
│ └── sample.mp3 # Example file
├── output/ # Enhanced audio saved here
├── src/
│ ├── enhance.py # Core AI processing
│ ├── main.py # CLI interface
│ └── app.py # Streamlit GUI
├── requirements.txt # Dependencies
└── README.md # This file
A: Typical results:
- MP3 128kbps → Sounds like ~256kbps
- MP3 320kbps → Near-CD quality
A: WAV is uncompressed. Use ffmpeg to convert to FLAC for smaller files:
ffmpeg -i output.wav -compression_level 12 output.flacA: Yes! MIT License allows any use (credit appreciated but not required).