Bisik

An AI-powered tool to evaluate and improve your pronunciation.

Features

Real-time pronunciation evaluation using AI
Word-level accuracy scoring with detailed feedback
Phonetic transcription (IPA) comparison
Visual feedback with color-coded results
Support for multiple languages
REST API for integration

How It Works

Record - Speak the provided text or your own phrase
Analyze - AI processes your speech and compares it to expected pronunciation
Get Feedback - Receive detailed word-by-word analysis with phonetic breakdowns

Supported Languages

English (en)

The system is extensible and can support additional languages by adding new phoneme converters.

Technology Stack

Backend: Flask (Python web framework)
Speech Recognition: OpenAI Whisper (state-of-the-art ASR)
Phonetic Analysis: Epitran & Panphon (IPA conversion and comparison)
ML Framework: PyTorch (for running Whisper models)
Architecture: Clean layered architecture with dependency injection

Prerequisites

Python 3.9 or higher
2-4 GB disk space (for Whisper models)
Microphone access (for recording)

Documentation

Quick Setup

Create virtual environment

python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate

Install dependencies

pip install -r requirements.txt

Setup environment

cp .env.example .env

Download models (optional, will download on first use)

python scripts/download_models.py

Initialize databases

python scripts/setup_database.py

Run the application

python app.py

The application will be available at http://localhost:3000

Usage

Web Interface

Open your browser to http://localhost:3000
Click "Start Recording" and speak the provided text
Click "Stop Recording" when finished
Click "Evaluate Pronunciation" to get your results
Review detailed feedback including:
- Overall pronunciation accuracy score
- Word-by-word comparison
- Expected vs actual phonetic transcription (IPA)
- Color-coded feedback (green = correct, yellow = close, red = needs work)

API Usage

The application provides a REST API for programmatic access:

curl -X POST http://localhost:3000/api/evaluate \
  -F "[email protected]" \
  -F "expected_text=Hello world" \
  -F "language=en"

See API Reference for complete documentation.

Testing

# Run all tests with coverage
pytest --cov=src tests/

# Run specific test file
pytest tests/unit/test_phonetics.py

# Quick integration tests
python test_upload.py
python test_evaluation.py

Production Deployment

For production environments, use gunicorn:

gunicorn -w 4 -b 0.0.0.0:3000 'web.app_factory:create_app()'

License

This project is available for educational and personal use.

Acknowledgments

OpenAI Whisper - Automatic speech recognition
Epitran - Phonetic transcription
Panphon - Phonological feature vectors

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Bisik

Features

How It Works

Supported Languages

Technology Stack

Prerequisites

Documentation

Quick Setup

Usage

Web Interface

API Usage

Testing

Production Deployment

License

Acknowledgments

About

Uh oh!

Releases 1

Packages

Contributors 3

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
config		config
databases		databases
docs		docs
scripts		scripts
src		src
tests		tests
web		web
.DS_Store		.DS_Store
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
setup.py		setup.py
test_evaluation.py		test_evaluation.py
test_upload.py		test_upload.py

silalahi/bisik

Folders and files

Latest commit

History

Repository files navigation

Bisik

Features

How It Works

Supported Languages

Technology Stack

Prerequisites

Documentation

Quick Setup

Usage

Web Interface

API Usage

Testing

Production Deployment

License

Acknowledgments

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 3

Uh oh!

Languages

Packages