Machine Learning Algorithms from Scratch

Overview

This repository is a long-term, evolving project focused on implementing Machine Learning algorithms from scratch and with optimized/library-based approaches side by side.

The main idea is not to provide a fixed, final framework, but to continuously grow a collection of ML algorithms while exploring:

How algorithms work internally (pure Python / minimal dependencies)
How performance improves using optimized numerical libraries
How these implementations compare to well-established libraries

This repository will be updated over time as new algorithms, datasets, benchmarks, and experiments are added.

Goals

Build a deep understanding of ML algorithms by implementing them manually
Compare clarity vs performance trade-offs
Keep implementations readable, educational, and reproducible
Provide lightweight benchmarks for practical comparison

Philosophy

No heavy abstractions
No premature optimization
Algorithms are implemented incrementally
Structure may evolve as the project grows

This repository is intended as a learning-oriented and experimental codebase,
not a production-ready ML framework.

Project Structure

The repository is organized by learning paradigm and algorithm family:

supervised/
- Classification algorithms (e.g. KNN)
unsupervised/
- Clustering algorithms (e.g. K-Means)
datasets/
- Synthetic datasets used for benchmarks
benchmarks/
- Lightweight benchmark scripts for performance comparison

Each algorithm typically includes:

A pure Python implementation
An optimized (NumPy-based) implementation
A scikit-learn reference wrapper (when applicable)

All implementations aim to expose a similar API for fair comparison.

Installation & Environment Setup (Windows)

For convenience and reproducibility, this repository provides a Windows batch installer that automates environment setup.

The installer will:

Detect all Python installations available in PATH
Let you choose which Python version to use
Create a local virtual environment (.venv)
Upgrade pip and install project dependencies if requirements.txt is present

How to run the installer

From the repository root:

Double-click on Installer.bat:
Run it from Command Prompt using:
```
Installer.bat
```

You will be prompted to select a Python version if multiple versions are installed.

Activating the virtual environment

After the installer finishes:

call .venv\Scripts\activate

Benchmarks

Each algorithm directory contains benchmark scripts that compare:

Pure Python implementation
Optimized NumPy implementation
Scikit-learn reference implementation

Benchmarks are intentionally lightweight and focus on:

Relative performance
Algorithmic differences
Implementation overhead

They are not intended as definitive performance measurements.

Status

🚧 Actively evolving — expect frequent changes, refactors, and new additions.

Notes

Some sections or directories may be empty or incomplete at times
APIs are not guaranteed to be stable
Benchmarks focus on relative comparison, not absolute performance
This repository prioritizes learning clarity over production robustness

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
supervised		supervised
unsupervised		unsupervised
Installer.bat		Installer.bat
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Machine Learning Algorithms from Scratch

Overview

Goals

Philosophy

Project Structure

Installation & Environment Setup (Windows)

How to run the installer

Activating the virtual environment

Benchmarks

Status

Notes

License

About

Uh oh!

Languages

License

AliJ-Official/ML-Algorithms

Folders and files

Latest commit

History

Repository files navigation

Machine Learning Algorithms from Scratch

Overview

Goals

Philosophy

Project Structure

Installation & Environment Setup (Windows)

How to run the installer

Activating the virtual environment

Benchmarks

Status

Notes

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages