This repository contains the materials for the PyData London 2025 workshop: How To Measure And Mitigate Unfair Bias in Machine Learning Models.
AI tools used in hiring can unintentionally perpetuate discrimination on the basis of protected characteristics such as age, gender, and ethnicity, leading to significant real-world harm. This workshop provides a practical, hands-on approach to addressing bias in machine learning models, using the example of AI-powered hiring tools.
In this workshop, we will:
- Generate a synthetic dataset of CVs for software engineers, with controlled distributions across gender and race.
- Train a biased model on this dataset to understand how machine learning systems can perpetuate discrimination.
- Evaluate fairness metrics to identify and measure bias in the model across different demographic groups.
- Apply bias mitigation techniques using the Fairlearn library to address the discovered unfairness (a sketch of the measure-and-mitigate loop follows this list).
- Compare the trade-offs between model performance and fairness across different mitigation strategies.
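
As a preview, here is a minimal, self-contained sketch of that measure-then-mitigate loop using Fairlearn and scikit-learn. The toy data, the proxy feature, and the group labels below are illustrative assumptions, not the workshop's actual CV dataset or model:

```python
# Illustrative sketch only: toy data and group labels stand in for the
# workshop's synthetic CV dataset.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from fairlearn.metrics import MetricFrame, selection_rate
from fairlearn.reductions import ExponentiatedGradient, DemographicParity

rng = np.random.default_rng(0)
n = 1_000
X = rng.normal(size=(n, 5))                         # stand-in CV features
group = rng.choice(["group_a", "group_b"], size=n)  # stand-in sensitive feature
X[:, 1] += (group == "group_a")                     # proxy feature leaking group membership
# Inject bias: candidates in group_a are more likely to be labelled "hire".
y = ((X[:, 0] + 0.8 * (group == "group_a")
      + rng.normal(scale=0.5, size=n)) > 0.5).astype(int)

X_tr, X_te, y_tr, y_te, g_tr, g_te = train_test_split(X, y, group, random_state=0)

# 1. Train a baseline model; it picks up the bias via the proxy feature.
baseline = LogisticRegression().fit(X_tr, y_tr)
y_pred = baseline.predict(X_te)

# 2. Measure fairness: compare selection rates per demographic group.
mf = MetricFrame(
    metrics={"selection_rate": selection_rate},
    y_true=y_te, y_pred=y_pred, sensitive_features=g_te,
)
print("Before mitigation:\n", mf.by_group)

# 3. Mitigate: retrain under a demographic-parity constraint.
mitigator = ExponentiatedGradient(
    estimator=LogisticRegression(), constraints=DemographicParity()
)
mitigator.fit(X_tr, y_tr, sensitive_features=g_tr)
y_fair = mitigator.predict(X_te)

mf_fair = MetricFrame(
    metrics={"selection_rate": selection_rate},
    y_true=y_te, y_pred=y_fair, sensitive_features=g_te,
)
print("After mitigation:\n", mf_fair.by_group)
```

Because `ExponentiatedGradient` retrains the base estimator under the fairness constraint, expect some loss of accuracy in exchange for more equal selection rates; exploring that trade-off is the point of the final exercise.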
By the end of the session, participants will be equipped with the knowledge and tools to tackle bias in their own projects and ensure fairer AI systems.
```bash
git clone https://github.com/john-sandall/fairness-tales-workshop
cd fairness-tales-workshop
```

Choose your preferred package manager:
**Poetry**

```bash
poetry install
```

**pip**

```bash
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```

**uv**

```bash
uv venv
uv pip install -r pyproject.toml --all-extras
```

To generate the synthetic CV data, you need an OpenAI API key.
```bash
cp .env.example .env
```

Then edit the `.env` file to add your API key:

```
OPENAI_API_KEY="sk-..."
```
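
For reference, here is a minimal sketch of how a notebook might pick up the key and call the OpenAI API. It assumes the `python-dotenv` package is installed; the model name and prompt are placeholders, not the workshop's actual settings:

```python
import os

from dotenv import load_dotenv  # assumes python-dotenv is installed
from openai import OpenAI

load_dotenv()  # reads OPENAI_API_KEY from .env into the environment

client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
response = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model name
    messages=[{"role": "user", "content": "Write a short CV for a software engineer."}],
)
print(response.choices[0].message.content)
```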
The workshop consists of two main notebooks:
- `notebooks/1 - Generate CVs.ipynb`: Creates a synthetic dataset of CVs
- `notebooks/2 - Model.ipynb`: Demonstrates bias detection and mitigation techniques
To run the notebooks:

```bash
jupyter lab
```
- pre-commit: `pre-commit run --all-files --hook-stage=manual`
- poetry sync: `poetry install --with dev`
This project is licensed under the MIT License - see the LICENSE file for details.
