This repository contains a Flask-based API that serves a fine-tuned GPT-2 (355M) language model for clinical reasoning tasks. The fine-tuning was inspired by concepts from Sebastian Raschka's book "Build a Large Language Model (From Scratch)" and applied to the dataset from the Zindi Kenya Clinical Reasoning Challenge.
This project started as a way to put into practice some of the ideas I learned while studying how instruction fine-tuning works at a low level. I decided to revisit a clinical reasoning dataset from a past hackathon and experiment with applying GPT-2 (355M) to it. The model was trained and wrapped into a simple Flask API so that it could be queried directly or integrated into other applications.
During testing, I evaluated the responses using Microsoft Phi-3.5-mini-instruct, and the fine-tuned model reached an automated conversational benchmark score of 68.33 on the test data. While not state-of-the-art, the results showed that the approach worked reasonably well and provided a solid hands-on learning experience. To make interaction easier, I also built a sample dashboard that connects to the API and lets users try out the model through a simple interface.
- Python (≥3.8)
- pip (Python package manager)
- Navigate to your desired project directory:

  ```bash
  cd path/to/your/project
  ```

- Create a virtual environment and activate it:

  On Windows (Command Prompt):

  ```bash
  python -m venv myenv
  myenv\Scripts\activate.bat
  ```

  On macOS/Linux:

  ```bash
  python -m venv myenv
  source myenv/bin/activate
  ```
- Clone this repository:

  ```bash
  git clone https://github.com/BetikuOluwatobi/clinical-instruct-api.git
  ```

- Navigate into the repository:

  ```bash
  cd clinical-instruct-api
  ```

- Install dependencies:

  ```bash
  python -m pip install -r requirements.txt
  ```
- Prepare model weights:
  - Inside the repo, navigate to the `static` directory.
  - Create a folder named `weights`.
  - Download the model weights from the Google Drive and place them into the `weights` directory.

  ⚠️ If you rename the file, update the `name` variable in `app.py`.
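Before starting the server, it can save a failed launch to confirm the weights file is where `app.py` expects it. A minimal sketch follows; the filename used here is a placeholder and should match the `name` variable in `app.py`:

```python
from pathlib import Path

def weights_ready(base_dir=".", filename="gpt2-medium355M-sft.pth"):
    """Return True if the weights file sits under static/weights/.

    `filename` is a hypothetical default -- replace it with the
    actual name configured in app.py.
    """
    path = Path(base_dir) / "static" / "weights" / filename
    return path.is_file()

if __name__ == "__main__":
    print("Weights found" if weights_ready() else "Weights missing")
```

Running this from the repository root prints whether the file is in place, so a missing or misnamed download is caught before the Flask app tries to load it.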
Start the Flask server on port 3000 (or any port of your choice):

```bash
flask --app app run --port 3000
```

The server will output the localhost address, e.g.:

```
http://127.0.0.1:3000/
```

Visiting that address will display the API documentation page.
You can test the API directly in your browser:

```
http://127.0.0.1:3000/instruct?prompt={your_text_prompt}&max_num_tokens={num_of_tokens_to_generate}&temperature=1&top_k=5
```

Example:

```
http://127.0.0.1:3000/instruct?prompt=Explain+the+symptoms+of+malaria&max_num_tokens=200&temperature=1&top_k=5
```
Using curl:

```bash
curl "http://127.0.0.1:3000/instruct?prompt=Explain+the+symptoms+of+malaria&max_num_tokens=200&temperature=1&top_k=5"
```

Using Python:

```python
import requests

url = "http://127.0.0.1:3000/instruct"
params = {
    "prompt": "Explain the symptoms of malaria",
    "max_num_tokens": 200,
    "temperature": 1,
    "top_k": 5,
}

response = requests.get(url, params=params)
print(response.json())
```

A simple dashboard app is available for visualizing and interacting with the API: Clinical-Instruct Dashboard
This dashboard queries the API on port 3000 by default.
- Model inference on CPU is slow (≈2 minutes per response for 100 tokens).
- For best performance, run on a machine with GPU support.
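Since CPU inference is the bottleneck, it helps to check up front whether a GPU will actually be used. Assuming the server runs on PyTorch (as in Raschka-style GPT-2 fine-tuning; the stack is not restated here), a quick check looks like:

```python
import torch

# PyTorch falls back to CPU silently when CUDA is unavailable,
# so print the device that inference will actually run on.
device = "cuda" if torch.cuda.is_available() else "cpu"
print(f"Inference device: {device}")
```

If this prints `cpu` on a machine that has an NVIDIA GPU, the installed PyTorch build likely lacks CUDA support and should be reinstalled with it.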