Phentrieve

Phentrieve is an advanced AI-powered system for mapping clinical text to Human Phenotype Ontology (HPO) terms using a Retrieval-Augmented Generation (RAG) approach. It supports multiple languages and offers robust tools for benchmarking, text processing, and HPO term retrieval.

For comprehensive documentation, please visit the Phentrieve Documentation Site.

Key Features

Multilingual HPO term mapping using state-of-the-art embedding models
Advanced text processing pipeline including semantic chunking and assertion detection
Extensive benchmarking framework for model evaluation and comparison
User-friendly interfaces: CLI, FastAPI backend, and Vue.js frontend
Support for cross-encoder re-ranking to improve retrieval precision

Benchmark Results

Performance on 570 German clinical terms (BioLORD-2023-M model):

Retrieval Mode	MRR	Hit@1	Hit@10	Ont Sim@1
Single-vector	0.695	55.8%	94.0%	79.9%
Multi-vector (all_max)	0.892	84.0%	97.4%	91.9%

+28% MRR improvement with multi-vector retrieval using label, synonym, and definition embeddings.

Quick Start

Install Phentrieve using pip:

pip install phentrieve

For detailed setup and usage instructions, including Docker deployment, please see our Getting Started Guide.

Basic Usage

# Launch interactive query mode
phentrieve query --interactive

# Process clinical text to extract HPO terms
phentrieve text process "The patient exhibits microcephaly and frequent seizures."

Discover more commands and options in the User Guide.

Docker Deployment

Deploy Phentrieve using Docker Compose for production environments:

# Linux: Setup volume permissions (required)
sudo ./scripts/setup-docker-volumes.sh

# macOS/Windows: No setup needed, skip to next step

# Start services
docker-compose up -d

# Access the application
# - API: http://localhost:8000
# - Frontend: http://localhost:8080

For detailed deployment instructions, security best practices, and troubleshooting, see the Docker Deployment Guide.

Full Documentation | Contributing Guide | License

Name		Name	Last commit message	Last commit date
Latest commit History 392 Commits
.github		.github
api		api
data		data
docs		docs
frontend		frontend
phentrieve		phentrieve
plan		plan
scripts		scripts
tests		tests
.dockerignore		.dockerignore
.env.docker.template		.env.docker.template
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.test.yml		docker-compose.test.yml
docker-compose.yml		docker-compose.yml
mkdocs.yml		mkdocs.yml
phentrieve.yaml		phentrieve.yaml
phentrieve.yaml.template		phentrieve.yaml.template
pyproject.toml		pyproject.toml
setup_phentrieve.sh		setup_phentrieve.sh
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Phentrieve

Key Features

Benchmark Results

Quick Start

Basic Usage

Docker Deployment

About

Uh oh!

Releases 3

Packages

Uh oh!

Uh oh!

Contributors 5

Uh oh!

Languages

License

berntpopp/phentrieve

Folders and files

Latest commit

History

Repository files navigation

Phentrieve

Key Features

Benchmark Results

Quick Start

Basic Usage

Docker Deployment

About

Topics

Resources

License

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Uh oh!

Contributors 5

Uh oh!

Languages

Packages