Skip to content
#

mlx-lm

Here are 21 public repositories matching this topic...

A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-friendly solution for running MLX-based vision and language models locally with an OpenAI-compatible interface.

  • Updated Feb 2, 2026
  • Python

This is the part 2 of a GraphRAG system, in which the user interacts with the data through 2 data structures: vector database (Chroma DB) and graph database (Neo4j). I exploit the probabilism of vector embedding and the determinism of a knowledge graph to minimise hallucination and maximise explainability. The domain: ‘Electronic’ music genre.

  • Updated Jan 21, 2026
  • Python

Prompt LLM Bench is a platform that discovers compatible Hugging Face models on-the-fly, runs reproducible multi-model evaluations, and recommends the optimal prompt–LLM pair based on accuracy, latency, and resource efficiency.

  • Updated Jan 29, 2026
  • TypeScript

Improve this page

Add a description, image, and links to the mlx-lm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mlx-lm topic, visit your repo's landing page and select "manage topics."

Learn more