
LLM Knowledge Retrieval System (RAG)

An AI-powered document Q&A system: upload PDFs, ask natural-language questions, and get answers with source citations, powered by Retrieval-Augmented Generation (RAG).


Demo

Upload a PDF → Ask a question → Get an answer + cited snippets

(Add screenshots / short GIF here)


Key Features

  • ✅ Upload PDF documents
  • ✅ Ask natural-language questions
  • ✅ Answers include source citations (document + chunk)
  • ✅ Persistent vector store (data survives restart)
  • ✅ Simple REST API (FastAPI)
  • ✅ Optional lightweight web UI

Tech Stack

  • Backend: FastAPI (Python)
  • Vector DB: ChromaDB
  • Embeddings: OpenAI text-embedding-3-small
  • LLM: OpenAI gpt-4o-mini

How It Works (RAG Pipeline)

Indexing

  1. PDF → Text extraction
  2. Chunking (split into small overlapping text blocks)
  3. Embeddings (convert chunks into vectors)
  4. Store vectors + metadata in ChromaDB
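Step 2 (chunking) can be sketched in plain Python. This is an illustrative version, not the repository's actual implementation; the `chunk_size` and `overlap` values are assumed defaults, and a real pipeline might split on tokens or sentences instead of raw characters:

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into overlapping character chunks.

    Each chunk starts `chunk_size - overlap` characters after the previous
    one, so adjacent chunks share `overlap` characters of context.
    """
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks
```

Overlap keeps sentences that straddle a chunk boundary retrievable from both sides, at the cost of slightly more embeddings to store.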

Querying

  1. User question → Embedding
  2. Similarity search in ChromaDB (top-k chunks)
  3. Retrieved chunks → injected as context into the LLM prompt
  4. LLM generates answer + citations
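Step 2 of querying (similarity search) is handled by ChromaDB in the real system, but the principle can be shown with a pure-Python sketch. The `store` structure and function names here are hypothetical stand-ins for the vector store populated at indexing time:

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors (1.0 = identical direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k_chunks(query_vec: list[float],
                 store: list[tuple[str, list[float]]],
                 k: int = 3) -> list[str]:
    """Return the k chunk texts whose stored vectors are closest to the query.

    `store` is a list of (chunk_text, embedding_vector) pairs.
    """
    ranked = sorted(store,
                    key=lambda item: cosine_similarity(query_vec, item[1]),
                    reverse=True)
    return [text for text, _ in ranked[:k]]
```

The retrieved chunk texts are then concatenated into the LLM prompt as context, with their document and chunk identifiers carried along so the answer can cite them. A production vector DB replaces this linear scan with an approximate nearest-neighbor index.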

Future Improvements

  • Support more file types (DOCX, TXT)
  • Better chunking strategies
  • Conversation memory
  • Multiple document queries
