🛡️ RAG Poisoning Lab — Educational AI Security Exercise

This repository contains a hands-on lab environment for learning RAG (Retrieval-Augmented Generation) data poisoning attacks, detection techniques, and mitigation strategies. It is designed for students, researchers, and security practitioners who want practical experience with adversarial manipulation of AI retrieval systems.

⚠️ Educational Purpose Only
This lab is intended strictly for learning, research, and training.
Do not use these techniques on any system you do not own or do not have explicit permission to test.

📘 Overview

The lab demonstrates how a RAG system can be poisoned by injecting malicious documents into a vector database. You will:

Build a simple RAG pipeline using FAISS and Ollama
Launch a Streamlit interface to query clean documents
Execute poisoning attacks by ingesting malicious files
Detect suspicious chunks using perplexity + similarity scoring
Apply mitigations such as keyword filters and state resets

This lab mirrors real-world RAG risks seen in enterprise AI applications.

🚀 Quick Start (Local Setup)

Clone the Repository

git clone https://github.com/r00tb3/RAG-Poisoning-Lab.git && cd RAG-Poisoning-Lab

Create and Activate Virtual Environment

python3 -m venv venv
source venv/bin/activate

Install Dependencies

pip install -r requirements.txt --prefer-binary --no-cache-dir

Install and Start Ollama

curl -fsSL https://ollama.com/install.sh | sh
ollama pull llama3:8b-instruct-q4_K_M
ollama pull nomic-embed-text
ollama serve    # keep this running

Run the Streamlit App

streamlit run rag_app/app_streamlit.py --server.port 8000

Open your browser and visit: http://localhost:8000
Folder Structure

rag-poisoning-lab/
│── rag_app/
│   ├── app_streamlit.py      # Main RAG UI
│   ├── knowledge_base.py     # Ingestion + mitigation logic
│   └── detection.py          # Perplexity + similarity detection
│
├── documents/                # Clean knowledge base (15 files)
├── poisoned_docs/            # Malicious payloads (injection, bias, leakage)
├── requirements.txt
└── README.md

🧪 What You Will Learn ?

🔴 Attack

Poisoning via malicious document ingestion
How semantic similarity causes poisoned chunks to be retrieved

🔍 Detect

Perplexity scoring for unnatural or adversarial text
Embedding similarity to identify outliers in vector space

🛡️ Mitigate

Keyword filtering during ingestion
Rebuilding FAISS index to purge poisoned content
Validating clean behavior after mitigation
These techniques are essential for securing real-world AI applications.

📜 Disclaimer

This project is provided for educational, academic, and training purposes only. Do not use any part of this repository to attack systems without explicit written permission. The authors assume no liability for misuse.

⭐ Contributing Suggestions and improvements are welcome.

You can submit issues or pull requests to expand:

Additional poisoning techniques
New detection modules
Hardening strategies for RAG pipelines

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🛡️ RAG Poisoning Lab — Educational AI Security Exercise

📘 Overview

🚀 Quick Start (Local Setup)

🧪 What You Will Learn ?

📜 Disclaimer

⭐ Contributing Suggestions and improvements are welcome.

You can submit issues or pull requests to expand:

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
documents		documents
poisoned_docs		poisoned_docs
rag_app		rag_app
README.md		README.md
requirements.txt		requirements.txt

r00tb3/RAG-Poisoning-Lab

Folders and files

Latest commit

History

Repository files navigation

🛡️ RAG Poisoning Lab — Educational AI Security Exercise

📘 Overview

🚀 Quick Start (Local Setup)

🧪 What You Will Learn ?

📜 Disclaimer

⭐ Contributing Suggestions and improvements are welcome.

You can submit issues or pull requests to expand:

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages