Structured-RAG

Metadata-Enriched-RAG with auto-generated tags for improved query generation

Overview

This project implements a containerized .NET solution that uses Docker Model Runner (running Gemma 3) to generate optimized tags for entities stored in a SQL Server database. The tags are designed for Retrieval-Augmented Generation (RAG) workflows, where the LLM analyzes user queries, selects relevant tags, and filters entities before performing vector search.

Features

Automated Tag Generation: Uses LLM (Docker Model Runner) to generate relevant tags for entities
Tag Consistency: Considers existing tags to maintain consistency across the system
RAG Query Processing: LLM-driven tag selection for optimal entity retrieval
Containerized Architecture: Fully containerized with Docker Compose
EF Core with SQL Server: Entity Framework Core for database management

Architecture

The solution consists of three main components:

SQL Server: Stores entities and their generated tags
Docker Model Runner: Provides LLM capabilities for tag generation and query processing
.NET Application: Orchestrates tag generation and RAG query processing

Prerequisites

Docker Desktop or Docker Engine with Docker Compose
At least 8GB RAM available for containers
10GB free disk space (for model storage)

Getting Started

1. Clone the Repository

git clone https://github.com/timonkrebs/Structured-RAG.git
cd Structured-RAG

2. Start the Services

docker-compose up --build

This will:

Start SQL Server
Build and run the .NET application
Seed sample data
Generate tags for entities
Run a demo RAG query

3. Monitor the Logs

Watch the application logs to see:

Database initialization
Sample data seeding
Tag generation progress
RAG query demonstration

Project Structure

Structured-RAG/
├── StructuredRAG.Api/
│   ├── Models/
│   │   ├── Entity.cs              # Entity model
│   │   └── Tag.cs                 # Tag model
│   ├── Data/
│   │   └── ApplicationDbContext.cs # EF Core DbContext
│   ├── Services/
│   │   ├── DockerModelRunnerService.cs     # LLM interaction service
│   │   ├── TagGenerationService.cs # Tag generation logic
│   │   └── RagQueryService.cs     # RAG query processing
│   ├── Program.cs                  # Application entry point
│   └── appsettings.json           # Configuration
├── Dockerfile                      # Application container
├── docker-compose.yml              # Multi-container orchestration
└── README.md

How It Works

Tag Generation Process

Load Entities: Retrieves entities from SQL Server using EF Core
Get Existing Tags: Fetches all existing tags to maintain consistency
Generate Tags: Uses Docker Model Runner (Gemma 3) to generate 3-7 optimized tags per entity
Store Tags: Saves new tags to the database

RAG Query Process

Receive Query: User submits a natural language query
Load Available Tags: Retrieves all tags from the database
Select Relevant Tags: LLM analyzes query and selects relevant tags
Filter Entities: Queries entities that match the selected tags
Generate Response: LLM generates answer based on filtered entities

Configuration

Database Connection

Edit appsettings.json or set environment variable:

{
  "ConnectionStrings": {
    "DefaultConnection": "Server=sqlserver;Database=StructuredRAG;User Id=sa;Password=YourStrong@Passw0rd;TrustServerCertificate=True"
  }
}

Docker Model Runner Endpoint

Edit appsettings.json or set environment variable:

{
  "DockerModelRunner": {
    "Endpoint": "http://model-runner.docker.internal/engines/llama.cpp/v1"
  }
}

Customization

Adding New Entities

Modify Program.cs to add your own entities:

var entities = new[]
{
    new Entity
    {
        Name = "Your Entity Name",
        Content = "Your entity content...",
        CreatedAt = DateTime.UtcNow
    }
};

Adjusting Tag Generation

Modify TagGenerationService.cs to customize:

Number of tags generated
Tag generation prompt
Tag selection criteria

Changing the Model

Edit docker-compose.yml to use a different model in the models section:

models:
  llm:
    model: ai/granite-4.0-nano:latest

Update appsettings.json to configure the model name:

"DockerModelRunner": {
  "Endpoint": "http://model-runner.docker.internal/engines/llama.cpp/v1",
  "Model": "ai/granite-4.0-nano:latest"
}

Security Note: Always use pinned version tags (e.g., ai/granite-4.0-nano:latest) or SHA256 digests instead of mutable tags like :latest to ensure reproducible builds and prevent potential security issues from upstream changes.

Stopping the Application

docker-compose down

To remove volumes (data will be lost):

docker-compose down -v

Troubleshooting

Container Startup Issues

If containers fail to start, ensure:

Docker has enough resources (8GB RAM minimum)
Port 1433 is available
No other SQL Server instances are running

Database Connection Issues

Verify SQL Server is healthy:

docker-compose ps
docker-compose logs sqlserver

License

See LICENSE file for details.

Contributing

Contributions are welcome! Please open an issue or submit a pull request.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
StructuredRAG.Core		StructuredRAG.Core
StructuredRAG.Example		StructuredRAG.Example
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
DEVELOPMENT.md		DEVELOPMENT.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SOLUTION_SUMMARY.md		SOLUTION_SUMMARY.md
StructuredRAG.sln		StructuredRAG.sln
docker-compose.yml		docker-compose.yml
start.sh		start.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Structured-RAG

Overview

Features

Architecture

Prerequisites

Getting Started

1. Clone the Repository

2. Start the Services

3. Monitor the Logs

Project Structure

How It Works

Tag Generation Process

RAG Query Process

Configuration

Database Connection

Docker Model Runner Endpoint

Customization

Adding New Entities

Adjusting Tag Generation

Changing the Model

Stopping the Application

Troubleshooting

Container Startup Issues

Database Connection Issues

License

Contributing

About

Uh oh!

Releases

Packages

Contributors 4

Uh oh!

Languages

License

timonkrebs/Structured-RAG

Folders and files

Latest commit

History

Repository files navigation

Structured-RAG

Overview

Features

Architecture

Prerequisites

Getting Started

1. Clone the Repository

2. Start the Services

3. Monitor the Logs

Project Structure

How It Works

Tag Generation Process

RAG Query Process

Configuration

Database Connection

Docker Model Runner Endpoint

Customization

Adding New Entities

Adjusting Tag Generation

Changing the Model

Stopping the Application

Troubleshooting

Container Startup Issues

Database Connection Issues

License

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Uh oh!

Languages

Packages