
Multimodal agent using Deepgram and Groq LPUs, with Sentence Transformers for vector embeddings, ChromaDB for persistent vector database storage, and BM25 with ColBERT reranking for hybrid search in RAG context management


RodneyFinkel/groq_deepgram_agent


AI RAG Agent demo

This demo showcases a multimodal AI RAG agent that uses Speech-To-Text (STT) and Text-To-Speech (TTS) for LLM interactions, powered by Deepgram and Groq LPUs.

Sentence Transformers builds vector embeddings for the user message and the uploaded documents. Cosine similarity between those embeddings identifies the most relevant documents, which are used for LLM context management. The agent provides dense and sparse retrieval pipelines with hybrid search options, plus BM25 search with ColBERT reranking.
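To make the retrieval terms above concrete, here is a minimal, dependency-free sketch of dense ranking by cosine similarity, sparse ranking by BM25, and one common way to merge the two rankings. It is not the repository's implementation: the function names are hypothetical, the embeddings are assumed to be plain lists of floats already produced by an embedding model, the fusion step uses reciprocal rank fusion as an assumed stand-in for the hybrid search option, and ColBERT reranking is not covered.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def dense_rank(query_vec, doc_vecs):
    """Document indices sorted by descending cosine similarity to the query."""
    scores = [cosine_similarity(query_vec, v) for v in doc_vecs]
    return sorted(range(len(doc_vecs)), key=lambda i: scores[i], reverse=True)


def bm25_scores(query_tokens, docs_tokens, k1=1.5, b=0.75):
    """Okapi BM25 score of every tokenized document for the query tokens."""
    if not docs_tokens:
        return []
    n_docs = len(docs_tokens)
    avg_len = (sum(len(d) for d in docs_tokens) / n_docs) or 1.0
    # Document frequency: in how many documents each term occurs.
    df = Counter()
    for doc in docs_tokens:
        df.update(set(doc))
    scores = []
    for doc in docs_tokens:
        tf = Counter(doc)
        score = 0.0
        for term in query_tokens:
            if term not in tf:
                continue
            # Non-negative IDF variant (the "+ 1" inside the log).
            idf = math.log((n_docs - df[term] + 0.5) / (df[term] + 0.5) + 1.0)
            denom = tf[term] + k1 * (1 - b + b * len(doc) / avg_len)
            score += idf * tf[term] * (k1 + 1) / denom
        scores.append(score)
    return scores


def sparse_rank(query_tokens, docs_tokens):
    """Document indices sorted by descending BM25 score."""
    scores = bm25_scores(query_tokens, docs_tokens)
    return sorted(range(len(docs_tokens)), key=lambda i: scores[i], reverse=True)


def reciprocal_rank_fusion(rankings, k=60):
    """Merge several rankings; each document earns 1 / (k + rank) per ranking."""
    fused = Counter()
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            fused[doc_id] += 1.0 / (k + rank)
    return sorted(fused, key=lambda d: fused[d], reverse=True)
```

A hybrid query would call dense_rank and sparse_rank on the same document set and pass both results to reciprocal_rank_fusion. Fusing ranks rather than raw scores avoids having to put cosine similarities and BM25 scores on a common scale.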

Transcription sessions are stored through a SQLAlchemy/ChromaDB database connection.

The demo streams both STT and TTS to reduce response latency.
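One way streaming can cut perceived latency is to hand each finished sentence to the TTS engine while the LLM is still generating the rest. The sketch below illustrates that idea only. It assumes the LLM output arrives as an iterable of text chunks, and the helper name sentences_from_stream is hypothetical and not taken from the repository.

```python
import re

# Split after sentence-ending punctuation that is followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def sentences_from_stream(chunks):
    """Yield complete sentences as soon as they appear in a stream of text chunks."""
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        parts = _SENTENCE_END.split(buffer)
        # Every part except the last is a finished sentence.
        for sentence in parts[:-1]:
            if sentence:
                yield sentence
        # The last part may still be growing, so keep it buffered.
        buffer = parts[-1]
    # Flush whatever is left when the stream ends.
    if buffer.strip():
        yield buffer.strip()
```

Each yielded sentence could be sent to a TTS call immediately, so audio playback begins after the first sentence instead of after the full response.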

INSTALLATION

macos:

  1. brew install ffmpeg portaudio
  2. pip install -r requirements.txt

windows powershell:

  1. Download the FFmpeg package:
    cd C:
    curl -L -o ffmpeg-release-essentials.zip https://www.gyan.dev/ffmpeg/builds/ffmpeg-release-essentials.zip

  2. Extract the FFmpeg package:
    powershell -command "Expand-Archive -Path .\ffmpeg-release-essentials.zip -DestinationPath C:\ffmpeg"

  3. Add FFmpeg to the system PATH:
    setx /M PATH "%PATH%;C:\ffmpeg\ffmpeg-\bin"
    Replace ffmpeg- with the actual version directory inside C:\ffmpeg (e.g., ffmpeg-5.1-essentials_build).

LAUNCH FLASK WEB APP:


python3 alpha_app2.py

Toggle the sidebar to open the AI RAG agent.


CLI: python3 Quickagent.py

Create a .env file with the following keys:

GROQ_API_KEY = ""
DEEPGRAM_API_KEY = ""

MAIL_USERNAME = ""
MAIL_PASSWORD = ""
MAIL_DEFAULT_SENDER = ""

OPENWEATHER_API_KEY = ""
X-Api-Key =
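Before launching, it can help to confirm that the .env file parses and that the keys you need are filled in. The following is a small standard-library sketch for that check. It is an assumption about a convenient workflow, not how the app itself loads its configuration, and treating GROQ_API_KEY and DEEPGRAM_API_KEY as the required pair is also an assumption. The helper names parse_env and missing_keys are hypothetical.

```python
# Assumption: these two keys are the minimum needed for the voice agent.
REQUIRED_KEYS = ("GROQ_API_KEY", "DEEPGRAM_API_KEY")


def parse_env(text):
    """Parse KEY = "value" lines into a dict, skipping blanks and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values, required=REQUIRED_KEYS):
    """Return the required keys that are absent or empty."""
    return [key for key in required if not values.get(key)]
```

For example, reading the file with open(".env").read(), passing the text to parse_env, and then calling missing_keys on the result gives the list of keys that still need a value.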
