Skip to content
View muradali4442's full-sized avatar

Block or report muradali4442

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
muradali4442/README.md

Hi, I'm Murad Ali 👋

AI engineer in Ilmenau, Germany. I build production systems for search, question answering, speech and document understanding. I care about turning messy files like invoices and PDFs into clean data and about shipping services that are fast, stable and easy to observe.

EmailLinkedIn • Location: Ilmenau, DE


What I focus on

  • LLMs and retrieval with grounded answers, citations and simple chunking that fits the data
  • Speech and document AI with Whisper, Wav2Vec2, OCR and layout aware models for invoices and tables
  • Production work with FastAPI, Docker, CI and vLLM along with quantization, batching and streaming
  • Safety and evaluation with PII scrubbing, refusal rules, golden sets, EM and F1, RAGAS, A B tests and tracing

Impact highlights

  • Clinical assistant pilots cut lookup time from minutes to seconds and reduced post visit documentation by about 25 to 35 percent
  • Medical speech models improved word error rate by 6 to 8 points and kept real time behavior stable
  • Invoice and document extraction reached over 90 percent F1 on vendor, VAT, IBAN and totals and lowered manual review
  • Serving became about 30 percent faster at the median by using lower precision models and token streaming

Selected projects

  • Shift AI - LLM powered shift scheduling with constraint aware planning and a FastAPI backend
  • AI Agents Automation - Multi agent system that files and routes work, posts updates to chat and looks up fixes from a knowledge base
  • Early Dyslexia Detection - Multimodal screening that combines handwriting analysis and speech features
  • Invoice DocAI OCR and layout models with an LLM fallback for hard cases plus batching and caching
  • Clinical Consultation Assistant grounded answers with citations on Vertex AI and Cloud Run with PHI scrubbing

Tech I use

Python, C++FastAPILangChain, LangGraph, LlamaIndexvLLMFAISS, pgvector
Whisper, Wav2Vec2Tesseract, DocTR, LayoutLMv3, DonutGCP, Vertex AIPostgreSQLDocker, CIGit, Jira

How I work

I value readable code, clear APIs and measurable impact. I ship in small steps, watch the graphs and iterate. Open to collaboration and feedback. Issues and PRs are welcome.

Pinned Loading

  1. thesis_extractor thesis_extractor Public

    Use text + tables from PDFs for RAG (BM25 + LLM).

    Python 1