
Ashish Surve’s GitHub Portfolio

A product-focused Data Scientist with over 6 years of experience building and shipping ML systems for CPG, retail, and real-estate domains. Proven expertise in demand forecasting, anomaly detection, and trade-promotion optimization, having trained 300 000+ models on multi-TB data to deliver $2.4 M in client savings. End-to-end ownership across problem framing, distributed model development, API deployment, monitoring, and stakeholder enablement.


🚀 Summary

  • Domains: CPG · Retail · Real Estate
  • Impact: 300 000+ models · $2.4 M savings · 100 M+ rows/day pipelines
  • End-to-End: Framing · Feature Engineering · Model Training · Deployment · Monitoring · Enablement


🛠️ Core Skills

Machine Learning & Analytics: Forecasting · Time Series · Anomaly Detection · Trade Promotion Optimization · Product ML · Model Monitoring

Languages & Frameworks: Python · SQL · PySpark · Pandas · NumPy · scikit-learn · PyTorch · TensorFlow · FastAPI · Streamlit

Data & Infrastructure: Databricks · Spark · Dask · MongoDB · MySQL · Azure · AWS · Docker · CI/CD · Git · Hyperparameter Tuning · Distributed Training

NLP & LLMs: RAG · Vector Stores · Semantic Search · OpenAI · Hugging Face

Tools & Libraries: SHAP · PyOD · Vaex · Plotly · pre-commit


📂 Featured Repositories

| Repository | Description |
| --- | --- |
| resume-builder | AI-powered resume optimization for ATS compatibility and job matching, using LLMs and semantic parsing. |
| Comparison_Segmentation_models | Comparative study of segmentation models, served via FastAPI and Streamlit, Dockerized. |
| FaceMatcher | Face recognition and matching utility using Python CV libraries. |
| Optical-Character-Recognition | Benchmarking multiple OCR libraries for accuracy and performance. |
| Projects | Sandbox for miscellaneous personal and experimental projects. |
| Linux_Programming | C language examples and projects for Linux system programming. |
| Learn_Python | Curated Python learning exercises and examples. |
| Learn_C-CPP | C/C++ data structures implementations and project assignments. |
| Machine-Learning | Collection of Python notebooks covering classic ML algorithms and tutorials. |
| remote_car | Control a car via Wi-Fi, Bluetooth, and computer vision. |
| InterFusion_updated | Fork of KDD’21 “Multivariate Time Series Anomaly Detection and Interpretation” with hierarchical embedding. |
| MNIST_AutoEncoders | Autoencoder architectures for MNIST digit compression and reconstruction. |
| Segmentation_frontend | Streamlit-based frontend for segmentation use cases (Heroku-deployed). |
| Segmentation_backend | Backend API for segmentation model inference, decoupled service design. |
| style-transfer | FastAPI + Streamlit web app for neural style transfer, Docker-ready. |
| ashish-surve.github.io | Personal website codebase built with JavaScript, HTML, SCSS, and CSS. |
| Data-Science--Cheat-Sheet | Forked collection of cheat sheets covering core data science concepts and commands. |

📈 Impact Highlights

  • Trained 300 000+ models on multi-TB datasets, yielding $2.4 M in client savings.
  • Scaled forecasting pipelines to process 100 M+ rows per day.
  • Reduced anomaly detection false positives by 35% through advanced monitoring.

🏗️ Workflow & Process

  1. Problem Framing & Data Ingestion
  2. Distributed Feature Engineering (PySpark/Databricks)
  3. Model Development & Training (scikit-learn, PyTorch, TensorFlow)
  4. Hyperparameter Tuning & Automated CI/CD
  5. API Deployment (FastAPI + Docker)
  6. Monitoring & Alerting (Prometheus, Custom Dashboards)
  7. Stakeholder Enablement
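
As a rough illustration of steps 3 and 6, the sketch below trains one independent model per series and then flags new observations that deviate strongly from it. It uses only the Python standard library. The mean/standard-deviation "model", the z-score threshold, the function names, and the `sku_a`/`sku_b` data are all assumptions made for this example; they stand in for the real forecasting and monitoring stack and are not taken from any repository listed here.

```python
from statistics import mean, stdev


def fit_mean_forecaster(history):
    """Fit a trivial per-series model: the historical mean and sample std."""
    return {"mean": mean(history), "std": stdev(history)}


def train_per_series(series_by_key):
    """Train one independent model per series key (hypothetical SKU ids)."""
    return {key: fit_mean_forecaster(values) for key, values in series_by_key.items()}


def flag_anomalies(model, observations, z_threshold=3.0):
    """Return indices of observations whose absolute z-score exceeds the threshold."""
    if model["std"] == 0:
        # A constant history has no spread, so any different value is flagged.
        return [i for i, x in enumerate(observations) if x != model["mean"]]
    return [
        i
        for i, x in enumerate(observations)
        if abs(x - model["mean"]) / model["std"] > z_threshold
    ]


# Hypothetical toy data: two short demand histories.
series_by_key = {
    "sku_a": [10.0, 12.0, 11.0, 13.0, 12.0],
    "sku_b": [100.0, 98.0, 102.0, 101.0, 99.0],
}

models = train_per_series(series_by_key)

# New observations to monitor; the value 30.0 is far outside sku_a's history.
alerts = {
    "sku_a": flag_anomalies(models["sku_a"], [11.0, 30.0, 12.0]),
    "sku_b": flag_anomalies(models["sku_b"], [100.0, 101.0]),
}
print(alerts)
```

Because each series is fitted independently, the dictionary comprehension in `train_per_series` is the part that a distributed engine would typically parallelize when the number of series grows large.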

📫 Contact

Let’s leverage data to drive impactful solutions!
