Skip to content
View suryaprakash-sp's full-sized avatar

Block or report suryaprakash-sp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
suryaprakash-sp/README.md

πŸ‘‹ Hi, I'm Surya Prakash Manubolu

Data Analyst & Engineer | Building Production-Scale Data Systems

Transforming complex data into actionable insights through scalable pipelines, interactive dashboards, and intelligent automation

LinkedIn Email Portfolio

Profile views

πŸš€ About Me

I'm a Data Analyst with 3+ years of experience delivering high-impact data solutions in fast-paced EdTech environments. I specialize in designing end-to-end ETL pipelines, building interactive dashboards, and creating automation systems that drive business decisions.

πŸ’‘ What I Do Best

  • πŸ“Š ETL Pipeline Architecture: Built 20+ production pipelines processing millions of rows daily
  • πŸ“ˆ Dashboard Engineering: Created 30+ Metabase dashboards with 80% faster load times
  • πŸ€– Data Automation: Eliminated 20+ hours of manual work weekly through intelligent automation
  • πŸ—„οΈ Database Optimization: Reduced pipeline count by 50% through analytics-ready schema redesign

🎯 Impact Metrics

πŸ“¦ Data Infrastructure      β†’ Supporting 25,000+ active students
⚑ Performance Optimization β†’ 80% reduction in dashboard query time
πŸ”„ Pipeline Efficiency     β†’ 50% reduction in ETL pipeline count (40+ β†’ 20)
⏱️ Automation Savings       β†’ 20 hours of manual work eliminated weekly
πŸ’Ύ Data Processing         β†’ Millions of rows processed daily

πŸ› οΈ Tech Stack

Languages & Core Tools

Python SQL

Databases

MySQL PostgreSQL MongoDB

BI & Visualization

Metabase Power BI Excel Google Sheets

Data Engineering & Libraries

Pandas SQLAlchemy Selenium

Tools & Platforms

Git API Integration ETL Pipelines


πŸ’Ό Current Role - Masai School

Data Analyst | Oct 2024 – Present | Bangalore

Building and managing data infrastructure that powers decision-making for 25,000+ students:

  • πŸ—οΈ Infrastructure Management: End-to-end data infrastructure across 50+ courses
  • πŸ“Š Dashboard Engineering: 30+ Metabase dashboards with 80% improved load times
  • πŸ€– Automation Systems: Google Docs API/Gmail API automation saving 20 hours weekly
  • πŸ—„οΈ Schema Optimization: Redesigned database schema reducing ETL pipelines by 50%
  • πŸ”„ Production Pipelines: 20 ETL pipelines processing millions of rows daily (MySQL/MongoDB β†’ PostgreSQL)

🎯 Featured Projects

πŸ† Production ETL Pipeline Infrastructure

Tech Stack: Python β€’ MySQL β€’ MongoDB β€’ PostgreSQL

A comprehensive data pipeline infrastructure that transformed data operations at scale:

  • βœ… Developed 20 production ETL pipelines processing millions of rows daily
  • βœ… Migrated data from MySQL/MongoDB to PostgreSQL for analytics
  • βœ… Reduced dashboard query time by 80% through optimized data modeling
  • βœ… Cut pipeline count by 50% (40+ β†’ 20) while maintaining analytical depth
  • βœ… Automated data validation and error handling for production reliability

Impact: Powers analytics for 25,000+ students with near-real-time data availability

πŸ“Š Student Lifecycle Dashboard

Tech Stack: Python β€’ Metabase β€’ PostgreSQL

Comprehensive dashboard system providing 360Β° visibility into student performance:

  • βœ… Engineered 30+ interactive Metabase dashboards using Python
  • βœ… Improved dashboard load times by 80% through query optimization
  • βœ… Provides 100% visibility into student lifecycle for 40+ business users
  • βœ… Real-time tracking of enrollment, engagement, and performance metrics
  • βœ… Custom KPI calculations and automated reporting workflows

Impact: Enabled data-driven decision making across all departments

πŸ’° Business Finance Analytics Tool

Tech Stack: Python β€’ Google Sheets β€’ API Integration

Automated financial tracking system for business intelligence:

  • βœ… Built Python automation with session-based authentication
  • βœ… Extracted data from myBillBook platform automatically
  • βœ… Monitored INR 1M+ inventory across 400+ products
  • βœ… Tracked pricing trends, sales performance, and P&L metrics
  • βœ… Real-time dashboards updated via Google Sheets API

Impact: Eliminated manual data entry and provided real-time financial insights

πŸ€– WhatsApp Messaging Automation

Tech Stack: Python β€’ Google Sheets β€’ WATI API

Scalable communication automation system for student engagement:

  • βœ… Created automated WhatsApp messaging system using WATI API
  • βœ… Integrated with Google Sheets and Forms for data management
  • βœ… Enabled instant and one-click message delivery at scale
  • βœ… Supported targeted campaigns for student engagement
  • βœ… Reduced manual communication time by 90%

Impact: Streamlined communication with thousands of students

πŸ” Personal Food Ordering Analysis

Tech Stack: Python β€’ Selenium β€’ Excel β€’ Google Sheets

Personal analytics project demonstrating web scraping and visualization skills:

  • βœ… Deployed web scraping solution using Selenium
  • βœ… Extracted 200+ personal orders from Swiggy platform
  • βœ… Constructed interactive dashboard visualizing ordering patterns
  • βœ… Analyzed spending trends and food preferences
  • βœ… Built automated data collection pipeline

Impact: Showcased end-to-end data analysis and automation capabilities


πŸ“ˆ GitHub Stats

Contribution Stats Profile Summary
Activity Graph

πŸŽ“ Education

πŸŽ“ Masai School - Data Analyst Bootcamp Apr 2024 – Sep 2024

πŸŽ“ CVR College of Engineering - B.Tech in Electronics & Communication Engineering 2018 – 2022 | Hyderabad


πŸ’» Skills Breakdown

Data Engineering

  • ⚑ ETL Pipeline Design & Development
  • πŸ—„οΈ Database Schema Design & Optimization
  • πŸ”„ Data Modeling & Transformation
  • πŸ“Š Data Warehouse Architecture
  • πŸ”— API Integration & Web Scraping

Analytics & BI

  • πŸ“ˆ Dashboard Development
  • πŸ“Š Data Visualization & Storytelling
  • 🎯 KPI Design & Metric Tracking
  • πŸ“‰ Reporting Automation
  • πŸ’‘ Business Intelligence Strategy

🌟 What Makes Me Different

Most data analysts stop at insights. I go further:

πŸ”„ End-to-End Ownership πŸ“Š Production-First Mindset ⚑ Performance Obsessed
From raw data extraction to dashboard deployment Code that runs in production, not just notebooks 80% faster queries, 50% fewer pipelines

My Approach

Problem β†’ Understand the business need, not just the data request
Design β†’ Build scalable solutions that handle edge cases
Deploy β†’ Ship production-ready code with error handling
Optimize β†’ Continuously improve performance and efficiency

Bottom line: I build data systems that teams depend on, not just one-off analyses.


🀝 Let's Connect!

I'm always interested in discussing:

  • πŸ’‘ Data engineering architecture & best practices
  • πŸš€ ETL pipeline optimization strategies
  • πŸ“Š Dashboard design & BI tool selection
  • πŸ€– Data automation opportunities
  • πŸ’Ό Collaboration on data-driven projects

LinkedIn Email Portfolio Phone


"Data is the new oil, but insights are the refined fuel that drives decisions."

Popular repositories Loading

  1. AMD_Architects_064- AMD_Architects_064- Public

    This project involves extracting, cleaning, and analyzing data from the 1mg homeopathic medicine website. Key stages include web scraping, data cleaning, analysis using SQL/NoSQL, and visualization…

    Python 1 2

  2. portfolio portfolio Public

    Modern interactive portfolio showcasing data engineering and analytics expertise. Built with React, TypeScript, and AI-powered chat assistant.

    TypeScript 1

  3. swadha-automation swadha-automation Public

    Python 1

  4. suryaprakash-sp suryaprakash-sp Public

    TypeScript

  5. chartdb chartdb Public

    Forked from chartdb/chartdb

    Database diagrams editor that allows you to visualize and design your DB with a single query.

    TypeScript

  6. mybillbook_scrape mybillbook_scrape Public

    Python