Skip to content
View MDavidHernandezP's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report MDavidHernandezP

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MDavidHernandezP/README.md

👋 Hi there, I’m Mario David Hernández Pantoja

🚀 Data Engineer | Cloud Engineer
🎓 Bachelor in Data Engineering
🌎 Mexico | Open to Remote & International Opportunities

Currently a Data Engineer willing to gain more professional experience in the industry, but already working various skills in the field, including Cloud Computing, ETL Pipelines, DevOps and Data Analytics/Science. My experience spans AWS, Azure, and GCP, combining infrastructure as code with modern data engineering practices to deliver reliable and cost-efficient systems.


🌐 Connect with Me

LinkedIn GitHub Credly


🧠 Expertise

💡 Data Engineering & Analytics
I specialize in building end-to-end data pipelines, from ingestion to analytics-ready datasets. I have hands-on experience with batch and streaming workloads, data warehousing, and data lake architectures, ensuring data quality and reliability at scale.

☁️ Cloud & DevOps
I apply DevOps principles to data systems, leveraging Infrastructure as Code, containerization, and CI/CD concepts. I’ve worked extensively with cloud services to migrate on-premise infrastructures and optimize operational costs.

⚙️ Automation & Reliability
I focus on automation-first solutions, reducing manual processes through scripting, orchestration, and workflow scheduling. I value clean architectures, observability, and reproducibility.


🛠️ Tech Stack

👨‍💻 Programming & Scripting

Python SQL Bash R C

☁️ Cloud Platforms

AWS Azure Google Cloud

🧱 Data & Big Data

Airflow Databricks Snowflake Spark Kafka Hadoop DBT

🐳 DevOps & Tooling

Docker Kubernetes Terraform Ansible Jenkins GitLab GitHub Actions

🗄️ Databases & Storage Systems

PostgreSQL MySQL SQL Server Oracle SQLite MariaDB

MongoDB Cassandra DynamoDB Milvus

Amazon RDS Amazon Timestream Azure SQL Cloud SQL


🚀 Featured Projects

📊 Data Warehouse Pipeline ETL

Airflow · Python · Docker · SQL

  • Built a multi-source ETL pipeline integrating MSSQL, MySQL, PostgreSQL, Oracle, and SQLite
  • Automated ingestion into a centralized Data Warehouse
  • Designed with scalability for DBT modeling and data quality frameworks

🔗 https://github.com/MDavidHernandezP/DataWarehousePipelineETL


🛒 Online Store Web Scraper API

FastAPI · MongoDB · Docker · Power BI

  • Developed an API to automate web data extraction
  • Stored and managed semi-structured data with MongoDB
  • Performed data cleaning, visualization, and analytics

🔗 https://github.com/MDavidHernandezP/OnlineStoreWebScraperAPI


🎓 Certifications

  • AWS Academy Cloud Architecting
  • AWS Academy Data Engineering
  • Databricks Fundamentals
  • CCNA: Introduction to Networks
  • Google Cloud Computing Foundations

🔗 Full list on Credly:
https://www.credly.com/users/mario-david-hernandez-pantoja/badges

Popular repositories Loading

  1. OnlineStoreWebScraperAPI OnlineStoreWebScraperAPI Public

    This repository is a Web Scraper of Mercado Libre that saves the data in a MongoDB and can be accessed by requesting an API, then the data is sent to PowerBI to make the processes of Data Cleaning,…

    Python 1

  2. ProjectoEstanciaCinvestav ProjectoEstanciaCinvestav Public

    Este repositorio fue creado con el proposito de guardar el trabajo realizado por el Dr. Juan Iván Gómez Peralta, mi compañero Luis Arturo Michel Pérez y su servidor, en nuestro projecto de estancia…

    Jupyter Notebook 1

  3. FrontEndMiniProjects FrontEndMiniProjects Public

    This repository contains all the Fron-end mini projects I have done for practicing and fun.

    CSS 1

  4. PersonalRepository PersonalRepository Public

    Repository to save my CVs and all my certificates of courses and things like that.

  5. CProgrammingCodes CProgrammingCodes Public

    Repository to upload my C codes from my first programming subject.

    C

  6. MLFaceMaskDetectorTeachableMachine MLFaceMaskDetectorTeachableMachine Public

    This repository was made to save an old project from Preparatory/Highschool in which I made a Facemask detector in python using a page called Teachable Machine which creates you a Machine Learning …

    Python