This repository showcases my hands-on data science work from the IBM Data Science Professional Certificate, organized by module. The strongest end-to-end projects are highlighted first.
Folder: 9. Applied Data Science Capstone/
Goal: analyze and predict Falcon 9 first-stage landing outcome using historical launch data.
What I did:
- Performed EDA using SQL + Python
- Built visualizations to uncover patterns and drivers of success
- Ran geospatial analysis with Folium (launch sites and outcome patterns)
- Built an interactive dashboard with Plotly Dash
- Trained and evaluated models to predict mission outcome Skills/tools: Python, SQL, EDA, data visualization, Folium, Dash, ML classification, model evaluation
Folder: 4. Python Project for Data Science/
Goal: extract and compare company revenue against stock price for Tesla and GameStop.
What I did:
- Extracted revenue data (API-based data acquisition)
- Cleaned and prepared time-series style data for comparison
- Built a dashboard-style visualization comparing stock price vs revenue Skills/tools: Python, data extraction (API), data cleaning, visualization/dashboarding
Folder: 6. Data Analysis with Python/
Goal: perform structured EDA on real estate data and build a reusable analysis workflow.
What I did:
- Data wrangling and preparation for analysis
- Outlier identification and descriptive statistics
- Built and evaluated a pipeline (preprocessing + modeling workflow) Skills/tools: Python, pandas, EDA, statistics, preprocessing pipelines, evaluation
Folder: 8. Machine Learning with Python/
Goal: build a machine learning classifier to predict rainfall.
What I did:
- Feature engineering and dataset preparation
- Built an end-to-end classification pipeline
- Optimized the model using GridSearchCV (cross-validation)
- Evaluated, refined, and compared model performance Skills/tools: scikit-learn, pipelines, GridSearchCV, cross-validation, classification metrics, feature engineering
- Python for data work: wrangling, EDA, feature prep, pipelines
- SQL analytics: querying, joins, aggregation for exploration and insight
- Visualization & dashboards: charts, geospatial mapping (Folium), interactive dashboards (Dash)
- Machine learning: classification workflows, tuning, evaluation, iteration
- End-to-end delivery: problem framing → data acquisition → analysis/modeling → communication
1. Data Science Methodology/— problem framing and analytic approach2. Tools for Data Science/— DS tooling and workflow foundations3. Python for Data Science, AI & Development/— Python foundations applied to DS4. Python Project for Data Science/— Tesla vs GameStop revenue dashboard project5. Databases and SQL for Data Science with Python/— SQL + Python for analytics6. Data Analysis with Python/— EDA + pipeline-based analysis (real estate analytics)7. Python for Data Visualization/— visualization techniques and storytelling8. Machine Learning with Python/— rainfall prediction classifier + tuning/evaluation9. Applied Data Science Capstone/— SpaceX Falcon 9 outcome analysis + prediction10. Generative AI. Elevate your Data Science career/— GenAI concepts for DS workflows11. Data Scientist career guide and interview preparation/— career/interview materials
GitGuideCommands.pdf— Git command guidePython Cheat Sheet - The Basics Coursera.pdfPython cheat sheet working with data.pdfPython_reference_sheet.pdf
Portfolio/educational use. Please attribute appropriately and respect any upstream course/lab licensing where applicable.