Skip to content

Cloud ETL pipeline for LendingClub 2018Q4 loan data using Azure Databricks (Spark), ADLS Gen2, and Azure SQL. Includes notebooks, PySpark modules, and SQL scripts.

License

Notifications You must be signed in to change notification settings

AtharvaPatil-Data/Azure-Databricks-ETL-Loan-Pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Azure Databricks ETL Loan Pipeline

Cloud ETL pipeline for LendingClub 2018Q4 loan data using Azure Databricks (Spark), ADLS Gen2, and Azure SQL.
Includes a single Colab/Databricks notebook, project report, and an Azure setup walkthrough with screenshots.

Why it’s useful: cleans messy, semi‑structured finance fields (e.g., "36 months", % rates), standardizes types, derives “good/bad” loan flags, and lands curated data in Azure SQL for BI/analytics.

About

Cloud ETL pipeline for LendingClub 2018Q4 loan data using Azure Databricks (Spark), ADLS Gen2, and Azure SQL. Includes notebooks, PySpark modules, and SQL scripts.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published