Loan Denied Project

Built several Neural Network and Machine Learning models (ANN, RandomFordst, and XGboost) to predict whether the bank should deny the loan application
Data processing to transform data and several Feature Engineering methods to fill with columns that have NA values.

Code and Resourced Used

Python 3.8
Packages: pandas, numpy, seaborn, matplotlib, sklearn, Xgboost, keras
[Xgboost parameter] (https://xgboost.readthedocs.io/en/latest/parameter.html)

Data Preprocessing

drop several columns which already contained in other feature or is not important (Name, LoanNr, State, City, ApprovalFY, DisbursementDate, BalanceGross, SBA_Appv, daysterm, xx)
Normalize or standardize to transform the numeric columns (NoEmp, CreateJob, RetainedJob, DisbursementGross, GrAppv)
Feature engineering with some columns (Zip, ApprovalDate, Term, FranchiseCode, RevLineCr, LowDoc)
1. Zip: only take second and third number since the first numbers is State information, which is all same. The second and third number means smaller region.
2. ApprovalDate: it is 5 numbrer format. So first change it to Y-m-d format, then take year and month information
3. Term: group 60, 84, 120, 240, 300 days together since it has a significantly larger amount of people apply at these durations, and also their MIS_Status situation is the same. I also Separate 36 days as a single category since it also has significantly more people apply at that duration compare to other days but it has different MIS_Status with the group above.
1. FranchiseCode: Seperate FranchiseCode = 1 and 2 as two category since there are much more people in these two group compare to others. group other FranchiseCode as one group
1. RevLineCr: it have some wrong value in the data set (0, T), so treat them as Null value
2. LowDoc: it have some wrong value in the data set (0, S, A), so treat them as Null value
one hot encoding for categorical paramete
Built Xgboost model to predict BankState, LowDoc, and RevLineCr Null value
- The reason for this predict order is because BankState and LowDoc have much lower null value (3, 8) compare to RevLineCr (786)

Fit and Predict models

ANN
RandomForest model
Xgboost model

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
images		images
Loan_Denied_Project.ipynb		Loan_Denied_Project.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Loan Denied Project

Code and Resourced Used

Data Preprocessing

Fit and Predict models

About

Uh oh!

Releases

Packages

Languages

FrankDTS/Loan-Denied-Project

Folders and files

Latest commit

History

Repository files navigation

Loan Denied Project

Code and Resourced Used

Data Preprocessing

Fit and Predict models

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages