Predict Developer Salary Using Gradient Boosting Regressor

This project aims to predict the salary of developers using the Stack Overflow Developer Survey 2022 dataset. The prediction model is built using the Gradient Boosting Regressor (GBR) algorithm.

Introduction

Salaries are predicted from survey features such as professional experience, location, and education level. A Gradient Boosting Regressor (GBR) is used because boosted tree ensembles capture non-linear relationships and feature interactions in tabular data without heavy manual feature engineering.

Dataset

The Stack Overflow Developer Survey 2022 dataset contains respondents' demographics, education, experience, and salaries. It can be downloaded from the Stack Overflow Developer Survey website.
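As a minimal sketch of how the survey file might be loaded, the helper below reads only the columns of interest and drops respondents who did not report a salary. The function name `load_survey` and the column names (`Country`, `EdLevel`, `YearsCodePro`, `ConvertedCompYearly`) are assumptions made for illustration; the notebook may select different columns.

```python
import pandas as pd

# Assumed column names; adjust them to match the columns used in the notebook.
TARGET = "ConvertedCompYearly"
FEATURES = ["Country", "EdLevel", "YearsCodePro"]


def load_survey(source, features=FEATURES, target=TARGET):
    """Read the selected survey columns and keep only rows with a salary."""
    df = pd.read_csv(source, usecols=[*features, target])
    # Rows without a target value cannot be used for supervised training.
    return df.dropna(subset=[target]).reset_index(drop=True)
```

`source` can be a path to the downloaded CSV file or any file-like object accepted by `pandas.read_csv`.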

Libraries and Tools

The following libraries and tools are used in this project:

Python 3.x
Jupyter Notebook
Pandas
NumPy
Scikit-learn
Matplotlib
Seaborn

Data Preprocessing

Data preprocessing steps include:

Handling missing values
Encoding categorical variables
Feature scaling
Train-test split
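The steps above can be sketched with scikit-learn as follows. This is an illustration under assumptions, not the notebook's exact code: the toy DataFrame, its column names, and the chosen imputation strategies are all hypothetical stand-ins for the survey data.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.impute import SimpleImputer
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Toy stand-in for the survey data; names and values are illustrative only.
df = pd.DataFrame({
    "YearsCodePro": [1.0, 5.0, np.nan, 10.0, 3.0, 8.0, 15.0, 2.0],
    "Country": ["A", "B", "A", None, "B", "C", "A", "C"],
    "EdLevel": ["BSc", "MSc", "BSc", "PhD", "BSc", "MSc", "PhD", "BSc"],
    "Salary": [40000, 65000, 50000, 90000, 48000, 80000, 120000, 45000],
})
num_cols = ["YearsCodePro"]
cat_cols = ["Country", "EdLevel"]

# Missing categories get an explicit label (a constant fill leaks no statistics).
df[cat_cols] = df[cat_cols].fillna("Unknown")

X = df.drop(columns="Salary")
y = df["Salary"]

# Split first so that the imputer and scaler are fitted on training data only.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)

numeric = Pipeline([
    ("impute", SimpleImputer(strategy="median")),  # fill missing numbers
    ("scale", StandardScaler()),                   # feature scaling
])
preprocess = ColumnTransformer(
    [
        ("num", numeric, num_cols),
        ("cat", OneHotEncoder(handle_unknown="ignore"), cat_cols),
    ],
    sparse_threshold=0,  # always return a dense array
)

X_train_t = preprocess.fit_transform(X_train)
X_test_t = preprocess.transform(X_test)
```

Tree-based models such as GBR are insensitive to monotonic rescaling of features, so the scaling step mainly matters when other estimators are compared on the same pipeline.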

Model Training

The Gradient Boosting Regressor (GBR) model is trained on the preprocessed dataset. Hyperparameter tuning is performed to optimize the model's performance. The key steps involved in model training are:

Importing the necessary libraries and dataset
Data cleaning and preprocessing
Splitting the dataset into training and testing sets
Training the GBR model
Hyperparameter tuning using Grid Search
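A minimal sketch of training and grid search is shown below. It runs on synthetic data from `make_regression` so that it is self-contained; the parameter grid, the scoring metric, and the number of folds are assumptions for illustration rather than the values used in the notebook.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic regression data standing in for the preprocessed survey features.
X, y = make_regression(n_samples=200, n_features=5, noise=10.0, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Hypothetical search space; a real search would likely be wider.
param_grid = {
    "n_estimators": [50, 100],
    "learning_rate": [0.05, 0.1],
    "max_depth": [2, 3],
}

search = GridSearchCV(
    GradientBoostingRegressor(random_state=0),
    param_grid,
    cv=3,                                # 3-fold cross-validation on the training set
    scoring="neg_mean_absolute_error",   # higher (closer to zero) is better
)
search.fit(X_train, y_train)

# The best estimator is refit on the full training set by default.
best_model = search.best_estimator_
print("Best parameters:", search.best_params_)
```

Cross-validation runs only on the training split, which keeps the test set untouched for the final evaluation.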

Evaluation

The model's performance is evaluated using Mean Absolute Error (MAE), Mean Squared Error (MSE), and the R-squared score. The evaluation includes:

Calculating the performance metrics on the test set
Visualizing the actual vs predicted salaries
Analyzing feature importance
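The metric calculation and feature importance ranking can be sketched as follows. The example again uses synthetic data and placeholder feature names (`feature_0`, `feature_1`, and so on), which are assumptions for illustration; plotting is omitted to keep the sketch short.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score
from sklearn.model_selection import train_test_split

# Synthetic data with placeholder feature names.
X, y = make_regression(n_samples=200, n_features=5, noise=10.0, random_state=0)
feature_names = [f"feature_{i}" for i in range(X.shape[1])]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

model = GradientBoostingRegressor(random_state=0).fit(X_train, y_train)
y_pred = model.predict(X_test)

# Metrics are computed on the held-out test set only.
mae = mean_absolute_error(y_test, y_pred)
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)
print(f"MAE: {mae:.2f}  MSE: {mse:.2f}  R^2: {r2:.3f}")

# Impurity-based importances from the fitted model, highest first.
importances = sorted(
    zip(feature_names, model.feature_importances_),
    key=lambda pair: pair[1],
    reverse=True,
)
for name, score in importances:
    print(f"{name}: {score:.3f}")
```

MAE is expressed in the same unit as the target (salary), which often makes it easier to interpret than MSE.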

Results

The results are analyzed and visualized to show how well the model performs and which features matter most when predicting developer salaries. The notebook reports:

The performance metrics of the model
Feature importance plot
Actual vs predicted salary plot

Usage

To use this notebook, follow these steps:

Clone the repository:

git clone https://github.com/AmirHosseinSoleymani/Pred-Developer-Salary-With-GBR-model-.git

Navigate to the project directory:

cd Pred-Developer-Salary-With-GBR-model-

Open the Jupyter Notebook:

jupyter notebook programmer-salaray-pred-stackoverflowdataset.ipynb

Follow the instructions in the notebook to run the code and reproduce the results.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request if you have any improvements or suggestions.

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Acknowledgements

We would like to thank Stack Overflow for providing the dataset used in this project.

