Welcome to the Data Science Job Market Insights repository! 🎉 This project dives into the data science job market using data scraped from Indeed.com. We’ve cleaned, analyzed, and visualized the data to uncover insights about job trends, required skills, and salary distributions across the United States.
- Scraped Data Science job listings from Indeed.com.
- Collected details like job titles, required skills, salaries,education required and locations and more.
- Processed and cleaned raw data collected by scraping .
- Perfromed Data engineering by extracting and creating new formatted data columns .
- Created interactive dashboards in Tableau to visualize insights:
- Top skills required by top employers
- Skill distribution across salary ranges
- Salary distribution analysis
- Education Level demand
- Geographic distribution of on-site job requirements
- Dashboard
- Python
- Tableau (for visualization)
- Google Colab / Jupyter Notebook
- Visual Studio Code or any python compiler of choice
- Required Python libraries:
selenium,pandas,numpy
- Clone the repository:
git clone https://github.com/Arian-Rahman/Data-Science-Job-Market-Insights.git
- Navigate to the project directory:
cd data-science-job-market-insights - Create virtual env (windows)
python -m venv envDsInsights
- Activate the new env
.\envDsInsights\Scripts\activate
- Install required Python libraries:
pip install -r requirements.txt
-
Data Scraping:
- Run
scrap_v4.01.pyto collect data from Indeed.com .
- Run
-
Data Cleaning:
- Run
Final_P01.03_data_cleaning.ipynbon Colab or Jupyter Notebook for data cleaning.
- Run
-
Data Analysis & Visualization:
- Open Tableau and load
Data Science Job Viz from Indeed.com.twbxto explore the dashboard. - Click on the data section and import the data from data folder
- Open Tableau and load
