Skip to content

Exploratory Data Analysis (EDA) of retail sales using Python (pandas, matplotlib, seaborn). Includes data cleaning, visualizations, and business insights.

Notifications You must be signed in to change notification settings

BlladeRunner/sales-analysis-pandas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“ˆ Sales Analysis Project (Python + EDA)

πŸ“Š Project Overview

This project is an Exploratory Data Analysis (EDA) of the Superstore Sales dataset using Python.
The goal is to analyze sales performance, identify trends, and visualize key business metrics.

It covers sales growth over time, profitability by product and region, and the relationship between sales and profit.

πŸ“¦ Dataset: Superstore Sales Dataset


πŸ› οΈ Tools & Libraries

  • Python 3.11
  • pandas β€” data manipulation and cleaning
  • numpy β€” numerical analysis
  • matplotlib & seaborn β€” data visualization
  • Jupyter Notebook β€” interactive analysis environment

❓ Key Questions

  1. How have sales changed over time?
  2. Which products and categories bring the most profit?
  3. Which region is the most profitable?
  4. What are the best-selling subcategories?
  5. Is there a correlation between sales and profit?

🧠 Key Insights

  • πŸ“ˆ Sales show consistent growth over time.
  • 🧾 Office Supplies and Technology are the most profitable categories.
  • 🌎 The West region generated the highest profit margin.
  • πŸ“Š Sales and Profit have a positive correlation, with some noticeable outliers.
  • πŸ’‘ Discounts above 30 % negatively affect profit margins.

🧩 Project Structure

Sales_Analysis_Project/ β”œβ”€ data/ β”‚ └─ superstore_sales.csv β”œβ”€ notebooks/ β”‚ └─ sales_analysis.ipynb β”œβ”€ images/ β”‚ β”œβ”€ plots.png β”‚ └─ dashboard_preview.png └─ README.md


βš™οΈ How to Run

  1. Clone this repository
    git clone https://github.com/BlladeRunner/Sales_Analysis_Project.git
    cd Sales_Analysis_Project 
    python -m venv .venv
    .\.venv\Scripts\activate
    pip install -r requirements.txt
    jupyter notebook

Then open sales_analysis.ipynb and run all cells.

πŸ’Ό Business Relevance

  • The analysis provides insights that help retail and e-commerce managers:
  • Identify the most profitable product segments and regions.
  • Optimize discount strategies to maximize profit.
  • Forecast sales trends to support inventory and marketing decisions.

🧰 Skills Demonstrated

Category Details
Data Analysis Cleaning, aggregation, grouping, correlation
Visualization Matplotlib, Seaborn, trend analysis
Business Analytics Profitability, AOV, discount effect
Tools Python, Jupyter, GitHub

πŸ”™ Back to Portfolio

About

Exploratory Data Analysis (EDA) of retail sales using Python (pandas, matplotlib, seaborn). Includes data cleaning, visualizations, and business insights.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published