CICDDoS2019 — Exploitative Attack Data Preprocessing & Analysis

A reproducible notebook for preprocessing exploitative attack data from the CICDDoS2019 dataset.
This repo streamlines how I handle, clean, and structure this dataset for downstream ML/DL experiments on DDoS detection.

Highlights

Loads CICDDoS2019 exploitative attack flows
Cleans feature set, encodes categorical variables
Balances class distribution with stratified splits
Provides baseline EDA plots and class stats
Exports clean CSV for training anomaly detection models

Notebook: notebooks/Preprocessing_Exploitative_Attack_Data_from_the_CICDDOS2019_Dataset.ipynb

Dataset

Source: CICDDoS2019 (Canadian Institute for Cybersecurity)
Download the original dataset from CIC and place CSVs in data/
Update notebook paths if needed

Environment (Python)

python -m venv .venv
source .venv/bin/activate   # Windows: .venv\Scripts\activate
pip install -r requirements.txt
# Launch Jupyter
pip install notebook
jupyter notebook

Requirements

pandas
numpy
scikit-learn
matplotlib

Project Structure

cicddos2019-preprocessing/
├─ notebooks/
│  └─ Preprocessing_Exploitative_Attack_Data_from_the_CICDDOS2019_Dataset.ipynb
├─ docs/
├─ scripts/
├─ .gitignore
├─ LICENSE
├─ README.md
└─ requirements.txt

Roadmap

Add helper script to preprocess without Jupyter
Add visualizations for class imbalance
Add ML baselines on processed data

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CICDDoS2019 — Exploitative Attack Data Preprocessing & Analysis

Highlights

Dataset

Environment (Python)

Requirements

Project Structure

Roadmap

License

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
notebooks		notebooks
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

License

hconyeka/cicddos2019-preprocessing

Folders and files

Latest commit

History

Repository files navigation

CICDDoS2019 — Exploitative Attack Data Preprocessing & Analysis

Highlights

Dataset

Environment (Python)

Requirements

Project Structure

Roadmap

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages