Brighthive Mock Data Generator

This repository contains scripts to generate synthetic datasets that support Brighthive demos and storytelling for various use cases. The generated data is designed to be realistic and representative of real-world scenarios while maintaining privacy and security.

Purpose

The mock data generator serves several key purposes:

Create realistic datasets for demonstration purposes
Support storytelling and use case presentations
Enable testing and development without using real production data
Provide consistent, reproducible data for demos and training

Getting Started

Prerequisites

Python 3.7 or higher
Required Python packages (install using pip):
```
pip install -r requirements.txt
```

Directory Structure

brighthive-mock-data/
├── documentation/     # Documentation files
├── output/           # Generated data files
├── scripts/          # Data generation scripts
└── requirements.txt  # Python dependencies

Generating Datasets

Step-by-Step Example: Generating CRM Data

Create a virtual environment:
```
python -m venv venv
```
Activate the virtual environment:
```
source venv/bin/activate
```
Install the required dependencies:
```
pip install -r requirements.txt
```
Run the data generation script. Here is an example for CRM data:
```
python scripts/generate_crm_data.py
```
The script will:
- Create an output directory if it doesn't exist
- Generate synthetic CRM data with realistic fields
- Save the data as a CSV file in the output directory with the current date
- The output file will be named crm_data_MM-DD.csv
Verify the generated data:
- Check the output directory for the new CSV file

Available Datasets

The repository includes generators for various types of data:

CRM data
Healthcare data
Financial data
Student data
Web analytics data
Health devices data

Each script generates data specific to its domain while maintaining realistic relationships and patterns.

Contributing

When adding new data generators:

Follow the existing code structure and patterns
Include appropriate documentation
Use realistic data ranges and distributions
Ensure data privacy and security
Add your script to this README's "Available Datasets" section

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
output		output
scripts		scripts
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Brighthive Mock Data Generator

Purpose

Getting Started

Prerequisites

Directory Structure

Generating Datasets

Step-by-Step Example: Generating CRM Data

Available Datasets

Contributing

About

Uh oh!

Releases

Packages

Languages

brighthive/brighthive-mock-data

Folders and files

Latest commit

History

Repository files navigation

Brighthive Mock Data Generator

Purpose

Getting Started

Prerequisites

Directory Structure

Generating Datasets

Step-by-Step Example: Generating CRM Data

Available Datasets

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages