data-pr-downloader

Dumps all datasets found in the dataset catalog at https://data.pr.gov to disk. There were 148 datasets at the time of the initial commit (2017-07-25), so keep an eye on your disk space!

PRs are welcome!

What does it do?

The script saves everything it creates to the data_files directory, in three steps:

1. Fetches the catalog of datasets from https://data.pr.gov/data.json
2. Saves the dataset catalog to disk with a timestamp.
3. Consumes the dataset catalog and downloads all distributions for each dataset.
   - All downloaded files are named data.{file_type}
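
The steps above can be sketched roughly as follows. This is an illustrative sketch, not the contents of data_pr_downloader.py. It assumes the catalog follows the Project Open Data layout (a top-level "dataset" list whose entries carry an "identifier" and a "distribution" list with "downloadURL"). The helper names, the timestamp format, the per-dataset directory, and the rule for deriving {file_type} from the URL are all hypothetical.

```python
import json
import os
import urllib.request
from datetime import datetime, timezone
from urllib.parse import urlparse

CATALOG_URL = "https://data.pr.gov/data.json"  # from the README
OUTPUT_DIR = "data_files"                      # from the README


def catalog_filename(now):
    """Timestamped name for the saved catalog (naming scheme is an assumption)."""
    return "catalog_{}.json".format(now.strftime("%Y%m%dT%H%M%SZ"))


def file_type(url):
    """Guess the file type from the URL path; fall back to 'bin' (assumption)."""
    ext = os.path.splitext(urlparse(url).path)[1]
    return ext.lstrip(".").lower() or "bin"


def plan_downloads(catalog):
    """Yield (directory, filename, url) for every distribution in the catalog.

    Assumes the Project Open Data layout described in the lead-in.
    """
    for dataset in catalog.get("dataset", []):
        # Use the last path segment of the identifier as a directory name.
        identifier = str(dataset.get("identifier", "unknown"))
        directory = identifier.rstrip("/").rsplit("/", 1)[-1] or "unknown"
        for dist in dataset.get("distribution", []):
            url = dist.get("downloadURL")
            if url:  # skip distributions without a direct download link
                yield directory, "data.{}".format(file_type(url)), url


def main():
    os.makedirs(OUTPUT_DIR, exist_ok=True)

    # Step 1: fetch the catalog.
    with urllib.request.urlopen(CATALOG_URL) as response:
        raw = response.read()

    # Step 2: save it to disk with a timestamp.
    now = datetime.now(timezone.utc)
    with open(os.path.join(OUTPUT_DIR, catalog_filename(now)), "wb") as fh:
        fh.write(raw)

    # Step 3: download every distribution of every dataset.
    for directory, filename, url in plan_downloads(json.loads(raw)):
        target_dir = os.path.join(OUTPUT_DIR, directory)
        os.makedirs(target_dir, exist_ok=True)
        urllib.request.urlretrieve(url, os.path.join(target_dir, filename))
```

Calling main() runs all three steps against the live catalog. In this sketch, two distributions of the same dataset that share a file type would overwrite each other, so a real implementation may need a more specific naming rule.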

Running the script

1. Install pipenv, 'cause we fancy.
2. Initialize a Python 3 virtual environment: pipenv --three
3. Install dependencies: pipenv install
4. Activate the virtual environment: pipenv shell
5. Execute the script: python data_pr_downloader.py

Running with docker

1. Run ./build.sh to build the Docker image.
2. Run ./run.sh to fetch the data. Files are downloaded to the data_files directory.
