Skip to content
Anthony Miyaguchi edited this page Aug 16, 2017 · 1 revision

Python MozETL

What is this repository

python_mozetl is a collection of scripts that extract, transform, and load data from and to various locations. This repository provides a common location for these scripts that would otherwise reside in notebooks, gists, and miscellaneous locations that don't quite fit. This repository provides an interface that inter-ops with various scheduling mechanisms (airflow, ATMO), the infrastructure to develop and test jobs, and a central location for peer review.

What is an ETL job?

Why Python?

How does this repository work?

Repository organization

Clone this wiki locally