mozilla / bigquery-etl

Bigquery ETL
https://mozilla.github.io/bigquery-etl
Mozilla Public License 2.0
253 stars 100 forks source link

CircleCI

BigQuery ETL

This repository contains Mozilla Data Team's:

For more information, see https://mozilla.github.io/bigquery-etl/

Quick Start

Pre-requisites

GCP CLI tools

Installing bqetl

  1. Clone the repository

    git clone git@github.com:mozilla/bigquery-etl.git
    cd bigquery-etl
  2. Install the bqetl command line tool

    ./bqetl bootstrap
  3. Install standard pre-commit hooks

    venv/bin/pre-commit install

Finally, if you are using Visual Studio Code, you may also wish to use our recommended defaults:

cp .vscode/settings.json.default .vscode/settings.json
cp .vscode/launch.json.default .vscode/launch.json

And you should now be set up to start working in the repo! The easiest way to do this is for many tasks is to use bqetl. You may also want to read up on common workflows.

Releasing a new version of bqetl

To push a new version of bqetl to PyPI, update the version in pyproject.toml. The version numbers follow the CalVer scheme, with the Micro version numbers starting at 1. For example, for the first package version getting published in March 2024, the version would be 2024.3.1.