dataform-co / dataform

Dataform is a framework for managing SQL-based data operations in BigQuery
https://cloud.google.com/dataform/docs
Apache License 2.0
analytics business-intelligence data-engineering data-pipelines elt etl hacktoberfest

Dataform Core

Dataform Core is an open-source meta-language for creating SQL tables and workflows in BigQuery. It extends SQL with a dependency management system, automated data quality testing, and data documentation.
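To give a rough idea of what dependency management means in practice, here is a minimal sketch in Python. It is not Dataform's implementation or API. The table names and the `execution_order` helper are hypothetical, and the sketch only assumes that each table declares which other tables it reads from. Given those declarations, a topological sort yields an order in which every table is built after the tables it depends on.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical tables mapped to the tables they read from.
# These names are illustrative and do not come from any real Dataform project.
dependencies = {
    "raw_orders": set(),
    "raw_customers": set(),
    "stg_orders": {"raw_orders"},
    "stg_customers": {"raw_customers"},
    "customer_revenue": {"stg_orders", "stg_customers"},
}


def execution_order(deps):
    """Return table names so that every table comes after its dependencies."""
    # TopologicalSorter expects a mapping of node -> set of predecessors.
    return list(TopologicalSorter(deps).static_order())


order = execution_order(dependencies)
print(order)
```

In this sketch `customer_revenue` always comes last, because every other table is one of its direct or indirect dependencies. If the declarations contained a cycle, `graphlib` would raise `CycleError` instead of returning an order.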

Using Dataform Core, data teams can build scalable SQL data transformation pipelines that follow software engineering best practices, such as version control and testing.

For more details, see how Dataform works.

Data collections and integrations feed into Dataform, which exports this data to BI and analytics tools.

Get started

In Google Cloud Platform

Dataform in Google Cloud Platform provides a fully managed experience for building scalable data transformation pipelines in BigQuery using SQL. It includes:

Follow the quickstart guide!

With the CLI

You can run Dataform locally using the Dataform CLI tool, which you can install with the following command. Follow the CLI guide to get started.

npm i -g @dataform/cli

Useful Links

Note: this README can also be viewed on https://dataform-co.github.io/dataform.

Example Projects

Want to report a bug or request a feature?

Want to contribute?

Check out our contributors guide to get started with setting up the repo.