bigquery_fdw is a BigQuery foreign data wrapper for PostgreSQL using Multicorn.
It allows to write queries in PostgreSQL SQL syntax using a foreign table. It supports most of BigQuery's data types and operators.
See getting started with Docker
You need to install the following dependencies:
# Install required packages
apt update
apt install -y postgresql-server-dev-14 python3-setuptools python3-dev make gcc git
All PostgreSQL versions from 9.2 to 14 should be supported.
# Install Multicorn
# pgsql-io/multicorn2 is a fork of Segfault-Inc/Multicorn that adds support for PostgreSQL 13/14.
# Alternatively, up to PostgreSQL 12, you can use gabfl/Multicorn that adds better support for Python3.
# You may also choose to build against the original project instead.
git clone https://github.com/pgsql-io/multicorn2.git Multicorn && cd Multicorn
make && make install
# Install bigquery_fdw
pip3 install bigquery-fdw
Major dependencies installed automatically during the installation process:
bigquery_fdw relies on Google Cloud API's default authentication.
Your need to have an environment variable GOOGLE_APPLICATION_CREDENTIALS
that has to be accessible by bigquery_fdw. Setting environment variables varies depending on OS but for Ubuntu or Debian, the preferred way is to edit /etc/postgresql/[version]/main/environment
and add:
GOOGLE_APPLICATION_CREDENTIALS = '/path/to/key.json'
Restarting PostgreSQL is required for the environment variable to be loaded.
We recommend testing the BigQuery client connectivity before trying to use the FDW.
With psql
:
CREATE EXTENSION multicorn;
CREATE SERVER bigquery_srv FOREIGN DATA WRAPPER multicorn
OPTIONS (
wrapper 'bigquery_fdw.fdw.ConstantForeignDataWrapper'
);
CREATE FOREIGN TABLE my_bigquery_table (
column1 text,
column2 bigint
) SERVER bigquery_srv
OPTIONS (
fdw_dataset 'my_dataset',
fdw_table 'my_table'
);
List of options implemented in CREATE FOREIGN TABLE
syntax:
Option | Default | Description |
---|---|---|
fdw_dataset |
- | BigQuery dataset name |
fdw_table |
- | BigQuery table name |
fdw_convert_tz |
- | Convert BigQuery time zone for dates and timestamps to selected time zone. Example: 'US/Eastern' . |
fdw_group |
'false' |
See Remote grouping and counting. |
fdw_casting |
- | See Casting. |
fdw_verbose |
'false' |
Set to 'true' to output debug information in PostrgeSQL's logs |
fdw_sql_dialect |
'standard' |
BigQuery SQL dialect. Currently only standard is supported. |