-
This looks like a really good big data munging / query benchmark, and it's getting a lot of attention on Twitter:
https://cloud.google.com/blog/big-data/2016/05/bigquery-and-dataproc-shine-in-indep…
-
```
What steps will reproduce the problem?
1. Create a table with a base schema and update the schema using the update API
2. Immediately stream rows based on the new schema.
3. The error seen is {u'…
-
Migrate the following data sources to new form styling that can be found [here](https://developers.grafana.com/ui/latest/index.html?path=/story/forms-form--basic)
**Metrics**
- [ ] Prometheus
- […
-
BigQuery don't support sampling with order function in sqlalchemy. By sampling I mean raw SQL looks like this SELECT * FROM dataset.my_table TABLESAMPLE SYSTEM (10 PERCENT).
**Describe the soluti…
-
### What happened?
When writing to BQ with streaming inserts, we do some serializing to JSON and int values can have a maximum value of < 2^64. Writing ints of higher value than this results in a `Ty…
-
### Community Note
* Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help the…
-
### What happened?
The cross language BigQuery IO doesn't appear to support time.Time type fields, preventing use in some pipelines. This should be confirmed via a test, fixed and the package docum…
-
With `dbt` it is possible to run pyspark jobs using Google Dataproc, both in "cluster" and "serverless" mode (see the "BigQuery" section in https://docs.getdbt.com/docs/build/python-models#specific-da…
-
### What is it?
In the future, we may have data spread out among a bunch of places (e.g. BigQuery, Clickhouse, Postgres, random files, IPFS). Trino seems like an interesting option for running distri…
-
Hello,
I'm doing migration form fluentd 0.12 to 1.0
I have error message like " message="Error while reading data, error message: JSON table encountered too many errors, giving up. Rows: 1; erro…