-
Hi, it seems that a large number of buildings in the Netherlands are missing from the buildings-footprint dataset.
```
catalog = pystac_client.Client.open(
"https://planetarycomputer.mi…
```
-
On my installation, the tags table has grown considerably, because the number of metrics with tags has greatly increased.
The space occupied is somewhat confusing - tables with data take up less a…
-
### Please describe why this is necessary.
Querying tables in Azure Data Lake Storage that have a lot of transactions takes forever.
This happens because the `object_store` crate is not able to…
-
Re: https://docs.dagster.io/deployment/dagster-instance#dagster-storage
### What's the use case?
My company has an existing ad hoc job execution framework written in Python, which stores its exe…
-
# Environment
- Linux
- Python 3.10.10
- deltalake==0.10.2
**Environment**:
- **Cloud provider**: Azure Databricks
***
# Bug
**What happened**:
I am trying to replicate this example f…
MigQ2 updated
2 weeks ago
-
### Description of the improvement
AWS supports Hudi in most of its data services, and many users rely on AWS SDK for pandas (formerly AWS Data Wrangler) to handle their data.
Since hudi-rs provides P…
kazdy updated
11 hours ago
-
I have a Ruby script that pulls statistics from Azure Table storage.
Once in a while I get a 403 Forbidden (RestClient::Forbidden).
The line that always fails is:
mbData = service.query('DataAzur…
-
**Use case**
Minimize overhead of loading index for newly inserted/merged parts.
**Describe the solution you'd like**
Per table setting `load_primary_index_on_fetch` that will load PK into me…
qoega updated
2 weeks ago
-
# Environment
**Delta-rs version**: 0.19.0
**What happened**:
We write data to a Delta table using delta-rs with the PyArrow engine, with DayHour as the partition column.
```
deltalake.writ…
-
> Partially related to https://github.com/scylladb/scylladb/issues/12258, it'd be great if we could deprecate compact storage tables.
> @tgrabiec - would be happy to hear your thoughts on how we can ac…