-
Partitioning is part of the Iceberg and DeltaLake protocol specifications. It is already available in the catalog implementation (https://github.com/ClickHouse/ClickHouse/blob/master/src/Storages/ObjectStora…
-
Build a UI and backend workflow for automatically saving a dataset into ERIC.
-
### What would you like to be improved?
In addition to regular table functions, datalake table formats offer various capabilities for inspecting tables.
For instance, Iceberg can display valid sna…
-
## Describe the solution you'd like
In the bots, I would like to be able to choose an existing Lambda function to query Amazon Athena, to query data sources, so I can ask Claude about my data, and …
-
ParquetOutputFormat should support a custom OutputCommitter.
There is a need to bypass the current Hadoop behavior of writing output data under the **_temporary** folder. Especially with AWS S3, there can…
-
With Databricks becoming one of the primary processing solutions for SAS migration, let's add the extension to AAW images by default. The latest version of Databricks is required to allow mounting of …
-
This is a known bug in Leaflet: https://github.com/Leaflet/Leaflet/issues/7255
Need to update Leaflet to a newer version (will also solve https://github.com/eawag-surface-waters-research/datalakes-rea…
-
There are several great books that could be added to the sample dataset, including such classics as:
The Invisible TAM
The Book of Job Scheduling
Stranger in a Strange LAN
20000 Leagues Under th…
-
Hi Team, hopefully this is the right place to ask; if not, I'd appreciate it if you could direct me.
I'm the founder of [cloudquery.io](https://www.cloudquery.io/), a high performance open source ELT framew…