-
I have a script that takes a full gmvault-db and creates a Maildir-based mbox for a given tag. It would be nice if GMVault pre-cached a list of possible tags in its directory (as JSON, for example). I c…
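A minimal sketch of what such a tag cache could look like, assuming the tags are already available as plain strings. The file name `tags.json` and both helper functions are hypothetical and not part of GMVault:

```python
import json
from pathlib import Path
from typing import Iterable, List

# Hypothetical cache file name; GMVault does not define one.
CACHE_NAME = "tags.json"


def write_tag_cache(db_dir: str, tags: Iterable[str]) -> Path:
    """Write the de-duplicated, sorted tag list as JSON inside db_dir."""
    unique_tags = sorted(set(tags))
    cache_path = Path(db_dir) / CACHE_NAME
    cache_path.write_text(
        json.dumps(unique_tags, ensure_ascii=False, indent=2),
        encoding="utf-8",
    )
    return cache_path


def read_tag_cache(db_dir: str) -> List[str]:
    """Load the cached tag list back from db_dir."""
    cache_path = Path(db_dir) / CACHE_NAME
    return json.loads(cache_path.read_text(encoding="utf-8"))
```

A script like the one described could then call `read_tag_cache` to list the available tags without scanning the whole database.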
-
In my spare time I am starting to add support for fatcat (https://fatcat.wiki) metadata to biblio-glutton. To start, this would probably live on a branch; then we can see whether it makes sense to upstream.
As I'm po…
-
## Current Metadata-Related Challenges in `read_parquet`
The current approach to parquet-metadata handling in Dask-Dataframe has been causing pain for many users recently. The problem is especial…
-
I have a high-level design question about using text as a serialized representation of array metadata. In my opinion, it is not the best choice as a primary representation. Let me explain why
…
-
# Improve documentation
## Link
README.md in the repo
## Describe the problem
It is unclear whether one is supposed to clean up the `net._http_response` table or if there is a task that d…
-
[8 Common Data Structures every Programmer must know](https://towardsdatascience.com/8-common-data-structures-every-programmer-must-know-171acf6a1a42)
1. Arrays
2. Linked Lists
3. Stacks
4. Queues…
-
## Bug
Using SQL, it is not possible to `CREATE` a table with `GENERATED` columns, or to `INSERT` into one, with Spark 3.2.1 and Delta 1.2.0.
### Describe the problem
#### Steps to reproduce
1. download…
-
## Feature request
**Is your feature request related to a problem? Please describe.**
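The enable_fast_schema_evolution parameter currently does not support dynamic configuration; it can only be set when a table is created. As a result, existing tables cannot be modified and cannot use fast schema change. For those that have been in use all along…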
-
Instead of requiring an ssh connection to run a provisioner on a docker container, it would be nice to just do a `docker exec` so that we don't need to set up an ssh daemon on the container.
(I know …
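The idea can be sketched as follows, assuming the provisioner boils down to a shell script run inside an already-running container. The helper `build_docker_exec` is hypothetical, and the container name is a placeholder:

```python
import shlex
import subprocess
from typing import List


def build_docker_exec(container: str, script: str, shell: str = "sh") -> List[str]:
    """Build the argv that runs `script` inside a running container.

    Uses `docker exec <container> <shell> -c <script>`, so no SSH daemon
    is needed in the container.
    """
    return ["docker", "exec", container, shell, "-c", script]


def run_in_container(container: str, script: str) -> None:
    """Run the script and raise CalledProcessError on a non-zero exit code."""
    subprocess.run(build_docker_exec(container, script), check=True)


if __name__ == "__main__":
    # Only prints the command; call run_in_container() to actually execute it.
    print(shlex.join(build_docker_exec("my-container", "echo provisioned")))
```

Passing the command as a list avoids quoting problems on the host side; the script string is interpreted only by the shell inside the container.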
-
Hi all,
I'm currently having trouble writing to a partitioned table because of differences between schema evolutions. For example, I have a DataFrame partitioned by column source with thi…