-
It's a broad course as you see!
Do you think it is OK content?
What do you mean by "remove desktop environment"?
What shall we have as prereqs?
We shall probably formulate a logical text a…
-
Dan S has a method for making RunUMAP reproducible across different machines. He is trying to submit a pull request to get this incorporated into Seurat. In short it involves implementation of the app…
-
## Enhancement
It can be used to calculate top-k from a large dataset quickly, which is expected to be much faster than plain TOP-K.
But actually it's not, and even slower than the GROUP-BY. Cal…
-
Stream:
- [x] Approx Percentile Frontend: Two Phase Stateless Simple Agg
- [x] Approx Percentile Frontend: Two Phase Vnode Based Agg
- [x] Approx Percentile Frontend: Shuffle Simple Agg
- [x] Appr…
-
**Is your feature request related to a problem? Please describe.**
I would like to be able to implement a GPU version of Spark's `approx_count_distinct` function, which uses the [HyperLogLog++](https…
-
### Description
One main issue of performance regression compared to Vanilla Spark
Like:
```
ObjectHashAggregate(
keys=[h#216:string,
e#219:string,
p#212:string,
…
-
I wish we can support approx_count_distinct function.
-
`APPROX_PERCENTILE` is supported in Presto, SingleStore, Redis, Snowflake, Vertica, Databricks, TiDB ........
It is supposed to run much faster than its exact equivalents, i.e. `PERCENTILE_CONT` and …
lmatz updated
3 months ago
-
### Is your feature request related to a problem or challenge?
See https://github.com/pydantic/logfire/issues/433, it would be great to have a `percentile_cont` function available in DF that performe…
-
### Checks
- [X] I have read the FAQ section, searched the open issues, and still think this is a new bug.
### Describe the bug you encountered:
When I try to connect to a database using the sftp p…