-
**Describe the bug**
`cuml.UMAP(n_epochs=n)` fails in `transform()` for `n=1` and `n=2`.
**Steps/Code to reproduce bug**
```python
import cudf
import cuml
df = cudf.DataFrame({'a': range(0, 100)})
for n …
-
Problem: can't figure out how to load the training set
catboost version: 0.24.3
Operating System: Ubuntu 20.04
CPU: i7 4790k
GPU: GTX 1070
Here is my code
```
y = train_df['answered_correctly'…
-
In SKLearn, I am able to do the following (using the Iris dataset, which I downloaded from [here](https://github.com/dask-contrib/dask-sql/blob/2010dbd6471457dd50da31385b762f0ebbc3c91f/notebooks/iris.…
-
In https://github.com/rapidsai/dask-cuda/pull/226#pullrequestreview-348129792 an interesting conversation is happening between folks which I'd like to bubble up as an issue unto itself -- that is, why…
-
Hi there,
Just wondering if there's scope for a `to_cudf` type functionality so that users can read Parquet files directly into GPU memory (bypassing the CPU). This would be using the [`cudf.read_p…
-
**Describe the bug**
Ideally, we should eventually support `engine="cudf"` and `force_ascii=False` together with `to_json`. For now, we should update the documentation and/or provide a warning for us…
-
Running the cuDF benchmark with RAPIDS 22.06 results in the following:
RAPIDS 22.06 cuDF benchmark
```
$ python dask_cuda/benchmarks/local_cudf_merge.py -d 1,2 -c 100_000_000
2022-06-16 08:21:…
-
@charlesbluca [recently added](https://github.com/rapidsai/cudf/pull/8153) a new serialization method for cuDF Dataframes where data is now serialized into two buffers: one for metadata and another fo…
-
The `top_k` expression is essentially an nlargest + limit operation and [is sometimes used to short circuit](https://www.snowflake.com/en/blog/super-fast-top-k-queries/) sort + limit scenarios common in…
-
The `top_k` expression can be used as a reduction on primitive types, but it is often used alongside `group_by`, in which case it returns a nested type.
```python
import polars as pl
from functools import parti…