-
Do you know of a code example that runs fully locally, without Spark?
Something like online learning?
-
Hi,
I'm wondering whether more benchmarks are available for BIDMach on terabyte-scale data,
with or without Kylix, for popular ML algorithms?
Thanks
ghost updated
7 years ago
-
The zipped ipinyou dataset is 249 MB, and 1.5 GB unzipped, in
https://drive.google.com/drive/folders/1thXezQbmuS6Q8-AXmrhB0tLM3mybJxVR?usp=sharing
but
https://github.com/wnzhang/make-ipinyou-data
stated th…
-
**Describing the bug**
While launching the Merlin Training Docker container from NGC, I repeatedly get a
`set_mempolicy: operation not permitted` exception. I'm not sure how to ge…
-
**What needs doing**
Improve the user experience of learning how to do large joins. Customer feedback is that `JoinExternal` suggests that the operator can be used for joins between two large dataframes. In …
-
While running the NVIDIA code for DLRMv2 on a 4090 GPU with batch size 1400, we are seeing the accuracy below, which is lower than expected. Can someone help us work out whether we are missing something? We have tri…
-
Criteo dataset
40M dataset - 5000 trees
```
python3 src/criteo_speed_test.py xgboost ; python3 src/criteo_speed_test.py lightgbm; python3 src/criteo_speed_test.py arboretum
reading data....
…
```
sh1ng updated
5 years ago
-
In order to make the most of our time at the [scaling scikit-learn sprint](https://scisprints.github.io/#may2-june-joint-scikit-learn-scikit-image-dask-sprint) it might be helpful to prepare some chal…
-
Reading [these docs](https://criteo.github.io/autofaiss/getting_started/quantization.html), it appears as though one can set the entry IDs when using Parquet by setting `--id_columns`. How does one set …
-
Hi team,
I ran the default DLRM v2 training script to train the model; however, the GPU I used doesn't have enough memory for the default settings. I just modified the training script with th…