-
## 🚀 Feature
Support a strategy for training large models that exceed both GPU memory and DRAM.
## Motivation
Models [e.g. **Factorization Machines**(1) or **DeepFM**(2)] in **recommendation task**s are us…
-
### System Info
```Shell
Hello everybody. I keep encountering the same issue: I use '1.12.1+cu102' and FastAI '2.7.9'.
I need to use the multiple GPUs in our server to train deeper networks with more…
-
I am using the code from the multinode.py file (from this DDP tutorial series: https://www.youtube.com/watch?v=KaAJtI1T2x4) with the following Slurm script:
```!/bin/bash
#SBATCH -N 2
#SBATCH …
-
Hi, first of all, thank you for your work.
I'm trying to run the model on another scRNA+scATAC-seq dataset, following the steps in the 10x_pbmc_demo.
The first steps run correctly.
```
trainer.model.eva…
-
### System Info
```Shell
`Accelerate` version: 0.22.0
Platform: Linux-5.10.192-183.736.amzn2.x86_64-x86_64-with-glibc2.29
Python version: 3.8.10
Numpy version: 1.23.1
PyTorch version (GPU?): 2…
-
### What happened + What you expected to happen
I converted existing code that works on 2.7 to 2.20 (new API).
The error:
File "/opt/project/trading/training/model/rl/multi_agent/ppo/equity/trainer…
-
## Description
Hello,
I followed the instructions to compile LightGBM with GPU support for Windows (x64), and I was able to compile and run the code. Thank you!
Unfortunately, I encountered the following exceptio…
wil70 updated
2 months ago
-
Hi, I ran into the following issue. Thank you.
`(venv) root@autodl-container-8a50119a52-f09cc96a:~/autodl-tmp/cvt2distilgpt2# dlhpcstarter -t iu_x-ray -c config/train_iu_x_ray_chen_cvt2distilgpt2.yaml --stages_mo…
-
Could you please help me with how to use the new GPU options? I used the flags `-gpuid 0 1 -gpu_verbose 0 -gpu_rank 0` for the training script, which resulted in the following error:
```
Traceback (mos…
-
#### Issue Description
Please describe your issue, along with:
Expected Behavior: It would take approximately the same amount of time when using a LocalFileModelSaver to save a MultiLayerNetwork to…