-
Hello,
I am trying to run your code but I get this error:
TypeError: cannot pickle 'weakref' object
Can you please help me with it?
Thanks
-
### Problem
Some **tensorflow** training unit tests require the following extra config in `conftest.py`, especially for **spark** backend. Some **ray** backend uts also use this config.
```python
c…
-
Hi! I followed the conda installation and I am using Jupyter notebook in WSL2. System:
32GB RAM
RTX 3090 24GB
Ryzen 5 5600x
- No issues with gradient = True.
- I installed the latest version fr…
-
When I initialize my TFT trainer to use multiple GPUs
```
# Configure network and trainer
pl.seed_everything(407)
trainer = pl.Trainer(
gpus = [0],
gradient_clip_val = 0.1 # hyperparam …
-
In the multi-container exercise, we base the exercise on wordpress:5.7, which is a supported and updated tag on Docker Hub. Unfortunately this doesn't really make a difference, since we ask trainees t…
-
### System Info
```Shell
Copy-and-paste the text below in your GitHub issue
- `Accelerate` version: 0.12.0
- Platform: Linux-5.4.0-105-generic-x86_64-with-debian-buster-sid
- Python version: …
-
### Description
I trained a NN on CPUs multiple times. At that time no GPU was detected on my machine.
I installed Cuda using pip with the hope of using GPUs. After that, I ran the same code to trai…
-
I hope this message finds you well. I recently had the opportunity to experiment with the Codellama-7b-Instruct model from GitHub repository and was pleased to observe its promising performance. Encou…
-
I meet the same problem with https://github.com/pytorch/fairseq/issues/3308 when I try to running on a single machine with multiple GPU. I can run the project successfully with single gpu. However, fa…
-
## Proposed refactor
### Motivation
Strategies today interact with dataloading, especially in distributed training. It makes sense for the strategy to directly handle this logic.
This wo…