-
Hi, many thanks for your great work.
I am trying to use the default script for training. I find that even if I use batch_size=1, training runs out of memory. I am wondering what might cause the pro…
-
@AdamWill reported on Freenode:
```
bowlofeggs: i seem to be having rather a lot of trouble getting edits to update texts to work at present?
any know issues?
for e.g. i just edited https://b…
-
**Issue Type**
Bug
**Source**
binary
**Tensorflow Version**
2.8
**Custom Code**
No
**OS Platform and Distribution**
Linux
**Python version**
3.9
**Current Behaviour?**
Hi te…
-
### System Info
Running a standard training loop where I save the optimizer state_dict using opt.state_dict().
Upon loading using opt.load_state_dict() to resume, the model immediately NaNs after th…
-
**Describe the bug**
I've noticed a couple of minor inconsistencies with the Deepspeed provided optimizers.
1. I expect CPU adam and GPU adam to have identical naming. However, CPU adam has `model…
-
### System Info
Platform: Linux-5.15.148.2-2.cm2-x86_64-with-glibc2.35
Python version: 3.10.14
Bitsandbytes version: 0.43.1
Safetensors version: 0.4.5
Accelerate version: 0.34.2
Accelerate con…
-
AdamW differs with Adam only in the weight_decay handling. Everything else is the same. We should reuse code instead of hosting the same exact logic in two places.
One way to do this is to have cer…
-
### 🐛 Describe the bug
Keep getting this error.
```
Expected all tensors to be on the same device, but found at least two devices, cuda:7 and cpu! (when checking argument for argument state_steps…
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…
-
### Bug
训练正常,但是使用`inference.py`推理或者转换ONNX时报错:
```
Traceback (most recent call last):
File "/home/code/Relation-DETR/inference.py", line 165, in
inference()
File "/home/user/code/Relati…