-
Hi, thanks for your great work. I have the following two questions:
1. Why do you set the `epochs=3` during training and inference? And do you suggest me to set it to a higher value(like 10, 20, etc)…
-
I found with 2d5-7b the checkpoint saved from LoRA tuning finetune.py with one GPU is correct, while with multiple GPU the model saved is incorrect.
Does anyone met similar problem?
For example…
-
Hello, when I run the model at regular intervals (e.g. 1 day), the data time costs a lot of time not only in the first epoch, but also in subsequent epochs, do you know how to solve it?
-
Traceback (most recent call last):
File "train.py", line 277, in
batch_loss_n, pred = solver.optimize(index+1,epoch)
File "/home/jayakumar/MSMDFF-NET-main/utils/frame_work_general.py", lin…
-
### 🐛 Describe the bug
File :OLMo/olmo/train.py
In the following training loop, we will break our pre-training for only 1 epoch ?
```
@property
def max_epochs(self) -> int:
if isinstance(se…
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…
-
Hi, Thanks for open-sourcing your notebook. I am trying to run directly the Pytorch training part in the notebook without running the Query Embedding Network part. These are steps I took.
**1 I add…
-
This is a great work for 3D pose estimation. When I train, I find that the best result I save in the first 100 epochs is the 95th epoch, but after I train 200 epochs I get the 156th epoch as the best …
-
What is the batch size and eopch when training 16*16->128*128 SRs on two 24GB NVIDIA RTX A5000 GPUs on the FFHQ dataset? How many days did you train at a time? Thanks.
-
### Search before asking
- [X] I have searched the HUB [issues](https://github.com/ultralytics/hub/issues) and found no similar bug report.
### HUB Component
Training
### Bug
I have b…