-
## Description
Hello,
I have the same resuts if I start 2 times the same training on my big dataset (a bin file).
I have different results if I start a new training from a saved model
**Details…
wil70 updated
2 months ago
-
## 1. The entire URL of the file you are using
https://github.com/tensorflow/models/blob/master/research/object_detection/model_main_tf2.py
## 2. Describe the bug
Training SSD object detectio…
-
Hi! We're testing your code on our dataset, the training iterations go on smoothly, but the validations in between the iterations are like extremely slow (4000 images in like 6 hours). In addition, ev…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [ ] I am using the latest TensorFlow Model Garden release and TensorFlow 2.
- [x] I am reporting…
-
# accelerate_config with num_processes == 3
> compute_environment: LOCAL_MACHINE
debug: true
deepspeed_config:
gradient_accumulation_steps: 2
gradient_clipping: 1.0
offload_optimizer_devi…
-
### 🐛 Describe the bug
I was trying to run Bert model training on ICELAKE CPU with torch.compile mode then it is giving a value error, but when i am running it with eager mode then it is running fi…
-
During the `ilab config init` and machine with H100 GPUs. ( a3-highgpu-8g to be specific in GCP ) will detect the H100 as being a A100.
This could raise doubts on proper identification of system.
`…
-
Hello, I had tried to train custom data set using CPU, but i got message like this
Anyone, can help me to solve it?
i had tried add --device cpu when running the program
/site-packages/torch…
-
Hello, after completing the training of the model, I don't know how to choose the right ckpt. So, I would appreciate it if I could answer any questions.
1. When evaluating and testing, do you execu…
-
Considering that this program can use CPU and low VRam cards to train, how about adding a way or parameter to continue training from saved splat.ply? Is this even feasible?