-
Thanks for your work.
I am training on the CelebA-HQ dataset, and after 110k steps the generated images seem to have a color problem. Is there something I need to do differently with the dataset?
![64a5ac5ea0…
-
@sr-cdf -- will post updates here
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
pass
### Reproduction
```
CUDA_VISIBLE_DEVICES="0,1,2,3,4,5,6,7" accelerate launch \
--config_fil…
-
Hi there,
I'm trying to reproduce the pre-training of SatClip based on the S100 dataset. In default.yaml, I changed the following:
- `in_channels` parameter to 13 and the `vision_layer` to `…
-
0x73696d616f
high
# Profitable liquidations and accumulation of bad debt due to earnings accumulator not being triggered before liquidating
## Summary
The earnings accumulator is not updated and c…
-
I'm trying to fine-tune the 6.7B model on my own code dataset. I am running multi-node training in fp32 precision on NVIDIA Tesla V100 GPUs with DeepSpeed ZeRO Stage 3. My training loss seems to ra…
zpx01 updated
7 months ago
-
I have a thousand tasks running in parallel, and each task has two steps:
1. Matrix multiplication, C[i] = A[i]*B[i]. The matrix sizes are non-uniform, with m, n, and k each in the range 10 to 1024.
2. some oper…
-
### What version of Bun is running?
1.1.26+0a37423ba
### What platform is your computer?
Linux 6.6.16-linuxkit aarch64
### What steps can reproduce the bug?
- Run Bun in a containerized…
-
As stated in the paper, 8 GPUs were used for training the models. Since the batch size in the config is set to 1, this means the batch size for each gradient step is 8, right? So when training on 1 V100 GP…
-
One feature FORM seems to be missing is a multi-threaded accumulator for dollar variables. I imagine code like the following:
```
L F = f(x1) + f(x2) + f(x3);
id f(x?$r) = 0;
ModuleOption accum …