-
I understand this was probably a project for your personal use, but I found this approach to background removal quite different from better-known ones such as segmentation, and wanted to give …
-
When I train on a dataset with 1280x720 images and set the batch size to 1, 10, or 16, I get the following error:
InvalidArgumentError (see above for traceback): ConcatOp : Dim…
-
The UDPipe sentence splitter seems to be a bit too split-happy, creating many fragments. Is this dragging down performance of our BERT models? Furthermore, we put a lot of effort into splitting large …
-
I have seen your demo dssm/single/dssm_v3.py and would like to know how your data is organized. For example, what is the format of query.train.pickle?
-
Running `python vae_train.py` raises a memory error on my system. I felt bad about this, but after running the numbers, vae_train.py needs to allocate ~125 GB of memory for this array!
```
>>> imp…
-
I've noticed during training that the VitGAN tends to get stuck on one, two, or three (I rarely, if ever, see four) "positional blobs", for lack of better words…
-
How can I train with different pre-trained models?
Please reply.
-
Is this speed normal? If 4 epochs take two hours, wouldn't 500 epochs take about 250 hours, or roughly 10 days? Is this normal for the CelebA (256x256) datas…
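The extrapolation in this question can be checked with a few lines of Python. This is only a sketch: it assumes every epoch takes the same wall-clock time, and the function name `estimate_training_time` is made up for illustration.

```python
def estimate_training_time(hours_measured, epochs_measured, total_epochs):
    """Linearly extrapolate total training time from a short measured run.

    Assumes constant time per epoch (no warm-up, no slowdown later on).
    Returns (total_hours, total_days).
    """
    hours_per_epoch = hours_measured / epochs_measured
    total_hours = hours_per_epoch * total_epochs
    return total_hours, total_hours / 24


# Figures from the question: 2 hours for 4 epochs, 500 epochs planned.
total_hours, total_days = estimate_training_time(2, 4, 500)
print(f"{total_hours:.0f} hours, about {total_days:.1f} days")
```

With the numbers quoted above this gives 250 hours, which is a little over 10 days of continuous training.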
-
Hi Flash-Attention Team, Are there any plans to support Attention Sink style (https://arxiv.org/pdf/2309.17453v1.pdf) attention maps for causal language models? TIA!
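To make the request concrete, here is a minimal sketch of the kind of mask an attention-sink scheme implies, as I understand the linked paper: each query attends causally to a recent sliding window plus a few initial "sink" tokens. This is plain Python for illustration, not Flash-Attention's API; the function name and the parameters `num_sink` and `window` are hypothetical, and the sketch ignores how positions are reassigned inside the cache.

```python
def attention_sink_mask(seq_len, num_sink, window):
    """Build a boolean mask where mask[q][k] is True if query q may attend to key k.

    A key is visible when it is not in the future (causal) and is either
    one of the first `num_sink` tokens or within the last `window` positions.
    """
    mask = []
    for q in range(seq_len):
        row = []
        for k in range(seq_len):
            causal = k <= q              # never attend to future tokens
            recent = (q - k) < window    # inside the sliding window
            sink = k < num_sink          # initial tokens are always kept
            row.append(causal and (recent or sink))
        mask.append(row)
    return mask


# Small example: 8 tokens, 2 sink tokens, window of 3.
mask = attention_sink_mask(seq_len=8, num_sink=2, window=3)
for row in mask:
    print("".join("x" if allowed else "." for allowed in row))
```

In the printed grid, the last row shows the pattern of interest: the first two columns (sinks) and the last three columns (window) are marked, while the middle of the sequence is masked out.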