-
Dear DeepMind Team,
I am really grateful that you shared a vqvae_example for Sonnet 2. However, when I run it, the VQ-VAE loss is NaN from the very beginning. The outcome is…
-
Hey @adelacvg, thanks for sharing the code.
After reading the code, I want to ask you a few questions about the new 24k model, if you don't mind.
1. What makes this model different from the previous one (ht…
acul3 updated
2 months ago
-
Dynamorph currently loads all training data into memory at once and samples data for each mini-batch with a single process. Training could potentially be sped up using the [PyTorch data loader](https://py…
smguo updated
3 years ago
-
The paper mentions a codebook size of 4096 for all models with 128/64/32 tokens for 256x256 and 128/64 tokens for 512x512.
I was wondering why the example configuration in `README.md` and `titok.py` …
-
How can I get the dataset that was used to train the examples? What is its format? (For example, the folder hierarchy, whether the images are JPEG or PNG, and what the images are called (are they numbered in the image file…
-
I tried running `python train_vqvae.py --path '\home\lab\ffhq_dataset'` in a terminal, but I get the error `module 'torch.distributed' has no attribute 'launch'`.
I read some other distributed train…
-
Hello! Your work is excellent! I trained on my own dataset with your method, but I don't know how to evaluate my trained checkpoint. Can I only check the results saved during training in the f…
-
## ❓ Questions
Hi, thank you for the great work.
(I)
I could not figure out why it is necessary to predict codebook logits with a transformer.
Why couldn't we use the empirical distribution of codebook us…
-
I used this ResNet VAE to reconstruct images from a dataset (split into train and test sets), but the images are not being reconstructed at all. I trained the model for 50 epochs and tested it on se…
-
Something to consider as a highly speculative research project once Ganbooru is done.
---
A distinct trend in recent DL has been self-attention moving beyond sequence or text data to image data …
gwern updated
4 years ago