-
the latent_channels in `scripts/vae/sevirlr/cfg.yaml` is 64 but the latent_channels in the paper's Implementation Details is 4.
Will it reduce the training time during the denoise when the latent_ch…
-
1. mainly follow what the main Inference group have and replace SD XL with 1.5
2. we don't have 1.5 keras or tflite, we can start with 1.4 tflite files (1.5 and 1.4 have the same architecture, just d…
-
Thank you the nice tutorial and supporting code. I made a plot (attached) of KL Loss vs iterations of your implementation and that of Keras ([blog](https://blog.keras.io/building-autoencoders-in-keras…
-
I converted a model using:
```
python -m python_coreml_stable_diffusion.torch2coreml --convert-unet --convert-text-encoder --convert-vae-decoder --convert-vae-encoder --convert-safety-checker --bu…
-
Hey Ethan
My name is Yi Zheng. Currently I also implemented the e2c using tensorflow [e2c implementation](https://github.com/ZhengYi0310/other_stuff/blob/gh-pages/deeprlhw/hw1/e2c.py).
To …
-
Hello! Thank you for your great paper and for publishing the code and checkpoints for the t2v models. While reading about how it all works, I had a number of questions. I hope you'll find some time to…
-
Todo List
- [x] enlarge hidden size
- [x] look into the implementation
- [ ] Check Jason T2T code
- [ ] Steal Jason's T2T dataset (perhaps his dataset is better )
- [x] weight average ?
Che…
zomux updated
4 years ago
-
I am trying to reimplement your code in PyTorch and I need to know what is the difference between your loss function and the loss function regarding the vanilla VAE? Based on my experience, your KL-di…
-
https://autoencoded-vocal-analysis.readthedocs.io/en/latest/index.html
https://elifesciences.org/articles/67855
https://github.com/pearsonlab/autoencoded-vocal-analysis/tree/master
- [ ] Add VAE …
-
Hi,
I was looking at the current implementation, and was noticing that before every generation you pass all reference images through the VAE as one batch. After a certain amount of references image…