-
https://github.com/florasion/D-VQVAE/blob/82f319b5d2ef871abb5a6d0b5f6ac62466615e51/dataset/utils_HO3D_FPHA.py#L26
Thanks for sharing your valuable work and code! I encountered two issues while work…
-
Is it possible to update the example for vqvae to also include how to use it on raw audio and/or video data?
Thank you
-
Hi,
This is really a very good repo for learning stable diffusion from scratch. However, I found the missing scaling factor that should have been applied to latent $z$ before U-Net. It was said to …
-
Dear authors, thanks for your interesting work and plans. However, there is one question in my mind: why you choose to use VQVAE instead of VAE?
As stated both in DiT and SoRA's official website, bot…
-
111
-
First of all thank you to the author of this tool. Installed and Run successfully in 1 go.
So I installed it and used 2 commands to generate NPY and MP4.
python generation.py folder=model_weight…
-
This is the error I see when I went through all steps as the instructions, and this link helps me out:
https://github.com/CompVis/stable-diffusion/issues/72
-
Hi! Thanks for you work. In the implementation of VQVAE (https://github.com/deepmind/sonnet/blob/v2/sonnet/src/nets/vqvae.py#L89C1-L89C1), perplexity is used as an evaluation measure for VQ codebook. …
-
>“To obtain this pre-trained model, you have the option of either using the repository at https://github.com/karchkha/MelSpec_VQVAE to train the model yourself, or downloading it from the provided lin…
-
Hi, the training is vqvae_lower_foot is quite unstable, as
![rec_cnn_vqvae_lower_foot_30_0607_110753](https://github.com/PantoMatrix/PantoMatrix/assets/23240241/cac2b1b9-8656-4ba1-812f-1138f6dbe94f…