-
Hi,
I recently read [this](https://ml.berkeley.edu/blog/posts/clip-art/) blog and was fascinated by the potential of these generative models. I am hoping to learn the fundamentals, reimplement models…
-
Hi, thank you for the great work.
I have a question about Eq. 1 of the supplementary material.
$\mathcal L_\text{VQ-VAE}=-\log p(X|\mathbf Z) + \|\text{sg}[\hat{\mathbf Z}]-\mathbf Z\|^2_2+\|\hat{\mathbf Z} - \tex…
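For reference, the three loss terms can be sketched numerically. This is a minimal numpy sketch under my own assumptions: MSE stands in for $-\log p(X|\mathbf Z)$ (exact for a Gaussian decoder with fixed variance, up to constants), means replace sums, and the function name and `beta` default are illustrative. Note that `sg[]` (stop-gradient) only matters under autodiff, which plain numpy does not model, so here it only appears in the comments:

```python
import numpy as np

def vq_vae_loss_terms(x, x_recon, z_e, z_q, beta=0.25):
    """Forward-only sketch of the three VQ-VAE loss terms (names assumed)."""
    # Reconstruction term: MSE stands in for -log p(X|Z).
    recon = np.mean((x - x_recon) ** 2)
    # Codebook term: ||sg[z_e] - z_q||^2 moves codebook vectors toward
    # encoder outputs; sg[] only changes gradient flow, not the value.
    codebook = np.mean((z_e - z_q) ** 2)
    # Commitment term: beta * ||z_e - sg[z_q]||^2 keeps the encoder
    # committed to its assigned codebook entry.
    commit = beta * np.mean((z_e - z_q) ** 2)
    return recon + codebook + commit

x = np.zeros((2, 3))
x_recon = np.zeros((2, 3))   # perfect reconstruction -> recon term is 0
z_e = np.ones((2, 4))        # encoder outputs
z_q = np.zeros((2, 4))       # nearest codebook vectors
print(vq_vae_loss_terms(x, x_recon, z_e, z_q))  # 0 + 1 + 0.25 = 1.25
```

In a real implementation the codebook and commitment terms differ only in where the stop-gradient is placed, which is why they share one squared distance here.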
-
Hello, first of all, thanks for this interesting implementation of the VQ-VAE-2 paper.
I can train this network on my own dataset; however, the reconstructed images are a little blurry. Quality is goo…
-
Hi, this project uses a VQ-VAE to compress video into a small latent space, and the latent embedding dim is `512` or `256`. But in LDM they usually use a very small embedding dim of `4` or `3`; SD uses `4`. Will th…
-
As far as I understand, the perplexity used in this repo's VQ-VAE is roughly the number of codebook tokens that are meaningfully used.
When only one codebook token is used, the perplexity is 1.
When all codebook to…
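That reading matches the usual definition: perplexity as the exponential of the entropy of the code-usage distribution, which equals 1 when a single code is used and equals the codebook size when usage is uniform. A minimal numpy sketch (the function name is my own, not this repo's API):

```python
import numpy as np

def codebook_perplexity(code_indices, codebook_size):
    """exp(entropy) of how often each codebook entry is selected."""
    counts = np.bincount(code_indices, minlength=codebook_size)
    probs = counts / counts.sum()
    nonzero = probs[probs > 0]            # 0 * log(0) is taken as 0
    entropy = -np.sum(nonzero * np.log(nonzero))
    return float(np.exp(entropy))

# Only one token ever selected -> perplexity 1
print(codebook_perplexity(np.zeros(100, dtype=int), 8))  # 1.0
# All 8 tokens used uniformly -> perplexity 8
print(codebook_perplexity(np.arange(8).repeat(10), 8))   # 8.0
```

Values between these extremes indicate how many codes are "effectively" in use, which is why low perplexity is a common symptom of codebook collapse.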
-
The paper mentions a codebook size of 4096 for all models, with 128/64/32 tokens for 256x256 and 128/64 tokens for 512x512.
I was wondering why the example configuration in `README.md` and `titok.py` …
-
Interesting work! However, in the DST module, the encoded feature maps with shape [T, C, \hat{H}, \hat{W}] are quantized into feature maps with shape [T, D, \hat{H}, \hat{W}]. It is reall…
-
For utterance encoding, gesture encoding, and facial expression encoding, we first apply a nonlinear solver technique to extract the relevant features, owing to what is expected …
-
-
Hi,
I was training on my own dataset with taming's VQGAN.
There's an error with the dimensions in the VAE model:
![image](https://user-images.githubusercontent.com/85055246/124013852-68a8e680-…