ai-forever / ru-dalle

Generate images from texts. In Russian
https://rudalle.ru/
Apache License 2.0
1.64k stars 244 forks source link

ImageNet classification with ru-dalle? #60

Open prnvjb opened 2 years ago

prnvjb commented 2 years ago

Hi Team, Thanks for the excellent contribution to open source. I've been trying to adapt your code. I'm mostly focused on getting image embeddings from the given image and train a classifier on top of it. I guess dalle code is composed on text and image embeddings. Any direction on generation image feature vector, what part of code I should modify?

Any help would be greatly appreciated.

Thanks.

AlexWortega commented 2 years ago

You can try to use vqgan image encoder and mlp head for classification. but better to use VIT/RN50 and other

prnvjb commented 2 years ago

Thanks for the response. I just want to check ru-dalle's image encoder performance on zero shot image classification.