-
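Hello, your work claims to be "zero-shot" yet it requires training, which is completely inconsistent with the ReCLIP setting. How do you explain this? Did no reviewer question it during review? The paper also does not clearly explain the training procedure and data, and deliberately relegates them to the supplementary material.
Your work is labeled as "zero-shot," but it requires training, which contradicts the ReC…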
-
Why does laion2b_e16 (ViT-B-32::laion2b_e16) perform well in Chinese/English search (e.g. "猫/cat", "狗/dog") but poorly in Japanese or French?
What is the composition of the dataset for model tra…
-
Could you please guide me on how to convert a ".pt" format model into the Hugging Face format (similar to the one at https://huggingface.co/laion/CLIP-ViT-g-14-laion2B-s12B-b42K/tree/main)? It seems to…
-
Hi! Awesome repo, thanks for building this.
I have 10TB of the Laion dataset downloaded, thanks to your scripts! However, I'm trying to use your data loader, and ran into an issue.
In your Webd…
-
Can the API be searched by indice_name for laionFace (https://github.com/FacePerceiver/LAION-Face)?
Example:
client = ClipClient(
url="https://knn5.laion.ai/knn-service",
#indice_n…
-
I downloaded the cc_sbu dataset and counted the samples. The total is 12M and more than 6M downloaded successfully, which seems impossible, since cc_sbu+laion is only 5M according to your paper.…
-
Hi everyone! I am sorry that I just started this project and I am new to this topic. I am wondering where the code for supervised audio classification is. I just saw zero-shot learning. Thanks!
-
@karpathy Please add code for training the 125M [image-gpt](https://github.com/openai/image-gpt) to this repository. Maybe also extend the context length to 2k so that we can get 45x45 pixel image in…
-
See https://huggingface.co/laion/CLIP-convnext_base_w-laion_aesthetic-s13B-b82K as an example. The model/task pair is supported, see https://github.com/huggingface/api-inference-community/blob/main/d…
-
AudioLDM never finishes its audio generation; I don't think it even really starts.
I waited over 2000 seconds, which is more than 30 minutes, but nothing happens. It just keeps counting seconds.
My system:
…