-
Hello @rom1504.
First of all, I sincerely thanks for your great contribution to the community.
My question is `Can I try downloading LAION400M with multiple PC?`
It is due to that my PC has li…
-
![FADE-Mvtec结果](https://github.com/user-attachments/assets/ee717d98-756c-47b3-9666-9ab1de95b624)
use this command line :python scripts/run_fade.py --dataset-name mvtec --dataset-source data/mvtec -…
-
Whereas /v1/chat/completions succeeds , the same body /v1/embeddings returns a 404 for a similar body
I was hoping to get the embedding output vector for an image that uses the openbmb/MiniCPM-V-2…
-
### Feature request
It'd be great to have a lazy push to hub, similar to the lazy loading we have with `IterableDataset`.
Suppose you'd like to filter [LAION](https://huggingface.co/datasets/lai…
-
When deploying SD, it was found that there was an issue with the environment. It seems that there is a compatibility issue between Paddle and Paddlenlp. The error log is as follows:
`Traceback (most …
-
-
Hi,
Thanks for the lib!
I think the 2 main alternatives in the pytorch world are webdataset and torch data. They both support tar files as shard format.
The benefit of tar is that it's standard a…
-
Maybe this one? [NudeNet Classifier Dataset](https://academictorrents.com/details/1cda9427784a6b77809f657e772814dc766b69f5)
-
According to the BLIP2 paper:
> We adopt the CapFilt method (Li et al., 2022) to create synthetic captions for the web images... We keep top-two captions per image as training data and randomly sam…
-
Hi, thanks for this great repo!
Could share the version of LAION-400M index used for the demo website? https://rom1504.github.io/clip-retrieval. So far the only public 400M index I can find is from…
xbldi updated
10 months ago