Niels here from the open-source team at Hugging Face. I discovered your work through the daily papers: https://huggingface.co/papers/2409.09213 (feel free to claim authorship with your HF account). I work together with AK on improving the visibility of researchers' work on the hub.
I was wondering you'd be up for collaborating on releasing the checkpoints on the 🤗 hub, rather than Google Drive, to improve their discoverability. We can add tags so that people find them when filtering https://huggingface.co/models.
See here for a guide: https://huggingface.co/docs/hub/models-uploading. In case the models are custom PyTorch model, we could probably leverage the PyTorchModelHubMixin class which adds from_pretrained and push_to_hub to each model. Alternatively, one can leverages the hf_hub_download one-liner to download a checkpoint from the hub.
We encourage researchers to push each model checkpoint to a separate model repository, so that things like download stats also work. Moreover, we can then link the checkpoints to the paper page, improving their visibility.
Let me know if you're interested/need any help regarding this!
Hi @Sreyan88,
Niels here from the open-source team at Hugging Face. I discovered your work through the daily papers: https://huggingface.co/papers/2409.09213 (feel free to claim authorship with your HF account). I work together with AK on improving the visibility of researchers' work on the hub.
I was wondering you'd be up for collaborating on releasing the checkpoints on the 🤗 hub, rather than Google Drive, to improve their discoverability. We can add tags so that people find them when filtering https://huggingface.co/models.
For instance in this case, the "feature-extraction" tag seems useful, as it's also used by CLAP, e.g. https://huggingface.co/laion/clap-htsat-unfused
Uploading models
See here for a guide: https://huggingface.co/docs/hub/models-uploading. In case the models are custom PyTorch model, we could probably leverage the PyTorchModelHubMixin class which adds
from_pretrained
andpush_to_hub
to each model. Alternatively, one can leverages the hf_hub_download one-liner to download a checkpoint from the hub.We encourage researchers to push each model checkpoint to a separate model repository, so that things like download stats also work. Moreover, we can then link the checkpoints to the paper page, improving their visibility.
Let me know if you're interested/need any help regarding this!
Cheers,
Niels ML Engineer @ HF 🤗
cc @Vaibhavs10