scicode-bench / SciCode

A benchmark that challenges language models to code solutions for scientific problems
Apache License 2.0
67 stars 7 forks source link

Improve discoverability of your work on Hugging Face #5

Open NielsRogge opened 1 month ago

NielsRogge commented 1 month ago

Hi,

Niels here from the open-source team at Hugging Face. I discovered your work through the paper page: https://huggingface.co/papers/2407.13168 (feel free to claim the paper so that it appears at your HF account!).

However there are a couple of things which could improve the discoverability of leaderboard + dataset, which I've listed below.

Make dataset available on the hub

Would you be interested in making your dataset available on the hub?

This way, people could load the dataset in 2 lines of code, like so (assuming they got access through the manual approval and are logged in with their HF account):

from datasets import load_dataset

dataset = load_dataset("your-hf-organization/scicode")

Moreover, we support gated access which requires people to fill in their contact information (similar to how models like LLaMa-3 are gated): https://huggingface.co/docs/hub/en/datasets-gated

Leaderboard as an app on Spaces

Secondly, if you're interested, the leaderboard could be hosted on Spaces using Streamlit/Gradio/Docker app. See here for more info: https://huggingface.co/docs/hub/en/spaces-overview

Let me know if you need any help regarding this!

Cheers,

Niels ML Engineer @ HF 🤗

ofirpress commented 1 month ago

I agree, I think it would be useful to have this on HF.