lacclab / Goal-Decoding-from-Eye-Movements

Repo for the paper "Decoding Reading Goals from Eye Movements"
MIT License
1 stars 0 forks source link

Release data on Hugging Face #1

Open NielsRogge opened 3 weeks ago

NielsRogge commented 3 weeks ago

Hello @OmerShubi 🤗

I'm Niels and work as part of the open-source team at Hugging Face. I discovered your work through AK's daily papers, the paper page is available here: https://huggingface.co/papers/2410.20779. The paper page lets people discuss about your paper and lets them find artifacts about it (your dataset for instance), you can also claim the paper as yours which will show up on your public profile at HF.

Would you like to host the datasets you've released on https://huggingface.co/datasets? I see you're using Github for it. Hosting on Hugging Face will give you more visibility/enable better discoverability, and will also allow people to do:

from datasets import load_dataset

dataset = load_dataset("your-hf-org-or-username/your-dataset")

If you're down, leaving a guide here: https://huggingface.co/docs/datasets/loading. We also support Webdataset, useful for image/video datasets: https://huggingface.co/docs/datasets/en/loading#webdataset.

Besides that, there's the dataset viewer which allows people to quickly explore the first few rows of the data in the browser.

After uploaded, we can also link the datasets to the paper page (read here) so people can discover your work.

Let me know if you're interested/need any guidance.

Kind regards, Niels

OmerShubi commented 3 weeks ago

Hi Niels, Thank you for reaching out, The textual dataset that is used in this work was upload to HuggingFace a while ago by me - malmaud/onestop_qa · Datasets at Hugging Facehttps://huggingface.co/datasets/malmaud/onestop_qa. If we are already talking, I opened a PR to update the data, but was unable to merge it - malmaud/onestop_qa · update-datahttps://huggingface.co/datasets/malmaud/onestop_qa/discussions/2, any tips on how to continue from here? Additionally, we use eyetracking-in-reading data. Is there a convenient way to store it (csv files) on HuggingFace? Our current plan is to use OSF.

Thanks, Omer


Omer Shubi Ph.D. Candidate Language, Computation and Cognition Lab Technion – Israel Institute of Technology omershubi.github.io

From: NielsRogge @.> Date: Monday, 4 November 2024 at 23:46 To: lacclab/Goal-Decoding-from-Eye-Movements @.> Cc: Omer Shubi @.>, Mention @.> Subject: [lacclab/Goal-Decoding-from-Eye-Movements] Release data on Hugging Face (Issue #1)

Hello @OmerShubihttps://github.com/OmerShubi 🤗

I'm Niels and work as part of the open-source team at Hugging Face. I discovered your work through AK's daily papers, the paper page is available here: https://huggingface.co/papers/2410.20779. The paper page lets people discuss about your paper and lets them find artifacts about it (your dataset for instance), you can also claim the paper as yours which will show up on your public profile at HF.

Would you like to host the datasets you've released on https://huggingface.co/datasets? I see you're using Github for it. Hosting on Hugging Face will give you more visibility/enable better discoverability, and will also allow people to do:

from datasets import load_dataset

dataset = load_dataset("your-hf-org-or-username/your-dataset")

If you're down, leaving a guide here: https://huggingface.co/docs/datasets/loading. We also support Webdataset, useful for image/video datasets: https://huggingface.co/docs/datasets/en/loading#webdataset.

Besides that, there's the dataset viewerhttps://huggingface.co/docs/hub/en/datasets-viewer which allows people to quickly explore the first few rows of the data in the browser.

After uploaded, we can also link the datasets to the paper page (read herehttps://huggingface.co/docs/hub/en/model-cards#linking-a-paper) so people can discover your work.

Let me know if you're interested/need any guidance.

Kind regards, Niels

— Reply to this email directly, view it on GitHubhttps://github.com/lacclab/Goal-Decoding-from-Eye-Movements/issues/1, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AGZEDZ56XS33XIWZ63YPJHLZ67TM5AVCNFSM6AAAAABRFFUEKWVHI2DSMVQWIX3LMV43ASLTON2WKOZSGYZTGOJRGY2TKMA. You are receiving this because you were mentioned.Message ID: @.***>

NielsRogge commented 3 weeks ago

Thanks for the context. Yes, csv files can be loaded as shown here: https://huggingface.co/docs/datasets/en/loading#csv, after which you can call dataset.push_to_hub("your-hf-org/your-dataset"). The csv file will then be displayed in the browser.

OmerShubi commented 3 weeks ago

Thanks!

From: NielsRogge @.> Date: Tuesday, 5 November 2024 at 17:47 To: lacclab/Goal-Decoding-from-Eye-Movements @.> Cc: Omer Shubi @.>, Mention @.> Subject: Re: [lacclab/Goal-Decoding-from-Eye-Movements] Release data on Hugging Face (Issue #1)

Thanks for the context. Yes, csv files can be loaded as shown here: https://huggingface.co/docs/datasets/en/loading#csv, after which you can call dataset.push_to_hub("your-hf-org/your-dataset"). The csv file will then be displayed in the browser.

— Reply to this email directly, view it on GitHubhttps://github.com/lacclab/Goal-Decoding-from-Eye-Movements/issues/1#issuecomment-2457527684, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AGZEDZ5AGBPBGMOCCAUBJ2LZ7DSBFAVCNFSM6AAAAABRFFUEKWVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDINJXGUZDONRYGQ. You are receiving this because you were mentioned.Message ID: @.***>