microsoft / CodeXGLUE

CodeXGLUE
MIT License
1.5k stars 363 forks source link

403 Forbidden error for Code-To-Text data files #156

Open albertvillanova opened 1 year ago

albertvillanova commented 1 year ago

When trying to load your Code-To-Text data files, we get a 403 Forbidden error:

albertvillanova commented 1 year ago

I see CodeSearchNet has archived their repo (11 Apr 2023) and (I guess) removed access to their S3 data files.

Antolin1 commented 1 year ago

Do you know if the CodeSearchNet dataset will be available again? :(

celbree commented 1 year ago

We also notice that CodeSearchNet is not available by their official link. So we have uploaded the original datasets to Zenodo

albertvillanova commented 1 year ago

We have contacted the authors of the dataset and propose them to host their data on the Hugging Face Hub.

I will keep you informed.

See:

CC: @hamelsmu @julianeagu

Antolin1 commented 1 year ago

perfect! thx!

albertvillanova commented 1 year ago

Finally, we are hosting the data files in the Hugging Face Hub, e.g.: https://huggingface.co/datasets/code_search_net/blob/main/data/python.zip

See PRs: