microsoft / CodeBERT

MIT License

Trying to run GraphCodeBERT codesearch #261

Open mmaashraf opened 1 year ago

mmaashraf commented 1 year ago

Hi,

Following the instructions at https://github.com/microsoft/CodeBERT/tree/master/GraphCodeBERT/codesearch , I unzipped the dataset and tried to execute run.sh, but the HTTP requests made by the script fail with a 403 status code. Here is the output:

bash ./run.sh 
unzip:  cannot find or open dataset.zip, dataset.zip.zip or dataset.zip.ZIP.
./run.sh: line 2: cd: dataset: No such file or directory
--2023-05-22 22:51:05--  https://s3.amazonaws.com/code-search-net/CodeSearchNet/v2/python.zip
Resolving s3.amazonaws.com (s3.amazonaws.com)... 54.231.134.80, 52.216.238.85, 52.217.124.32, ...
Connecting to s3.amazonaws.com (s3.amazonaws.com)|54.231.134.80|:443... connected.
HTTP request sent, awaiting response... 403 Forbidden
2023-05-22 22:51:05 ERROR 403: Forbidden.

--2023-05-22 22:51:05--  https://s3.amazonaws.com/code-search-net/CodeSearchNet/v2/java.zip
Resolving s3.amazonaws.com (s3.amazonaws.com)... 52.217.163.192, 52.216.238.85, 52.217.124.32, ...
Connecting to s3.amazonaws.com (s3.amazonaws.com)|52.217.163.192|:443... connected.
HTTP request sent, awaiting response... 403 Forbidden
2023-05-22 22:51:05 ERROR 403: Forbidden.

--2023-05-22 22:51:05--  https://s3.amazonaws.com/code-search-net/CodeSearchNet/v2/ruby.zip
Resolving s3.amazonaws.com (s3.amazonaws.com)... 54.231.230.24, 52.216.238.85, 52.217.117.184, ...
Connecting to s3.amazonaws.com (s3.amazonaws.com)|54.231.230.24|:443... connected.
HTTP request sent, awaiting response... 403 Forbidden
2023-05-22 22:51:06 ERROR 403: Forbidden.

--2023-05-22 22:51:06--  https://s3.amazonaws.com/code-search-net/CodeSearchNet/v2/javascript.zip
Resolving s3.amazonaws.com (s3.amazonaws.com)... 52.217.33.118, 52.216.238.85, 52.217.117.184, ...
Connecting to s3.amazonaws.com (s3.amazonaws.com)|52.217.33.118|:443... connected.
HTTP request sent, awaiting response... 403 Forbidden
2023-05-22 22:51:06 ERROR 403: Forbidden.

--2023-05-22 22:51:06--  https://s3.amazonaws.com/code-search-net/CodeSearchNet/v2/go.zip
Resolving s3.amazonaws.com (s3.amazonaws.com)... 52.217.124.32, 52.216.238.85, 52.217.117.184, ...
Connecting to s3.amazonaws.com (s3.amazonaws.com)|52.217.124.32|:443... connected.
HTTP request sent, awaiting response... 403 Forbidden
2023-05-22 22:51:07 ERROR 403: Forbidden.

--2023-05-22 22:51:07--  https://s3.amazonaws.com/code-search-net/CodeSearchNet/v2/php.zip
Resolving s3.amazonaws.com (s3.amazonaws.com)... 52.216.210.120, 52.216.238.85, 52.217.117.184, ...
Connecting to s3.amazonaws.com (s3.amazonaws.com)|52.216.210.120|:443... connected.
HTTP request sent, awaiting response... 403 Forbidden
2023-05-22 22:51:07 ERROR 403: Forbidden.

Please let me know how to fix the 403 errors. Is there an authentication process involved? If so, could you point me to the page where the instructions are given?
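
To reproduce the 403 outside of run.sh, the same URLs can be probed directly. The following is a minimal sketch, not part of the repository: the function names `probe` and `probe_all` are hypothetical, the URLs are copied from the wget output above, and it assumes a HEAD request is answered with the same status code that wget received.

```python
import urllib.error
import urllib.request

# URL prefix and language names copied from the wget output in this issue.
BASE = "https://s3.amazonaws.com/code-search-net/CodeSearchNet/v2"
LANGUAGES = ["python", "java", "ruby", "javascript", "go", "php"]


def probe(url, opener=urllib.request.urlopen):
    """Send a HEAD request to url and return the HTTP status code.

    `opener` is injectable (hypothetical parameter) so the function can be
    exercised without network access.
    """
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener(request, timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as HTTPError; report the code instead.
        return err.code


def probe_all(opener=urllib.request.urlopen):
    """Return a dict mapping each language to the status of its archive URL."""
    return {lang: probe(f"{BASE}/{lang}.zip", opener) for lang in LANGUAGES}
```

Calling `probe_all()` and printing the result shows one status code per language. If every entry is 403, the failure lies with the hosting of the archives rather than with the local setup. Network failures such as DNS errors are not handled here and would surface as `urllib.error.URLError`.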

Thanks

guoday commented 1 year ago

We have uploaded the new dataset.zip. Please download the repo and run the instructions again.
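
The output in the original report shows that the unzip step failed before any download was attempted ("cannot find or open dataset.zip"), so it can help to check the archive before running run.sh again. The sketch below is illustrative only: `check_dataset_archive` is a hypothetical helper, and it assumes run.sh is launched from the directory that should contain dataset.zip, as the unzip error suggests.

```python
import zipfile
from pathlib import Path


def check_dataset_archive(directory="."):
    """Return a list of problems that would make the unzip step fail.

    An empty list means dataset.zip exists and passes a basic integrity check.
    """
    archive = Path(directory) / "dataset.zip"
    if not archive.is_file():
        return [f"{archive} not found; the unzip step would fail"]
    if not zipfile.is_zipfile(archive):
        return [f"{archive} is not a valid zip archive"]
    with zipfile.ZipFile(archive) as zf:
        # testzip() returns the name of the first corrupt member, or None.
        bad_member = zf.testzip()
    if bad_member is not None:
        return [f"{archive} has a corrupt member: {bad_member}"]
    return []
```

Running `check_dataset_archive()` from the codesearch directory and printing the returned list gives a quick answer on whether the archive is in place before the script is started.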

mmaashraf commented 1 year ago

Thank you, it seems to be working, although it takes a while to download the artifacts. Could you share the expected output of run.sh so that I can verify that it worked?

Also, I have a follow-up question: https://github.com/microsoft/CodeBERT/issues/265