Unable to access Google Cloud Storage bucket

abdulhaim / LMRL-Gym

MIT License


Unable to access Google Cloud Storage bucket #1

Closed by sbhambr1 5 months ago

sbhambr1 commented 7 months ago

Hi, I am unable to run the first command, which trains the BC model: python llm_rl_scripts/maze/bc/fully_observed_bc.py HF gpt2 gcs://rl-llm-bench-dataset-internal/maze/fully_observed_maze_data.jsonl --outputs-path bc_checkpoint_path

I am receiving the following error:

OSError: Forbidden: b/rl-llm-bench-dataset-internal/o username@email.com does not have storage.objects.list access to the Google Cloud Storage bucket. Permission 'storage.objects.list' denied on resource (or it may not exist).

Any help is appreciated!

sbhambr1 commented 7 months ago

As a follow-up with more detail: I also checked https://rail.eecs.berkeley.edu/datasets/rl-llm-bench-dataset/, which does not provide access to download the datasets either.

DT6A commented 7 months ago

Having the same issue. Any updates on that?

icwhite commented 7 months ago

Hi! Thank you for bringing this to our attention. The python llm_rl_scripts/maze/bc/fully_observed_bc.py HF gpt2 gcs://rl-llm-bench-dataset-internal/maze/fully_observed_maze_data.jsonl --outputs-path bc_checkpoint_path command should work now that we have properly configured permissions.

We are still working on making the https://rail.eecs.berkeley.edu/datasets/rl-llm-bench-dataset/ bucket accessible to outside users. This will be resolved within the week.

Thank you for your patience!

icwhite commented 7 months ago

We have made the bucket private again. Please download the datasets from this link: https://rail.eecs.berkeley.edu/datasets/rl-llm-bench-dataset/
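For anyone scripting the download, here is a minimal sketch of fetching a dataset over HTTPS instead of reading it from the private bucket. It rests on assumptions the thread does not confirm: that the files under the link use the same relative layout as the bucket (for example maze/fully_observed_maze_data.jsonl), and that the training script accepts a local file path in place of the gcs:// path. The helper names gcs_to_http_url and download are hypothetical and not part of LMRL-Gym.

```python
import shutil
import urllib.request
from pathlib import Path
from urllib.parse import urlparse

# Base URL quoted in this thread.
HTTP_BASE = "https://rail.eecs.berkeley.edu/datasets/rl-llm-bench-dataset/"


def gcs_to_http_url(gcs_path: str, http_base: str = HTTP_BASE) -> str:
    """Map gcs://<bucket>/<key> to <http_base>/<key>.

    ASSUMPTION: the HTTP location mirrors the bucket's relative layout.
    The thread does not confirm this, so check the link's listing first.
    """
    parsed = urlparse(gcs_path)
    if parsed.scheme != "gcs" or not parsed.netloc:
        raise ValueError(f"not a gcs:// path: {gcs_path!r}")
    key = parsed.path.lstrip("/")  # drop the leading slash after the bucket name
    return http_base.rstrip("/") + "/" + key


def download(url: str, dest_dir: str = "data") -> Path:
    """Stream the file at url into dest_dir and return the local path."""
    dest = Path(dest_dir) / Path(urlparse(url).path).name
    dest.parent.mkdir(parents=True, exist_ok=True)
    with urllib.request.urlopen(url) as response, open(dest, "wb") as out:
        shutil.copyfileobj(response, out)  # copy in chunks, not all in memory
    return dest
```

Calling download(gcs_to_http_url("gcs://rl-llm-bench-dataset-internal/maze/fully_observed_maze_data.jsonl")) would save the file under data/. If the script accepts local paths, that local path could then replace the gcs:// argument in the training command.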