amplab / benchmark

Large scale query engine benchmark
https://amplab.cs.berkeley.edu/benchmark/
99 stars 64 forks source link

Is the dataset on S3 still available nowadays? #18

Open HaifengSun-Kira opened 2 years ago

HaifengSun-Kira commented 2 years ago

I try to access the dataset on my aws ec2 instance (us-east-2 region) but failed. I have configured the access key and secret key. Here are the outputs:

# ubuntu @ ip-172-31-33-149 in ~ [14:58:32] C:77
$ aws s3 ls s3://big-data-benchmark/pavlo/text/tiny/

An error occurred (AccessDenied) when calling the ListObjectsV2 operation: Access Denied

# ubuntu @ ip-172-31-33-149 in ~ [14:58:45] C:255
$ s3cmd ls s3://big-data-benchmark/pavlo/text-deflate/tiny/
ERROR: Access to bucket 'big-data-benchmark' was denied
ERROR: S3 error: 403 (AccessDenied): Access Denied

# ubuntu @ ip-172-31-33-149 in ~ [14:59:06] C:77
$ s3cmd ls s3n://big-data-benchmark/pavlo/text-deflate/tiny/

!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
    An unexpected error has occurred.
  Please try reproducing the error using
  the latest s3cmd code from the git master
  branch found at:
    https://github.com/s3tools/s3cmd
  and have a look at the known issues list:
    https://github.com/s3tools/s3cmd/wiki/Common-known-issues-and-their-solutions-(FAQ)
  If the error persists, please report the
  following lines (removing any private
  info as necessary) to:
   s3tools-bugs@lists.sourceforge.net

!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

Invoked as: /usr/bin/s3cmd ls s3n://big-data-benchmark/pavlo/text-deflate/tiny/
Problem: <class 'ValueError: s3n://big-data-benchmark/pavlo/text-deflate/tiny/: not a recognized URI

I wander if the dataset is still available on S3 now or is there something wrong with my operation?

peter279k commented 6 months ago

I also have the same problem and it seems that these datasets are not available anymore.

peter279k commented 6 months ago

Maybe @pwendell can help us to check this issue and ensue these datasets can be available again...