Open hannes-ucsc opened 8 years ago
I'll get to this tomorrow!
I suggest there be 4 options for URL:

S3://BUCKET/
- All keys in the bucket
S3://BUCKET/KEY/
- All keys in the bucket in folder KEY (must point to a folder)
S3://BUCKET/KEY
- A single KEY in the bucket (must point to a file)
S3://BUCKET/KEY + --prefix
- All keys in the bucket with prefix KEY

There is a small chance that the user may have something like
UCSC/
UCSC/file_1
UCSC/file_2
UCSC_manifest
And if we use your suggestion, S3://BUCKET/UCSC --prefix gives you all 4 keys.
Using this suggestion, you can get all files in UCSC with S3://BUCKET/UCSC/
Might be worth considering?
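
To make the difference concrete, here is a rough sketch of how the four URL forms could select keys. It works on a plain in-memory list of key names rather than a real bucket listing, and select_keys is a hypothetical helper made up for illustration, not anything in s3am.

```python
def select_keys(keys, key="", prefix=False):
    """Select key names according to the four proposed URL forms.

    keys   -- all key names in the bucket (a hypothetical in-memory listing)
    key    -- the KEY part of S3://BUCKET/KEY, or '' for S3://BUCKET/
    prefix -- True if --prefix was given
    """
    if key == "":
        # S3://BUCKET/ selects every key in the bucket.
        return list(keys)
    if key.endswith("/") or prefix:
        # S3://BUCKET/KEY/ and S3://BUCKET/KEY + --prefix both match by prefix.
        return [k for k in keys if k.startswith(key)]
    # S3://BUCKET/KEY without --prefix selects exactly one key.
    return [k for k in keys if k == key]


bucket_keys = ["UCSC/", "UCSC/file_1", "UCSC/file_2", "UCSC_manifest"]

# UCSC + --prefix also picks up UCSC_manifest; UCSC/ does not.
with_prefix = select_keys(bucket_keys, "UCSC", prefix=True)
folder_only = select_keys(bucket_keys, "UCSC/")
```

With the example keys above, the trailing slash is what keeps UCSC_manifest out of the result.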
URL should be enforced to be s3://BUCKET/ or s3://BUCKET/KEY, where KEY is to be interpreted as a key prefix if --prefix is given (to be consistent with s3am upload). All matching objects on S3 should be listed and the master key should be used to derive a per-file key. The resulting per-file key and the object's URL should be written to the output file, one line per object, in the form

s3://BUCKET/FULL/KEY\tSSE_KEY_BASE64

where SSE_KEY_BASE64 is the base64 encoding of the binary SSE key and \t is the tab character.

The master key should be specified using --sse-key and friends, but those options should be mandatory for get-sse-keys, despite the oxymoronic nature of that requirement.
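
A minimal sketch of producing one output line in that form might look like the following. Note the assumptions: the key derivation here is HMAC-SHA256 over the object URL, used purely as a stand-in because this thread does not say how the per-file key is derived, and both function names are hypothetical.

```python
import base64
import hashlib
import hmac


def derive_per_file_key(master_key, url):
    # ASSUMPTION: HMAC-SHA256 is only a placeholder derivation, not
    # necessarily what s3am does. It yields 32 bytes, the size of an
    # AES-256 key as used for SSE-C.
    return hmac.new(master_key, url.encode("utf-8"), hashlib.sha256).digest()


def format_line(bucket, key, master_key):
    # Build the object's URL, derive its key, and join both with a tab.
    url = "s3://%s/%s" % (bucket, key)
    sse_key = derive_per_file_key(master_key, url)
    return url + "\t" + base64.b64encode(sse_key).decode("ascii")


# Example with a dummy all-zero master key (never use such a key for real).
line = format_line("BUCKET", "FULL/KEY", b"\x00" * 32)
```

Each line can then be split on the tab character to recover the URL and the base64-encoded key.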