RobertLiJN closed this issue 1 year ago
Just a quick note that I don't think the permissions on gs://mlperf-llm-public2 are configured properly for public access. I can access buckets like gs://t5-data/vocabs/ with no problem, but not this one. I get an error similar to the one above when trying to grab the spm file per the README (gs://mlperf-llm-public2/vocab/c4_en_301_5Mexp2_spm.model).
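For anyone wanting to check this kind of thing without any credentials configured, here is a minimal sketch using only the Python standard library. It assumes the object is reachable through the usual `https://storage.googleapis.com/<bucket>/<object>` URL form; the helper names `gs_to_https` and `anonymous_status` are made up for illustration and are not part of any library. A 200 would suggest the object is publicly readable, while a 401 or 403 would suggest it is not.

```python
import urllib.error
import urllib.parse
import urllib.request


def gs_to_https(gs_uri: str) -> str:
    """Map gs://<bucket>/<object> to its storage.googleapis.com HTTPS URL."""
    prefix = "gs://"
    if not gs_uri.startswith(prefix):
        raise ValueError(f"not a gs:// URI: {gs_uri!r}")
    bucket, _, obj = gs_uri[len(prefix):].partition("/")
    if not bucket or not obj:
        raise ValueError(f"expected gs://<bucket>/<object>, got {gs_uri!r}")
    # quote() keeps "/" intact by default, so nested object paths survive.
    return f"https://storage.googleapis.com/{bucket}/{urllib.parse.quote(obj)}"


def anonymous_status(gs_uri: str, timeout: float = 10.0) -> int:
    """Send an unauthenticated HEAD request and return the HTTP status code."""
    req = urllib.request.Request(gs_to_https(gs_uri), method="HEAD")
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as HTTPError; the code is what we want.
        return err.code


if __name__ == "__main__":
    # Path taken from the README reference quoted in this thread.
    uri = "gs://mlperf-llm-public2/vocab/c4_en_301_5Mexp2_spm.model"
    print(uri, "->", anonymous_status(uri))
```

This only tests anonymous read access to a single object; it says nothing about whether an authenticated account has been granted access to the bucket.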
@mathemakitten sorry for the late reply. The mlperf-llm-public2 bucket isn't supposed to be public yet.
Sorry to interrupt! When running one of the examples, I encountered an error that seems to suggest I cannot load from the bucket provided in c4.py.
I wonder whether this is because I haven't configured something correctly, since the bucket looks like a public one.
I tried using the TFDS default bucket (gs://tfds-data/datasets) instead of gs://mlperf-llm-public2, and this problem doesn't arise, but it requires me to choose among the available versions of c4 (not 3.0.4). Even then, I cannot proceed because it gives me some other error.
Thanks in advance for your attention and help!