allenai / unified-io-2

Apache License 2.0
572 stars 27 forks source link

All attempts to get a Google authentication bearer token failed, returning an empty token. Retrieving token from files failed with "NOT_FOUND: Could not locate the credentials file.". #16

Open Zh0uzh opened 7 months ago

Zh0uzh commented 7 months ago

The error message is as follows

2024-04-09 17:16:37.556487: W tensorflow/core/platform/cloud/google_auth_provider.cc:184] All attempts to get a Google authentication bearer token failed, returning an empty token. Retrieving token from files failed with "NOT_FOUND: Could not locate the credentials file.". Retrieving token from GCE failed with "ABORTED: All 10 retry attempts failed. The last failure: Error executing an HTTP request: HTTP response code 502 with body '<html>
<head><title>502 Bad Gateway</title></head>
<body>
<center><h1>502 Bad Gateway</h1></center>
<hr><center>nginx/1.16.0</center>
</body>
</html>
'".

2024-04-09 17:18:40.919423: E tensorflow/core/platform/cloud/curl_http_request.cc:614] The transmission of request 0x2953d440 (URI: https://www.googleapis.com/storage/v1/b/t5-data/o/vocabs%2Fcc_all.32000.100extra%2Fsentencepiece.model?fields=size%2Cgeneration%2Cupdated) has been stuck at 0 of 0 bytes for 61 seconds and will be aborted. CURL timing information: lookup time: 3.4e-05 (No error), connect time: 0.019668 (No error), pre-transfer time: 0 (No error), start-transfer time: 0 (No error)

tensorflow.python.framework.errors_impl.AbortedError: All 10 retry attempts failed. The last failure: Error executing an HTTP request: libcurl code 42 meaning 'Operation was aborted by an application callback', error details: Callback aborted
         when reading metadata of gs://t5-data/vocabs/cc_all.32000.100extra/sentencepiece.model``

It seems like the problem is in the file "unified-io-2/unified-io-2/t5x/examples/unified_io/t5_1_1/base.gin"

# Vocabulary (shared by encoder and decoder)
VOCABULARY = @seqio.SentencePieceVocabulary()
seqio.SentencePieceVocabulary.sentencepiece_model_file = "gs://t5-data/vocabs/cc_all.32000.100extra/sentencepiece.model"

I cannot correctly load sentencepiece.model online from "gs://t5-data/vocabs/cc_all.32000.100extra/sentencepiece.model". Is there any way to get it offline?