python3 main.py --model gpt3_XL_256_Pile --check_dataset
2021-03-23 01:49:48.909831: W tensorflow/stream_executor/platform/default/dso_loader.cc:60] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory
2021-03-23 01:49:48.909862: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.
WARNING:tensorflow:From /opt/conda/envs/gptx/lib/python3.8/site-packages/tensorflow/python/compat/v2_compat.py:96: disable_resource_variables (from tensorflow.python.ops.variable_scope) is deprecated and will be removed in a future version.
Instructions for updating:
non-resource variables are not supported in the long term
Current step 400000
Traceback (most recent call last):
File "main.py", line 257, in <module>
main(args)
File "main.py", line 85, in main
check_dataset(input_fn, params, global_step=current_step)
File "/root/Repo/GPTNeo/utils.py", line 206, in check_dataset
tf.enable_eager_execution()
File "/opt/conda/envs/gptx/lib/python3.8/site-packages/tensorflow/python/framework/ops.py", line 5863, in enable_eager_execution
return enable_eager_execution_internal(
File "/opt/conda/envs/gptx/lib/python3.8/site-packages/tensorflow/python/framework/ops.py", line 5925, in enable_eager_execution_internal
raise ValueError(
ValueError: tf.enable_eager_execution must be called at program startup.
I've got nothing to add to this one now for the same reason I listed at the bottom of issue 158. Feel free to re-open if you think it will help others.
All other details are the same as in my issue here: https://github.com/EleutherAI/gpt-neo/issues/158