Open dniku opened 5 years ago
I have verified that it works with the current version of the justheuristic/practical_rl
Docker image, which contains Tensorflow 1.13.1.
Confirmed that !pip install tensorflow-gpu==1.13.1
instead of %tensorflow_version 1.x
in the Colab init cell fixes the issue in Colab.
It still doesn't work, although we're now installing an old version in Colab.
https://www.coursera.org/learn/practical-rl/discussions/all/threads/b4Bm1b6OEemlhhJkLrq7mA reports that the honor track assignment hangs on Colab with the current TF version (1.14.0), but works with an old one (1.6.0). I have successfully reproduced the issue.
The culprit is somewhere in
basic_model_tf.py
, which is also present almost unmodified inmaster
(week07_seq2seq
). Most likely, we haven't noticed that because no one has attempted that week with TF instead of PyTorch. In any case, this must be fixed one way (finding the cause of the issue) or another (getting rid of TF inmaster
andcoursera
).It seems that the issue is caused by the invocation of
dynamic_rnn
, which hangs (I've added a couple of debugprint
s and the last one that fired was the one before the call to that function).dynamic_rnn
is deprecated; this SO thread is probably relevant for migration: https://stackoverflow.com/questions/54989442/rnn-in-tensorflow-vs-keras-depreciation-of-tf-nn-dynamic-rnn