HanBing0802 closed this issue 3 years ago
I think you're right that it is a compatibility issue between the Python and TensorFlow versions. An earlier version of TensorFlow should work fine (maybe try 1.14?). I would expect TensorFlow 2 to have significant differences, so it wouldn't work without a fair number of changes. Also, if you use a different Python version (maybe 3.6 or 3.7?), pip might be able to find tf 1.15.2. I have used pyenv successfully in the past to manage various Python versions on my system.
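A quick way to see which interpreter and pip the install script will actually use is sketched below. It rests on two assumptions that are not stated in this thread: that TensorFlow 1.15.x published Python 3 wheels only for CPython 3.5 to 3.7, and that the TensorFlow install guide asks for pip 19.0 or newer. The helper names `python_ok` and `pip_ok` are made up for this example.

```python
import sys

# Assumption: TensorFlow 1.15.x shipped Python 3 wheels for CPython 3.5, 3.6 and 3.7 only.
TF115_PYTHONS = {(3, 5), (3, 6), (3, 7)}
# Assumption: the TensorFlow install guide asks for pip 19.0 or newer.
MIN_PIP_MAJOR = 19


def python_ok(version_info):
    """True if (major, minor) is one of the assumed TF 1.15 interpreters."""
    return (version_info[0], version_info[1]) in TF115_PYTHONS


def pip_ok(pip_version):
    """True if the major component of the pip version string is >= MIN_PIP_MAJOR."""
    return int(pip_version.split(".")[0]) >= MIN_PIP_MAJOR


if __name__ == "__main__":
    print("Python", sys.version.split()[0], "ok for tf 1.15:", python_ok(sys.version_info))
    try:
        import pip  # imported lazily so the helpers work even without pip
        print("pip", pip.__version__, "new enough:", pip_ok(pip.__version__))
    except ImportError:
        print("pip is not importable from this interpreter")
```

Running it with the same `python3` that `install.sh` calls shows whether either check fails. If both pass and pip still cannot find tf 1.15.2, the cause is probably something else.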
I am guessing that error is because deleting the line you mentioned would skip the step where tensorflow is installed. So then when the openai-baselines package is supposed to be installed, it is missing one of its dependencies (tensorflow).
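To confirm that guess before re-running the installer, one can check whether `tensorflow` is visible to the interpreter at all. This is only a minimal sketch using the standard library; `missing_modules` is a hypothetical helper, not part of the repository, and it is meant for top-level module names only.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that this interpreter cannot find."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # An empty list means tensorflow is importable, so the baselines
    # setup.py assertion should no longer fail for this reason.
    print("missing:", missing_modules(["tensorflow"]))
```

If `tensorflow` is reported as missing, installing it first (or restoring the deleted line) should come before installing the baselines package.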
When I run `./train.sh TrainPhase1`, the result is as follows. I don't know where the problem is. I hope you can help me, thank you again!
Entered virtualenv.
[Server] Making model...
[Server] Loading Regression Model then training RL.
[NetworkVPCore] Loading checkpoint file: /home/hb/catkin_ws/src/rl_collision_avoidance/ga3c/GA3C/checkpoints/regression/wandb/run-rnn/checkpoints/network_00000000
Traceback (most recent call last):
File "/home/hb/.pyenv/versions/3.6.5/lib/python3.6/site-packages/tensorflow_core/python/client/session.py", line 1365, in _do_call
return fn(*args)
File "/home/hb/.pyenv/versions/3.6.5/lib/python3.6/site-packages/tensorflow_core/python/client/session.py", line 1350, in _run_fn
target_list, run_metadata)
File "/home/hb/.pyenv/versions/3.6.5/lib/python3.6/site-packages/tensorflow_core/python/client/session.py", line 1443, in _call_tf_sessionrun
run_metadata)
tensorflow.python.framework.errors_impl.DataLossError: not an sstable (bad magic number)
[[{{node save/RestoreV2}}]]
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "Run.py", line 75, in
Original stack trace for 'save/RestoreV2':
File "Run.py", line 75, in
Hmm, it looks like this error has to do with the model file that is being loaded. I am guessing it is trying to load one of the files that was trained via regression and then begin RL from that starting point? According to that log, the file it's looking for is at /home/hb/catkin_ws/src/rl_collision_avoidance/ga3c/GA3C/checkpoints/regression/wandb/run-rnn/checkpoints/network_00000000. Does that file exist on your machine?
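One way to answer that question is to list everything on disk that starts with the checkpoint prefix from the log. TensorFlow checkpoints are normally stored as several files sharing a prefix (for example `.index` and `.data-00000-of-00001`), so the bare name `network_00000000` may not exist as a single file. The sketch below uses only the standard library; `checkpoint_files` is a hypothetical helper, and how this repository actually stores its checkpoints is an assumption.

```python
import glob
import os

# Prefix taken from the log above; adjust it for your machine.
PREFIX = ("/home/hb/catkin_ws/src/rl_collision_avoidance/ga3c/GA3C/"
          "checkpoints/regression/wandb/run-rnn/checkpoints/network_00000000")


def checkpoint_files(prefix):
    """Return sorted (path, size_in_bytes) pairs for files starting with prefix."""
    pattern = glob.escape(prefix) + "*"
    return sorted((path, os.path.getsize(path))
                  for path in glob.glob(pattern) if os.path.isfile(path))


if __name__ == "__main__":
    found = checkpoint_files(PREFIX)
    if not found:
        print("no files start with", PREFIX)
    for path, size in found:
        print(size, "bytes", path)
```

If nothing is listed, the checkpoint was never downloaded or was saved elsewhere. If files are listed but look unexpectedly small, they may be placeholders rather than real weights, which is worth checking but is not a confirmed diagnosis.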
Hello, thanks for sharing your awesome work. I'm a beginner in this field and this is my first time learning about it. I have been troubled by these problems for a long time. I hope you can help me, thank you very much!
When I run `./install.sh`, I get the following error:
Obtaining file:///home/hb/catkin_ws/src/rl_collision_avoidance/gym-collision-avoidance
Collecting tensorflow==1.15.2 (from gym-collision-avoidance===1.0.0)
Could not find a version that satisfies the requirement tensorflow==1.15.2 (from gym-collision-avoidance===1.0.0) (from versions: 0.12.0rc0, 0.12.0rc1, 0.12.0, 0.12.1, 1.0.0, 1.0.1, 1.1.0rc0, 1.1.0rc1, 1.1.0rc2, 1.1.0, 1.2.0rc0, 1.2.0rc1, 1.2.0rc2, 1.2.0, 1.2.1, 1.3.0rc0, 1.3.0rc1, 1.3.0rc2, 1.3.0, 1.4.0rc0, 1.4.0rc1, 1.4.0, 1.4.1, 1.5.0rc0, 1.5.0rc1, 1.5.0, 1.5.1, 1.6.0rc0, 1.6.0rc1, 1.6.0, 1.7.0rc0, 1.7.0rc1, 1.7.0, 1.7.1, 1.8.0rc0, 1.8.0rc1, 1.8.0, 1.9.0rc0, 1.9.0rc1, 1.9.0rc2, 1.9.0, 1.10.0rc0, 1.10.0rc1, 1.10.0, 1.10.1, 1.11.0rc0, 1.11.0rc1, 1.11.0rc2, 1.11.0, 1.12.0rc0, 1.12.0rc1, 1.12.0rc2, 1.12.0, 1.12.2, 1.12.3, 1.13.0rc0, 1.13.0rc1, 1.13.0rc2, 1.13.1, 1.13.2, 1.14.0rc0, 1.14.0rc1, 1.14.0, 2.0.0a0, 2.0.0b0, 2.0.0b1)
No matching distribution found for tensorflow==1.15.2 (from gym-collision-avoidance===1.0.0)
I found that it comes from `python3 -m pip install -e $DIR` in /rl_collision_avoidance/gym-collision-avoidance/install.sh. Right now I am using Python 3.5 and TensorFlow 1.15.2. I guess this problem is due to inconsistent versions? Is that true? When I delete the line `python3 -m pip install -e $DIR` and run again, there is another error:
Collecting git+https://github.com/openai/baselines.git
Cloning https://github.com/openai/baselines.git to /tmp/pip-0g8mhuix-build
Complete output from command python setup.py egg_info:
running egg_info
creating pip-egg-info/baselines.egg-info
writing dependency_links to pip-egg-info/baselines.egg-info/dependency_links.txt
writing requirements to pip-egg-info/baselines.egg-info/requires.txt
writing pip-egg-info/baselines.egg-info/PKG-INFO
writing top-level names to pip-egg-info/baselines.egg-info/top_level.txt
writing manifest file 'pip-egg-info/baselines.egg-info/SOURCES.txt'
warning: manifest_maker: standard file '-c' not found
reading manifest file 'pip-egg-info/baselines.egg-info/SOURCES.txt'
writing manifest file 'pip-egg-info/baselines.egg-info/SOURCES.txt'
Traceback (most recent call last):
File "", line 1, in
File "/tmp/pip-0g8mhuix-build/setup.py", line 58, in
assert tf_pkg is not None, 'TensorFlow needed, of version above 1.4'
AssertionError: TensorFlow needed, of version above 1.4
Command "python setup.py egg_info" failed with error code 1 in /tmp/pip-0g8mhuix-build/