openai / universe-starter-agent

A starter agent that can solve a number of universe environments.
MIT License
1.1k stars 318 forks source link

Weights from `normalized_columns_initializer` are held constant #71

Closed Joshua-Chin closed 7 years ago

Joshua-Chin commented 7 years ago

normalized_columns_initializer returns a tf.constant instead of a tf.Variable. This means that the weights of the linear layers will be held constant through training. I'm not sure if this was intentional.

tlbtlbtlb commented 7 years ago

The tf.Variable is created in the linear function here: https://github.com/openai/universe-starter-agent/blob/master/model.py#L37. It is modified by the trainer to maximize reward. It is normal for the initializer of a variable to be a constant.