*bug* batch normalization not appropriately applied

tensorflow / tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Apache License 2.0


Open cbockman opened 6 years ago

cbockman commented 6 years ago

Batch normalization gets applied here:

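Per https://www.tensorflow.org/versions/master/api_docs/python/tf/layers/batch_normalization, training=True needs to be set while training, because the training argument defaults to False and the layer then normalizes with its moving statistics rather than the current batch's statistics (unless something really funky is happening upstream of apply_norm?)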
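To illustrate why the flag matters, here is a minimal pure-Python sketch of batch normalization over a single scalar feature. It is not TensorFlow's implementation: the class name BatchNorm1D, the momentum and epsilon values, and the inline moving-average update are all assumptions made for illustration.

```python
import math


class BatchNorm1D:
    """Toy batch norm over one scalar feature (illustrative, not TensorFlow's code)."""

    def __init__(self, momentum=0.99, epsilon=1e-3):
        # momentum and epsilon are assumed values chosen for this sketch.
        self.momentum = momentum
        self.epsilon = epsilon
        self.moving_mean = 0.0
        self.moving_var = 1.0

    def __call__(self, batch, training=False):
        if training:
            # Training mode: normalize with the statistics of the current batch.
            mean = sum(batch) / len(batch)
            var = sum((v - mean) ** 2 for v in batch) / len(batch)
            # Update the moving averages inline (a simplification of this sketch).
            self.moving_mean = (self.momentum * self.moving_mean
                                + (1.0 - self.momentum) * mean)
            self.moving_var = (self.momentum * self.moving_var
                               + (1.0 - self.momentum) * var)
        else:
            # Inference mode: normalize with the accumulated moving statistics.
            mean, var = self.moving_mean, self.moving_var
        return [(v - mean) / math.sqrt(var + self.epsilon) for v in batch]


batch = [10.0, 12.0, 14.0]
bn = BatchNorm1D()
inference_out = bn(batch, training=False)  # uses the initial moving stats (0, 1)
train_out = bn(batch, training=True)       # uses the batch mean and variance
```

With the flag left at False on a fresh layer, the outputs stay close to the raw inputs because the initial moving statistics are used; with training=True the outputs are centered around zero.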

This is a relatively easy fix; happy to open a PR, assuming I am not misunderstanding.

t2t master

rsepassi commented 6 years ago

Would welcome a PR adding an is_training kwarg to apply_norm and updating its callers around the codebase.
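A rough sketch of what threading such a kwarg through might look like. The signature below is hypothetical and is not the actual tensor2tensor apply_norm; the batch_norm_fn parameter and the fake_batch_norm stub are stand-ins invented here so the example runs without TensorFlow.

```python
def apply_norm(x, norm_type, batch_norm_fn, is_training=False):
    """Hypothetical dispatcher; the real tensor2tensor signature may differ."""
    if norm_type == "none":
        return x
    if norm_type == "batch":
        # Forward the mode explicitly instead of relying on the layer's default.
        return batch_norm_fn(x, training=is_training)
    raise ValueError("Unknown norm_type: %r" % (norm_type,))


seen_flags = []


def fake_batch_norm(x, training=False):
    """Stand-in for a real batch norm layer; records the flag it receives."""
    seen_flags.append(training)
    return x


# A training-time caller passes the flag; an inference-time caller omits it.
apply_norm([1.0, 2.0], "batch", fake_batch_norm, is_training=True)
apply_norm([1.0, 2.0], "batch", fake_batch_norm)
```

Defaulting is_training to False in this sketch keeps existing callers on their current behavior, so call sites that run during training would need to be updated to pass True.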