Open bitgs opened 4 years ago
This gradient cliping sentence seems wrong. clip_grad_var = [gv if gv[0] is None else [tf.clip_by_norm(gv[0], 100.), gv[1]] for gv in gvs]
when gv[0]==0, gv[0] should be clipped. But here, you will do nothing.
This gradient cliping sentence seems wrong. clip_grad_var = [gv if gv[0] is None else [tf.clip_by_norm(gv[0], 100.), gv[1]] for gv in gvs]
when gv[0]==0, gv[0] should be clipped. But here, you will do nothing.