yajiemiao / eesen

The official repository of the Eesen project
Apache License 2.0

No gradient clipping in parallel version lstm training? #1

Closed by lifelongeek 8 years ago

lifelongeek commented 8 years ago

Hi, I saw gradient clipping in the non-parallel version of the LSTM code (i.e. bilstm-layer.h), but I cannot find the corresponding part in the parallel version (i.e. bilstm-parallel-layer.h). Training seems fine in almost all cases (I tried several architectures of different sizes on swbd), but I wonder whether there is a reason you did not include clipping in the parallel version of the LSTM.

yajiemiao commented 8 years ago

The class BiLstmParallel inherits from BiLstm, so they share the same Update() function, including its gradient clipping.
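
To illustrate the point, here is a minimal sketch of how a derived layer picks up clipping from its base class without repeating it. This is not Eesen's actual code: the `Sketch` class names, the member names, the plain SGD step, and the assumption that clipping is applied element-wise to a `[-clip, clip]` range are all hypothetical and chosen only for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical stand-in for the non-parallel layer (not Eesen's real class).
class BiLstmSketch {
 public:
  BiLstmSketch(std::vector<float> weights, float learn_rate, float clip)
      : weights_(std::move(weights)), learn_rate_(learn_rate), clip_(clip) {
    assert(clip_ > 0.0f);
  }
  virtual ~BiLstmSketch() = default;

  // Clips each gradient element to [-clip_, clip_] (an assumed scheme),
  // then takes a plain SGD step with the clipped value.
  void Update(const std::vector<float>& grad) {
    assert(grad.size() == weights_.size());
    for (std::size_t i = 0; i < grad.size(); ++i) {
      const float g = std::max(-clip_, std::min(clip_, grad[i]));
      weights_[i] -= learn_rate_ * g;
    }
  }

  const std::vector<float>& weights() const { return weights_; }

 protected:
  std::vector<float> weights_;
  float learn_rate_;
  float clip_;
};

// Hypothetical stand-in for the parallel layer. A real subclass would
// override the forward and backward passes; it does not redefine Update(),
// so the clipping in the base class applies to it unchanged.
class BiLstmParallelSketch : public BiLstmSketch {
 public:
  using BiLstmSketch::BiLstmSketch;  // inherit the base constructor
};
```

Calling `Update()` on a `BiLstmParallelSketch` object runs the base-class implementation, so a gradient of `10.0f` with `clip = 1.0f` is treated as `1.0f` before the weight step, even though the derived class contains no clipping code of its own.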

lifelongeek commented 8 years ago

Oh, I didn't see that part. Sorry for my mistake.