Closed chenjw505 closed 3 years ago
First, thanks for your code! It's very helpful! In your code, you calculate mu and sigma using the hidden state h from all three layers,
but the DeepAR paper calculates mu and sigma using h from the last layer only.
Which one is better?
I haven't compared, but I guess it should be straightforward to implement DeepAR's original architecture.
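For reference, here is a rough sketch of how the two variants differ, assuming a 3-layer PyTorch `nn.LSTM` and a Gaussian output with a softplus on sigma. The `GaussianHead` class and its `use_all_layers` flag are hypothetical names made up for this illustration, not the ones used in this repo:

```python
import torch
import torch.nn as nn


class GaussianHead(nn.Module):
    """Hypothetical head mapping LSTM hidden states to (mu, sigma)."""

    def __init__(self, hidden_dim, num_layers, use_all_layers):
        super().__init__()
        self.use_all_layers = use_all_layers
        # All-layers variant sees the concatenation of every layer's h.
        in_dim = hidden_dim * num_layers if use_all_layers else hidden_dim
        self.mu = nn.Linear(in_dim, 1)
        self.presigma = nn.Linear(in_dim, 1)
        self.softplus = nn.Softplus()  # keeps sigma strictly positive

    def forward(self, hidden):
        # hidden: (num_layers, batch, hidden_dim), as returned by nn.LSTM
        if self.use_all_layers:
            # Concatenate h from all layers for each batch element.
            feats = hidden.permute(1, 0, 2).reshape(hidden.shape[1], -1)
        else:
            # Use only the top (last) layer's h.
            feats = hidden[-1]
        mu = self.mu(feats).squeeze(-1)
        sigma = self.softplus(self.presigma(feats)).squeeze(-1)
        return mu, sigma


lstm = nn.LSTM(input_size=4, hidden_size=8, num_layers=3)
x = torch.randn(5, 2, 4)  # (seq_len, batch, features)
_, (h, _) = lstm(x)       # h: (3, 2, 8)

mu_all, sigma_all = GaussianHead(8, 3, use_all_layers=True)(h)
mu_last, sigma_last = GaussianHead(8, 3, use_all_layers=False)(h)
```

Switching between the two only changes the input size of the two linear layers, so comparing them on your own data should be cheap.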