Why use CNN with the output of RNN, instead of original features to extract temporal
information for each variables? Maybe you do some experiments, but forget to mention it in paper?
With different batch, the maximum length of temporal series may be different. How can we deal this issue, if we use static kernel size?
Vice nice project!
But I'm confused with:
Why use CNN with the output of RNN, instead of original features to extract temporal information for each variables? Maybe you do some experiments, but forget to mention it in paper?
With different batch, the maximum length of temporal series may be different. How can we deal this issue, if we use static kernel size?
Cheers!