How to deal with various max_len for different batch?

Vice nice project!

But I'm confused with:

Why use CNN with the output of RNN, instead of original features to extract temporal information for each variables? Maybe you do some experiments, but forget to mention it in paper?
With different batch, the maximum length of temporal series may be different. How can we deal this issue, if we use static kernel size?

Cheers!

shunyaoshih / TPA-LSTM