Closed by daisy-disc 4 years ago
I used a batch size of 16 (bs=16) for single-time-step training (feature aggregation only), and bs=4 for recurrent training (time steps = 4).
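For anyone reproducing this, the two settings can be written down as a small config sketch. The names `TrainStage`, `SINGLE_STEP`, and `RECURRENT` are hypothetical and not from the original code; the sketch also assumes that each recurrent sample is a sequence of 4 frames, so it only illustrates the arithmetic of how many frames one iteration would process under that assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainStage:
    """Hypothetical container for the two training settings quoted above."""
    name: str
    batch_size: int   # "bs" in the comment above
    time_steps: int   # frames per sample (assumed meaning of "time step")

    @property
    def frames_per_iteration(self) -> int:
        # Frames processed in one iteration, assuming each sample
        # holds `time_steps` frames.
        return self.batch_size * self.time_steps


# Values taken from the comment; time_steps=1 for the single-step stage
# is implied by "single time step".
SINGLE_STEP = TrainStage("single_step", batch_size=16, time_steps=1)
RECURRENT = TrainStage("recurrent", batch_size=4, time_steps=4)

for stage in (SINGLE_STEP, RECURRENT):
    print(stage.name, stage.batch_size, stage.time_steps,
          stage.frames_per_iteration)
```

Under that assumption both stages process 16 frames per iteration, which may be a useful starting point when adapting the batch size to a different amount of GPU memory.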