Open xu19971109 opened 1 year ago
N_scale is set the same in train and eval, batch 2 or batch 4 can be used in training, but eval will report out of memory error after batch 2 or batch 4 is run.
N_scale is set the same in train and eval, batch 2 or batch 4 can be used in training, but eval will report out of memory error after batch 2 or batch 4 is run.