Closed RuiFilipeCampos closed 8 months ago
I might need to remove data points to avoid the larger context window
I'm just gonna slice the batches in the celery process
can even exclude the data points that exceed the context window
plenty of time in between slices being trained
currently stuck at batch size of 16
this might not be acceptable due to the large size of this dataset