Open felipemello1 opened 6 days ago
We can add this to the underlying function for tune validate and call config validation at the beginning of the recipe. In fact, we should move all similar validation checks out of the recipes and into a single validation function that can be shared across all recipes.
When we set dataset.packed = True, we expect the tokenizer to have max_seq_len set. If not, we raise an error. However, this error is only raised after the model has already been loaded into memory. We should fail much earlier, possibly in the init of the recipe.
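
To make the proposal concrete, here is a rough sketch of what a shared, fail-fast check could look like. It is not the existing torchtune code: the names `ConfigValidationError`, `validate_packed_dataset`, and `validate_config` are hypothetical, and the config is modeled as a plain nested mapping with `dataset` and `tokenizer` sections rather than the real config object.

```python
from typing import Any, Callable, Mapping, Sequence


class ConfigValidationError(ValueError):
    """Hypothetical error type for inconsistent recipe configs."""


def validate_packed_dataset(cfg: Mapping[str, Any]) -> None:
    """Require tokenizer.max_seq_len whenever dataset.packed is True."""
    # Assumption: the config has top-level "dataset" and "tokenizer" sections.
    dataset = cfg.get("dataset") or {}
    tokenizer = cfg.get("tokenizer") or {}
    if dataset.get("packed", False) and tokenizer.get("max_seq_len") is None:
        raise ConfigValidationError(
            "dataset.packed=True requires tokenizer.max_seq_len to be set."
        )


# Hypothetical registry: every recipe-independent check lives in one place.
CHECKS: Sequence[Callable[[Mapping[str, Any]], None]] = (
    validate_packed_dataset,
)


def validate_config(cfg: Mapping[str, Any]) -> None:
    """Run all shared checks; meant to be called before any model is loaded."""
    for check in CHECKS:
        check(cfg)
```

A recipe could call `validate_config(cfg)` as the first statement of its `__init__`, and the function behind tune validate could call the same entry point, so a bad config fails in both places before any weights are loaded.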