qgallouedec closed this 2 months ago
Oh, that would be really great! Could you test it with some larger models to see if it indeed works, e.g. Mistral 7B or Gemma 2 27B?
Actually, there are still these warnings everywhere:
I'll investigate further.
If I understand correctly, these lines:
https://github.com/huggingface/trl/blob/b6af2edc93b275afcee22a3eb71f9a5702ff9fd8/examples/scripts/dpo.py#L111
are there because, previously, using the cache together with gradient checkpointing was broken; see https://github.com/huggingface/trl/issues/145#issuecomment-1459735966.
Since https://github.com/huggingface/transformers/issues/21737 has been resolved, I think we can simplify these lines.
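Roughly, and assuming the lines in question set `use_cache=False` whenever gradient checkpointing is enabled (I'm sketching from memory, so the model name and arguments below are just placeholders, not the actual script code), something like:

```python
from transformers import AutoModelForCausalLM, TrainingArguments

training_args = TrainingArguments(output_dir="out", gradient_checkpointing=True)

# Current workaround (assumed): turn the KV cache off ourselves whenever
# gradient checkpointing is enabled, because the two used to clash.
model = AutoModelForCausalLM.from_pretrained(
    "gpt2",  # placeholder model
    use_cache=False if training_args.gradient_checkpointing else True,
)

# Possible simplification now that the transformers issue is fixed: just load
# the model and enable gradient checkpointing via TrainingArguments, without
# touching use_cache here.
model = AutoModelForCausalLM.from_pretrained("gpt2")
```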
wdyt?