axolotl-ai-cloud / axolotl

Go ahead and axolotl questions
https://axolotl-ai-cloud.github.io/axolotl/
Apache License 2.0
7.87k stars 866 forks source link

RL training smoke tests #1072

Open winglian opened 10 months ago

winglian commented 10 months ago

⚠️ Please check that this feature request hasn't been suggested before.

🔖 Feature description

We need to add smoke tests and integration tests for axolotl w. trl to ensure we have compatibility with them as both projects iterate. We can start with the supported methods such as DPO, IPO, KTO-pairs. Let's just use some tiny models like tiny-llama or sheared llama.

✔️ Solution

add tests under e2e for dpo, kto_pair, and iso

❓ Alternatives

No response

📝 Additional Context

No response

Acknowledgements

JohanWork commented 9 months ago

Was looking into to trying to solve this issue, but believe it already is solved here https://github.com/OpenAccess-AI-Collective/axolotl/blob/main/tests/e2e/test_dpo.py or @winglian