Update Qwen2.5 configs - Githubissues

joecummings commented 1 week ago

I turned activation checkpointing off for all 0.5B models and for 1.5B LoRA models. No point.
I turned on memory logging

Everything else is cosmetic.

pytorch-bot[bot] commented 1 week ago

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/1999

:page_facing_up: Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

:white_check_mark: No Failures

As of commit 8596e5acce71642a37ef81eab34216a37fc590ed with merge base 18d97f0eb15f50b0d2e569c1ba136de8907edabb (): :green_heart: Looks good so far! There are no failures yet. :green_heart:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

joecummings commented 1 week ago

I thought we use underscores instead of periods?: #1863 (comment)

Yeah this is a misleading comment. We do use underscores for model builders, but the model should just get downloaded to a directory with the exact same name as the model on the Hub.

pytorch / torchtune

Update Qwen2.5 configs #1999

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/1999

:white_check_mark: No Failures