pytorch / torchtune

A Native-PyTorch Library for LLM Fine-tuning
BSD 3-Clause "New" or "Revised" License
3.53k stars 285 forks source link

[WIP] Sample packing deep dive #1083

Open RdoubleA opened 2 weeks ago

RdoubleA commented 2 weeks ago

Context

Deep dive in live docs on sample packing with in-sample masking and position encoding and how it impacts training throughput and model quality

Test plan

Please make sure to do each of the following if applicable to your PR. (If you're not sure about any one of these just ask and we will happily help.)

pytorch-bot[bot] commented 2 weeks ago

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/1083

Note: Links to docs will display an error until the docs builds have been completed.

:x: 1 New Failure

As of commit 9286a0fc6b34c960f54c37de26989e3eaf7435d1 with merge base 0e0aff0bbfeb3e67175ccdab391d539284831ae2 (image):

NEW FAILURE - The following job has failed:

* [Build Docs / build_docs (3.11)](https://hud.pytorch.org/pr/pytorch/torchtune/1083#26159345117) ([gh](https://github.com/pytorch/torchtune/actions/runs/9492349980/job/26159345117))

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ebsmothers commented 1 week ago

doc build failed