pytorch / xla

Enabling PyTorch on XLA Devices (e.g. Google TPU)
https://pytorch.org/xla

Add example for training small LLM #7189

Open duncantech opened 5 months ago

duncantech commented 5 months ago

📚 Documentation

Create an example on how to train a small LLM.

Add it to the examples directory here: https://github.com/pytorch/xla/tree/master/examples

bhavyashahh commented 5 months ago

/assigntome

duncantech commented 5 months ago

@bhavyashahh We haven't heard an update on this in a bit. Are you still working on it?

bhavyashahh commented 5 months ago

@duncantech What is expected from the LLM training example? Does train_decoder_only_base.py not already serve as the training script? Or should the example use an open-source LLM, prepare data, train for a few steps, and then run prediction?

duncantech commented 5 months ago

You can use any open-source LLM, but we wanted to have more than one sample.
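
For reference, the flow discussed above (prepare data, train a few steps, predict) might look roughly like the sketch below. It is only an illustration under stated assumptions, not the contents of train_decoder_only_base.py or any file in the examples directory: TinyDecoder, get_batch, the toy character-level dataset, and all hyperparameters are hypothetical names and values chosen here. It also assumes a single device and falls back to CPU when torch_xla is not installed.

```python
import torch
import torch.nn as nn

# Assumption: use an XLA device when torch_xla is installed, otherwise plain CPU.
try:
    import torch_xla.core.xla_model as xm
    device = xm.xla_device()
except ImportError:
    xm = None
    device = torch.device("cpu")

torch.manual_seed(0)

# 1. Prepare data: a toy character-level corpus (hypothetical, for illustration only).
text = "hello xla " * 50
vocab = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(vocab)}
data = torch.tensor([stoi[ch] for ch in text], dtype=torch.long)
block_size, batch_size = 16, 8


def get_batch():
    """Sample random windows; targets are the inputs shifted by one character."""
    starts = torch.randint(0, len(data) - block_size - 1, (batch_size,)).tolist()
    x = torch.stack([data[s:s + block_size] for s in starts])
    y = torch.stack([data[s + 1:s + block_size + 1] for s in starts])
    return x.to(device), y.to(device)


class TinyDecoder(nn.Module):
    """Hypothetical minimal decoder-only model: embeddings, one causal block, LM head."""

    def __init__(self, vocab_size, d_model=32, n_head=2, n_layer=1):
        super().__init__()
        self.tok = nn.Embedding(vocab_size, d_model)
        self.pos = nn.Embedding(block_size, d_model)
        layer = nn.TransformerEncoderLayer(
            d_model, n_head, dim_feedforward=64, dropout=0.0, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, n_layer)
        self.head = nn.Linear(d_model, vocab_size)

    def forward(self, idx):
        t = idx.size(1)
        pos = torch.arange(t, device=idx.device)
        # Causal mask: -inf above the diagonal blocks attention to future positions.
        mask = torch.triu(
            torch.full((t, t), float("-inf"), device=idx.device), diagonal=1)
        h = self.blocks(self.tok(idx) + self.pos(pos), mask=mask)
        return self.head(h)


# 2. Train for a few steps.
model = TinyDecoder(len(vocab)).to(device)
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-3)
loss_fn = nn.CrossEntropyLoss()
losses = []
for step in range(50):
    x, y = get_batch()
    logits = model(x)
    loss = loss_fn(logits.reshape(-1, len(vocab)), y.reshape(-1))
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    if xm is not None:
        xm.mark_step()  # on XLA, cut the lazily traced graph and execute it
    losses.append(loss.item())

# 3. Predict: greedy decoding from a short prompt.
model.eval()
idx = torch.tensor([[stoi[ch] for ch in "hello"]], device=device)
with torch.no_grad():
    for _ in range(10):
        logits = model(idx[:, -block_size:])
        next_id = logits[:, -1].argmax(dim=-1, keepdim=True)
        idx = torch.cat([idx, next_id], dim=1)
generated = "".join(vocab[i] for i in idx[0].tolist())
print(f"loss {losses[0]:.3f} -> {losses[-1]:.3f}; sample: {generated!r}")
```

An example built on an open-source LLM would presumably replace TinyDecoder and the toy corpus with a pretrained model and a real tokenized dataset, while keeping the same three stages.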