example on finetuning Llama2-70b model on multiple nodes

skypilot-org / skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

https://skypilot.readthedocs.io

Apache License 2.0

6.8k stars 512 forks source link

example on finetuning Llama2-70b model on multiple nodes #2815

Open giaosudau opened 12 months ago

giaosudau commented 12 months ago

I am finding an example on how to finetuning Llama2-70b model on multiple nodes

concretevitamin commented 12 months ago

This example on finetuning Llama on 1 node is a probably good starting point:

https://github.com/skypilot-org/skypilot/tree/master/llm/vicuna-llama-2
https://github.com/skypilot-org/skypilot/blob/master/llm/vicuna-llama-2/train.yaml

github-actions[bot] commented 1 month ago

This issue is stale because it has been open 120 days with no activity. Remove stale label or comment or this will be closed in 10 days.

romilbhardwaj commented 1 month ago

Bumping this - we should add multi-node fine-tuning examples for the llama model family.