Closed lchu-ibm closed 2 months ago
Add llama3 8b config.
We also expose vocab_size so the dummy dataloader can be configured for llama3, overriding the default 32k vocabulary with 128k.
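A minimal sketch of the idea, not the code in this PR: the `TrainConfig` class, its field names, and the `dummy_batches` helper are hypothetical and only illustrate how an exposed vocab_size lets a dummy dataloader switch from the 32k default to a llama3-sized vocabulary. The value 128256 is the Llama 3 tokenizer size, which the description rounds to 128k.

```python
import random
from dataclasses import dataclass
from typing import Iterator, List


@dataclass
class TrainConfig:
    # Hypothetical config: field names are illustrative, not the repo's own.
    vocab_size: int = 32000  # default 32k vocabulary
    seq_length: int = 8
    batch_size: int = 2


def dummy_batches(
    cfg: TrainConfig, num_batches: int, seed: int = 0
) -> Iterator[List[List[int]]]:
    """Yield batches of random token ids drawn from [0, cfg.vocab_size)."""
    rng = random.Random(seed)
    for _ in range(num_batches):
        yield [
            [rng.randrange(cfg.vocab_size) for _ in range(cfg.seq_length)]
            for _ in range(cfg.batch_size)
        ]


# Override the 32k default so the dummy token ids cover the llama3 vocabulary.
llama3_cfg = TrainConfig(vocab_size=128256)
batch = next(dummy_batches(llama3_cfg, num_batches=1))
```

Without the override, the dummy ids would never exceed 32k, so the upper part of a 128k embedding table would go unexercised in a dummy run.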