Open aaab8b opened 1 week ago
The Hugging Face checkpoints for Llama (7B, 13B) don't seem to load directly when training MiniLLM, since they are not split for model parallelism. Is there any way to convert the weights to mp=4?
We provide a script that changes the model-parallel (mp) size of a checkpoint. You can try it.
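For anyone wondering what such a conversion does conceptually, here is a minimal sketch of splitting one weight matrix into mp=4 shards. It is not the repository's script: the helper names `split_rows` and `split_cols` are hypothetical, the weight is a toy nested list rather than a real tensor, and which layers get split along which axis depends on the model-parallel implementation.

```python
def split_rows(weight, mp):
    """Split a 2-D weight (list of rows) into mp equal shards along the rows."""
    rows = len(weight)
    if rows % mp != 0:
        raise ValueError(f"{rows} rows cannot be split evenly into {mp} shards")
    step = rows // mp
    return [weight[i * step:(i + 1) * step] for i in range(mp)]


def split_cols(weight, mp):
    """Split a 2-D weight (list of rows) into mp equal shards along the columns."""
    cols = len(weight[0])
    if cols % mp != 0:
        raise ValueError(f"{cols} columns cannot be split evenly into {mp} shards")
    step = cols // mp
    return [[row[i * step:(i + 1) * step] for row in weight] for i in range(mp)]


# Toy 8x4 weight standing in for a real layer (assumption for illustration).
weight = [[r * 4 + c for c in range(4)] for r in range(8)]

mp = 4
row_shards = split_rows(weight, mp)  # one shard per model-parallel rank
col_shards = split_cols(weight, mp)

# Concatenating the row shards in rank order recovers the original weight.
merged = [row for shard in row_shards for row in shard]
```

Each rank would then save its own shard. A real conversion also has to decide, per parameter, whether to split along rows, split along columns, or replicate the tensor unchanged on every rank.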