Open chensimian opened 10 months ago
Hi, please install apex from https://github.com/NVIDIA/apex, or set enable_fused_normalization to False.
I have installed it, but it is not working.
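For anyone who wants to try the second suggestion (turning the fused kernel off), a minimal sketch follows. It assumes ColossalAI's HybridParallelPlugin accepts enable_fused_normalization as a keyword argument; the helper build_plugin_kwargs and the tp_size / pp_size values are hypothetical and only for illustration.

```python
def build_plugin_kwargs(tp_size: int = 2, pp_size: int = 1,
                        use_fused_norm: bool = False) -> dict:
    """Hypothetical helper: collect keyword arguments for the plugin.

    use_fused_norm defaults to False so that the apex fused RMSNorm
    kernel is not required.
    """
    return {
        "tp_size": tp_size,    # illustrative value, adjust to your setup
        "pp_size": pp_size,    # illustrative value, adjust to your setup
        "enable_fused_normalization": use_fused_norm,
    }


kwargs = build_plugin_kwargs()
print(kwargs)

# Assumed usage (needs ColossalAI and a launched distributed environment):
# from colossalai.booster.plugin import HybridParallelPlugin
# plugin = HybridParallelPlugin(**kwargs)
```

With the flag off, the model should keep its regular LlamaRMSNorm layers, so the replacement step that raised the error is skipped, at the cost of not using the fused kernel.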
Maybe the apex version is not correct. Can you try running "from apex.normalization import FusedRMSNorm"?
Me too !!
RuntimeError: Failed to replace input_layernorm of type LlamaRMSNorm with FusedRMSNorm with the exception: No module named 'fused_layer_norm_cuda'. Please check your model configuration or sharding policy, you can set up an issue for us to help you as well.
And I saw this prompt in examples/language/llama2/scripts/benchmark_70B/3d.sh
# TODO: fix this
echo "3D parallel for LLaMA-2 is not ready yet"
Does it mean that even if I deploy apex correctly, I won't be able to use hybrid_parallel properly?
Hybrid parallelism works normally now. Could you start Python and run "from apex.normalization import FusedRMSNorm" to see if it succeeds?
Yes, python -c "from apex.normalization import FusedRMSNorm" runs successfully.
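One possible explanation, not confirmed in this thread: apex can be installed without its compiled CUDA extensions, in which case the Python-level import works while the kernel module named in the error (fused_layer_norm_cuda) is still missing. The sketch below checks for both. The helper names module_available and diagnose are made up for this example.

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if `name` can be located without fully importing it."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # Raised when a parent package is missing or a spec is malformed.
        return False


def diagnose() -> dict:
    """Report which apex-related modules are visible to this interpreter."""
    return {
        "apex": module_available("apex"),
        "apex.normalization": module_available("apex.normalization"),
        # Compiled extension named in the RuntimeError in this thread.
        "fused_layer_norm_cuda": module_available("fused_layer_norm_cuda"),
    }


if __name__ == "__main__":
    for name, ok in diagnose().items():
        print(f"{name}: {'found' if ok else 'MISSING'}")
```

If apex is found but fused_layer_norm_cuda is missing, rebuilding apex from source with CUDA extensions enabled, as described in the apex README, is probably the next thing to try.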
https://blog.csdn.net/iteapoy/article/details/117389407 , please try this.
Can you share your pip list and your cuda version?
🐛 Describe the bug
raise RuntimeError(
RuntimeError: Failed to replace input_layernorm of type LlamaRMSNorm with FusedRMSNorm with the exception: Please install apex from source (https://github.com/NVIDIA/apex) to use the fused RMS normalization kernel. Please check your model configuration or sharding policy, you can set up an issue for us to help you as well.
Environment