ROCm / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
https://www.deepspeed.ai/
Apache License 2.0
5 stars 3 forks source link

__half2_raw initiation error workaround for transformer inference on ROCm #60

Closed rraminen closed 1 year ago

rraminen commented 1 year ago

Error:

/opt/conda/lib/python3.8/site-packages/deepspeed/ops/csrc/includes/reduction_utils_hip.h:278:43: error: excess elements in struct initializer constexpr __half2_raw zero = {0x0000, 0x0000}; ^~

Related JIRA for proper fix: https://ontrack-internal.amd.com/browse/SWDEV-395559