Closed gody7334 closed 3 years ago
I am training EncNet with fastai. It works without mixed-precision training, but when I add learn().to_fp16(), I get this error:
RuntimeError: "ScaledL2_Forward_CUDA" not implemented for 'Half'
I saw that the PyTorch apex library just adds modules to an fp32 list. Any suggestions for adding Half support to a custom CUDA operation?
Thanks.
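For reference, the fp32-list idea mentioned above can be approximated by hand: cast the inputs to float32 before calling the op and cast the result back afterwards. The snippet below is only a sketch under assumptions. `run_in_fp32` and `fp32_only_scaled_l2` are hypothetical names, and the second one is a stand-in that mimics a kernel with no Half support; it is not the real CUDA extension.

```python
import torch


def run_in_fp32(op):
    """Wrap `op` so floating-point tensor arguments are cast to float32
    and the result is cast back to the original input dtype."""
    def wrapper(*args):
        # Remember the dtype of the first floating-point tensor argument.
        out_dtype = next(
            (a.dtype for a in args
             if torch.is_tensor(a) and a.is_floating_point()),
            torch.float32,
        )
        # Upcast floating-point tensors; leave everything else untouched.
        cast_args = [
            a.float() if torch.is_tensor(a) and a.is_floating_point() else a
            for a in args
        ]
        return op(*cast_args).to(out_dtype)
    return wrapper


def fp32_only_scaled_l2(x, codewords, scale):
    """Hypothetical stand-in for an op that only has a float32 kernel."""
    if x.dtype != torch.float32:
        raise RuntimeError('"ScaledL2_Forward_CUDA" not implemented for \'Half\'')
    # x: (B, N, D), codewords: (K, D), scale: (K,) -> output: (B, N, K)
    diff = x.unsqueeze(2) - codewords.view(1, 1, *codewords.shape)
    return scale.view(1, 1, -1) * diff.pow(2).sum(dim=3)


# Half inputs go in, the op runs in float32, half output comes back.
safe_scaled_l2 = run_in_fp32(fp32_only_scaled_l2)
```

This trades some speed and memory for compatibility, since the op itself still runs in full precision.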
You may use this pure Python implementation instead of the custom CUDA op: https://github.com/open-mmlab/mmsegmentation/blob/9f071cade8cdc59c13b416c7c9843005410c055c/mmseg/ops/encoding.py#L6
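To illustrate the idea, here is a minimal sketch of a scaled L2 distance written with ordinary PyTorch tensor ops, so it does not depend on a dtype-specific custom kernel. It is not a copy of the linked file. It assumes inputs of shape (B, N, D), codewords of shape (K, D), one scale per codeword, and that the op computes scale_k * ||x_i - c_k||^2.

```python
import torch


def scaled_l2(x, codewords, scale):
    """Scaled squared L2 distance between features and codewords.

    Assumed shapes: x (B, N, D), codewords (K, D), scale (K,).
    Returns a tensor of shape (B, N, K).
    """
    # Broadcast to (B, N, K, D): every feature against every codeword.
    diff = x.unsqueeze(2) - codewords.view(1, 1, *codewords.shape)
    # Sum squared differences over D, then scale per codeword.
    return scale.view(1, 1, -1) * diff.pow(2).sum(dim=3)


# Small example on CPU in float32.
x = torch.randn(2, 6, 8)
codewords = torch.randn(4, 8)
scale = torch.rand(4)
out = scaled_l2(x, codewords, scale)  # shape (2, 6, 4)
```

Because it only uses built-in ops, the same function should accept half-precision tensors on the GPU. The intermediate (B, N, K, D) tensor does cost more memory than a fused kernel would.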
Thanks :+1:
Resolved