apache / tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators
https://tvm.apache.org/
Apache License 2.0

[TOPI][Testing] Enable conv2d NHWC fp16 topi testing for `arm_cpu` #17007

Closed Anndrey24 closed 2 months ago

Anndrey24 commented 2 months ago

This commit adds fp16 test cases for the conv2d NHWC TOPI schedules on `arm_cpu`. Following the example of #8529, the numpy reference conv2d output is computed in fp32 rather than fp16, and the absolute tolerance is set per test case based on the size of the summed axis and the largest element of the output.
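For illustration only, here is a minimal numpy sketch of the idea described above. It is not the test code from this PR: the helper names `conv2d_nhwc_ref` and `fp16_atol` are hypothetical, and the tolerance formula (fp16 machine epsilon, scaled by the square root of the reduction size and by the largest output magnitude) is an assumed heuristic rather than the one the PR uses.

```python
import numpy as np


def conv2d_nhwc_ref(data, kernel, stride=1):
    """Naive NHWC conv2d (HWIO kernel, no padding), accumulated in fp32.

    Hypothetical helper: inputs may be fp16, but they are upcast so the
    reference output does not carry fp16 accumulation error.
    """
    data32 = data.astype(np.float32)
    kernel32 = kernel.astype(np.float32)
    n, h, w, _ = data32.shape
    kh, kw, _, o = kernel32.shape
    oh = (h - kh) // stride + 1
    ow = (w - kw) // stride + 1
    out = np.zeros((n, oh, ow, o), dtype=np.float32)
    for y in range(oh):
        for x in range(ow):
            ys, xs = y * stride, x * stride
            patch = data32[:, ys:ys + kh, xs:xs + kw, :]
            # Contract over (kh, kw, in_channels): this is the summed axis.
            out[:, y, x, :] = np.tensordot(
                patch, kernel32, axes=([1, 2, 3], [0, 1, 2])
            )
    return out


def fp16_atol(ref, reduction_size):
    """Assumed heuristic: the tolerance grows with the summed-axis size
    and with the largest element of the reference output."""
    eps = float(np.finfo(np.float16).eps)
    return eps * float(np.sqrt(reduction_size)) * float(np.abs(ref).max())


rng = np.random.default_rng(0)
data = rng.uniform(-1, 1, size=(1, 8, 8, 4)).astype(np.float16)
kernel = rng.uniform(-1, 1, size=(3, 3, 4, 8)).astype(np.float16)

ref = conv2d_nhwc_ref(data, kernel)
atol = fp16_atol(ref, reduction_size=3 * 3 * 4)
```

A test along these lines would then compare the fp16 output of the schedule under test against `ref` with something like `np.testing.assert_allclose(out, ref, rtol=0, atol=atol)`, so that larger reductions and larger output values are given proportionally more slack.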

cc @ekalda @lhutton1

ekalda commented 2 months ago

Thanks @Anndrey24 and @lhutton1!