Closed · Blaok closed this issue 6 years ago
In the current design, you cannot simply return the output. The output is passed in as an argument, so what you need to do is "update" the output in place. Following is an example:

```python
def top(input_, output_):
    # compute body
    # write the final result into the caller-provided output tensor
    return hcl.update(output_, lambda x: input_[x] + 1)
```
Following is the incorrect way:

```python
def top(input_, output_):
    # compute body
    # WRONG: allocates a new tensor instead of updating output_
    return hcl.compute(output_.shape, lambda x: input_[x] + 1)
```
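The distinction mirrors ordinary Python argument semantics: mutating the caller-provided buffer is visible to the caller, while binding a new object is not. A minimal sketch in plain Python (not HeteroCL; the function names are hypothetical):

```python
def update_in_place(input_, output_):
    # write results into the caller-provided buffer (like hcl.update)
    for i in range(len(input_)):
        output_[i] = input_[i] + 1

def return_new(input_, output_):
    # builds a fresh list (like hcl.compute); the caller's buffer is untouched
    return [x + 1 for x in input_]

src = [1, 2, 3]

dst = [0, 0, 0]
update_in_place(src, dst)
# dst is now [2, 3, 4]

dst2 = [0, 0, 0]
result = return_new(src, dst2)
# result is [2, 3, 4], but dst2 is still [0, 0, 0]
```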
When testing the stencil backend, I found that in the IR generated for the Gaussian benchmark, the output tensor is explicitly allocated. I believe this is incorrect because the interface already generates an implicit tensor allocation by calling `tvm_struct_get`. The blur benchmark works fine.

This unexpected tensor allocation breaks SODA code generation. More specifically, it invalidates the `VarExpr` comparison, because the newly generated `Variable` used in the IR is not linked to the interface. This results in incorrect detection of `output` or `local` tensors in SODA. As a workaround, I had to compare by `name_hint`, but it may not work in other situations, as the name suggests.

The IR is printed in the `test_soda.py` unit test and can be reproduced by running `python -m unittest test_soda` in `heterocl/heterocl/tests`.
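Why the `name_hint` workaround is fragile can be shown with a toy sketch (plain Python stand-ins, not the actual TVM IR classes): two distinct variable nodes fail an identity comparison even when they carry the same name, and conversely a name-based comparison would falsely match any two tensors that happen to share a name.

```python
class Variable:
    # stand-in for an IR variable node; node identity, not the name,
    # is what links a use site back to the interface
    def __init__(self, name_hint):
        self.name_hint = name_hint

iface_var = Variable("output")  # variable created for the interface
ir_var = Variable("output")     # freshly allocated variable in the IR

# Identity comparison fails: these are distinct nodes
same_node = iface_var is ir_var          # False

# Name-based workaround matches here, but would also match any
# unrelated variable that reuses the name "output"
same_name = iface_var.name_hint == ir_var.name_hint  # True
```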