tlc-pack / tvm-tensorir

Apache License 2.0
8 stars 0 forks source link

[TIR][Schedule] Software pipelining #533

Closed vinx13 closed 2 years ago

vinx13 commented 2 years ago

This PR contains software pipelining for CUDA (without async memcpy). This is a working version but the code need polishment and more test cases

vinx13 commented 2 years ago

@junrushao1994 @jinhongyii @spectrometerHBH this is ready for review