Closed sanandaraj5597 closed 2 months ago
1) Removed the unwanted MemcpyDtoD kernels that were launched when CPU offloading is enabled.
2) When initializing the Parameter, pass both the weight and weight.requires_grad, so the weight's original requires_grad setting is preserved.
3) Applied the fix from point 2 to the grouped_linear module as well.
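
To illustrate point 2, here is a minimal sketch in plain PyTorch, not the actual change in this PR. The tensor shape and the variable names are hypothetical. It relies only on the fact that torch.nn.Parameter defaults to requires_grad=True, so re-wrapping a frozen weight without passing the flag silently makes it trainable again.

```python
import torch

# Hypothetical frozen weight (requires_grad=False), e.g. a layer the user froze.
weight = torch.nn.Parameter(torch.zeros(4, 4), requires_grad=False)

# Re-wrapping without the flag falls back to the default requires_grad=True.
rewrapped_default = torch.nn.Parameter(weight)

# Passing weight.requires_grad explicitly keeps the original setting.
rewrapped_preserved = torch.nn.Parameter(weight, requires_grad=weight.requires_grad)

print(rewrapped_default.requires_grad)
print(rewrapped_preserved.requires_grad)
```

The first Parameter reports requires_grad as True even though the source weight was frozen, while the second keeps it False.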
/te-ci pytorch