Closed xiachong94 closed 8 months ago
from torch_int._CUDA import bmm_s8t_s8n_s8t import torch bmm_s8t_s8n_s8t(torch.randint(-128, 127, (64,4,4), dtype=torch.int8).cuda(), torch.randint(-128, 127, (64, 64, 4), dtype=torch.int8).cuda(), 0.001)
Traceback (most recent call last): File "", line 1, in RuntimeError: cutlass cannot implement
from torch_int._CUDA import bmm_s8t_s8n_s8t import torch bmm_s8t_s8n_s8t(torch.randint(-128, 127, (64,4,4), dtype=torch.int8).cuda(), torch.randint(-128, 127, (64, 64, 4), dtype=torch.int8).cuda(), 0.001)
Traceback (most recent call last): File "", line 1, in
RuntimeError: cutlass cannot implement