Closed awf closed 3 years ago
So now we can compare ks and ks_fast. Note that this one calls sufrev on every element, so essentially the "backward" includes the forward time.
vrelu3 PyTorch fast 1.692170e+04 Runs: 1541 | Inference: 1.182 ms | Forward: 1.123 ms | Backward 0.924 ms | torch.Size([65536]) vrelu3 Knossos fast 1.692170e+04 Runs: 1442 | Inference: 0.858 ms | Forward: 0.858 ms | Backward 1.739 ms | torch.Size([65536]) vrelu3 Knossos 1.692170e+04 Runs: 528 | Inference: 2.715 ms | Forward: 2.766 ms | Backward 3.984 ms | torch.Size([65536])
Thanks, this works nicely.
So now we can compare ks and ks_fast. Note that this one calls sufrev on every element, so essentially the "backward" includes the forward time.