Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
Apache License 2.0
1.2k
stars
80
forks
source link
`thunder.jit`ted `Tensor.masked_fill` and `Tensor.masked_fill_` return `torcdh.float32` tensor even when input is `torch.int64` #1083
Note: If you have a model or program that is not supported yet but should be, please use the program coverage template.
🐛 Bug
As per title.
To Reproduce
Expected behavior
Returns the same dtype as the pytorch result.
Environment
conda
,pip
, source):Additional context