backend parameter not respected in Genred

getkeops / keops

KErnel OPerationS, on CPUs and GPUs, with autodiff and without memory overflows

MIT License


Hi again,

I've encountered another small issue, and I'm not sure whether it's worth fixing.

When running KeOps with the CPU backend on a machine where CUDA is available, the Rsqrt operation fails. Some code to reproduce the error follows:

import torch
import pykeops
from pykeops.torch import Genred

def kernel(v):
    formula = 'Rsqrt(v)'
    fn = Genred(formula, ['v=Vi(2)'], reduction_op='Sum', axis=1)
    res = fn(v, backend='CPU')
    return res

def test():
    assert torch.cuda.is_available()
    v = torch.randn(100, 10)
    return kernel(v)

if __name__ == "__main__":
    test()

I think I tracked it down to the codegen not having access to the chosen backend ('cpu', 'gpu_1d', etc.), so it generates code for the CPU or the GPU based on the global config.use_cuda variable instead. This is a problem for rsqrt, since the generated code uses a function that is only available in nvcc.

As a workaround, I can modify the global variable in addition to setting the backend.
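For illustration, here is a minimal sketch of that workaround as a context manager that flips the global flag for one call and then restores it. It does not import KeOps: `config` is a hypothetical stand-in for the real global config object (whose module path may differ between versions), and `rsqrt_code` is a toy function that only mimics a codegen reading the global flag. The returned strings are illustrative, not what KeOps actually emits.

```python
from contextlib import contextmanager
from types import SimpleNamespace

# Hypothetical stand-in for the global KeOps config object (assumption).
config = SimpleNamespace(use_cuda=True)


@contextmanager
def override_flag(obj, name, value):
    """Temporarily set obj.<name> to value and restore the old value on exit."""
    previous = getattr(obj, name)
    setattr(obj, name, value)
    try:
        yield obj
    finally:
        setattr(obj, name, previous)


def rsqrt_code():
    """Toy codegen: chooses the rsqrt implementation from the global flag only."""
    return "rsqrt(x)" if config.use_cuda else "1.0 / sqrt(x)"
```

In real use, the body of the `with override_flag(config, "use_cuda", False):` block would be the `fn(v, backend='CPU')` call, so the flag and the backend argument agree while the code is generated.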


backend parameter not respected in Genred #248