This introduces the use_current_device function which initializes CUBLAS for the currently active device, thus allowing the use of blas.jl functions on non-default GPU.
In addition, move initialization (and registration of the finalizer) into __init__ so that we can make use of pre-compilation in the future.
This introduces the
use_current_device
function which initializes CUBLAS for the currently active device, thus allowing the use ofblas.jl
functions on non-default GPU.In addition, move initialization (and registration of the finalizer) into
__init__
so that we can make use of pre-compilation in the future.The default behavior should be unchanged by this.