-
gtsam_points/cuda/kernels/vgicp_derivatives.cuh(49): error: calling a __host__ function("Eigen::MatrixBase< ::Eigen::CwiseBinaryOp< ::Eigen::internal::scalar_sum_op , const ::Eigen::Matrix , const :…
-
The following are completed in #3150 for `n = 2`:
- [x] awkward_ListArray_combinations
- [x] awkward_RegularArray_combinations_64
The following are in completed in #3149:
- [x] awkward_reduc…
-
### System Info
After training `Zyphra/Zamba2-1.2B` trying to run inference on CPU but got an error:
```
File "virtual_envs/neural_asr_training/lib/python3.10/site-packages/causal_conv1d/causal…
-
## Description
We support a limited subset of kernels with HIP devices compared to CUDA.
### Describe the solution you'd like
Every algorithm that is CUDA accelerated should also be HIP accel…
-
I was able to create a plugin and this repo really helped with the boiler plate code.
One addition that I want to make in my `enqueue` function is that I want to call my cuda binding.
How can I make…
-
```
(TinyChatEngine) zhef@zhef:~/TinyChatEngine/llm$ make chat -j
CUDA is available!
src/Generate.cc src/LLaMATokenizer.cc src/OPTGenerate.cc src/OPTTokenizer.cc src/utils.cc src/nn_modules/Fp32OPT…
-
Hey,
I am working deeply on your code.
I would like to ask you a favor, and if you could please help me to understand the cuda kernels.
My email adress is thomasc@helix.re
I have benchmark you…
-
Hi, guys,
I compiled the example code for cusparseLt here: [https://github.com/NVIDIA/CUDALibrarySamples/tree/master/cuSPARSELt/matmul](url), which I used the default problem size, and used Nsight sy…
-
Hi! Thank you for your amazing work!
I'm having some trouble on comparing the fused swiglu kernel with its plain pytorch version.
I checked the wall clock time with code below, and it gives me l…
-
I‘m trying to follow [this](https://github.com/mit-han-lab/llm-awq#install) to install awq.
But failed at step 3.
## My Env
```
OS: Windows 11
GPU: NVIDIA GeForce RTX4060
Driver Version: 536.4…