but when I did I got a runtime error stating the output should be a cuda tensor.
I am not sure if this error is from my side or if the code is from the code. This is the error I am given
The cuda version I use is 12.4, python version 3.10.12, ninja version 1.11.1.git.kitware.jobserver-1, torch version 2.2.2.
Hi!
I tried using the benchmark text generation
python -m benchmarks.bench_textgen_lora --system punica --batch-size 32
but when I did I got a runtime error stating the output should be a cuda tensor. I am not sure if this error is from my side or if the code is from the code. This is the error I am given
The cuda version I use is 12.4, python version 3.10.12, ninja version 1.11.1.git.kitware.jobserver-1, torch version 2.2.2.