jrudolph / llama2.scala

Inference Llama 2 in Scala with AVX2 kernels in C (A port of llama2.c from Andrej Karpathy)

Try GPU acceleration with TornadoVM #5

Open jrudolph opened 1 year ago

jrudolph commented 1 year ago

As suggested by @plokhotnyuk in https://twitter.com/aplokhotnyuk/status/1686077196589305856