Open RainYQ opened 5 months ago
Please consider providing official support for speeding up CodeLlama inference.
We will release it after testing.
The CodeLlama example can be found here.