LostRuins / koboldcpp

A simple one-file way to run various GGML and GGUF models with KoboldAI's UI
https://github.com/lostruins/koboldcpp
GNU Affero General Public License v3.0
4.36k stars 312 forks source link

CUDA 12 CI #815

Closed henk717 closed 2 months ago

henk717 commented 2 months ago

This PR does a couple of changes, including one minor breaking change.

Keep the following in mind:

LostRuins commented 2 months ago

I'm thinking for the binary filename we can leave the old name to be the same as it was for CUDA11, to avoid breaking any existing links or workflows.

The CUDA 12 version can be created with the new suffix instead.

So the "default" normal build is the same as before (CUDA11) and then people who need cuda12 and don't mind massive files can get the oversized version.

LostRuins commented 2 months ago

Let me get 1.64 release out first.

henk717 commented 2 months ago

This would enable a 1.64 cuda 12 build for Linux. Old binary name is a bit trickier in the way I built it but it could be done with additional if statements.

LostRuins commented 2 months ago

What do you think of the naming?

LostRuins commented 2 months ago

we can just add an extra line to rename the file

LostRuins commented 2 months ago

Is it ready to merge?

henk717 commented 2 months ago

Yes