yaroslavyaroslav / OpenAI-sublime-text

First class Sublime Text AI assistant with GPT-4o and llama.cpp support!
MIT License
130 stars 11 forks source link

llama-2 models support #21

Closed yaroslavyaroslav closed 4 months ago

yaroslavyaroslav commented 12 months ago

I've played with llama-2 models released by Meta just yet, and they're… pretty usable on my MacBook M1 Pro 2021 16Gb RAM (on behalf llama.cpp of course — Metal support for inference brings stunning boost to its inference performance in contrast to the CPU one).

Even at their minimum scale (e.g. 7B-chat and 13B-chat I've tried) they're performing well enough in both terms: prompting speed and quality, and since there's a chat (e.g. instruction based) behavior models provided next to ordinary one it leads to they now usable the same way that any OpenAI chatGPT model did from the user side.

So I'm looking forward to add their support to that plugin to use them locally. Let me elaborate my vision of that a bit.