Add initial support for Zephyr 7b Beta

brainlid / langchain

Elixir implementation of a LangChain style framework.

https://hexdocs.pm/langchain/

Other

505 stars 58 forks source link

Add initial support for Zephyr 7b Beta #41

Closed brainlid closed 4 months ago

brainlid commented 7 months ago

This is for running the model directly on hardware using Nx and Bumblebee.

The Zephyr 7B beta LLM doesn't have all the capabilities of ChatGPT, nor the safeguards.

What works:

running a streaming, chat-based interaction
non-streaming call support
- https://github.com/elixir-nx/bumblebee/issues/295
- https://github.com/elixir-nx/bumblebee/issues/247
varied output - https://github.com/elixir-nx/bumblebee/issues/284

What doesn't work:

cancelling - we can kill the process handling the stream, but the GPU could keep going until finished or it reaches the token limit
no function support

Closes #26

brainlid commented 7 months ago

Zepher-7b Beta does NOT support function calling. It doesn't understand how to do it and has not been trained for it.

There are alternate models that have fine-tuned Zephyr for function calling, but those have licensing problems. They trained the model using OpenAI, which is a violation of the terms of use.

acalejos commented 5 months ago

Zepher-7b Beta does NOT support function calling. It doesn't understand how to do it and has not been trained for it.

There are alternate models that have fine-tuned Zephyr for function calling, but those have licensing problems. They trained the model using OpenAI, which is a violation of the terms of use.

@brainlid

Have you thought about any ways to support Functions through rolling a custom dispatching? My thought is using something like Instructor to coerce the LLM into categorizing the task thats being asked into a set of function. You add the task description as one of the possible outputs. Then you map all of the tasks to their respective functions. Huggingface has a diagram that sort of shows what I'm referring to here. You could also coerce the parameters using instructor as well.

I haven't tried this yet, but just wanted to throw the idea out there.

brainlid commented 5 months ago

@acalejos Yes! As you probably know by now, I interviewed Thomas Millar about InstructorEx in the episode that came out today.

The challenge is that Instructor doesn't work with Bumblebee yet, and relies on a llamacpp ability to restrict the output grammar, forcing it into a compliant JSON structure.

I'm very interested in the work going on there and this direction. It's very cool.