onecx-apps / onecx-chat

OneCx chat Management
Apache License 2.0
0 stars 0 forks source link

Create LLama 2 langchain integration #7

Closed michaelgloeckner closed 11 months ago

michaelgloeckner commented 1 year ago

Create Llama2Service which uses llama2 7b endpoint of kubernetes or sagemaker.

Kubernetes example call: curl -X POST -H 'Content-Type: application/json' -d '{ "messages": [{"role": "user", "content": "How are you?"}], "model": "llama-2-7b-chat.ggmlv3.q4_0.bin", "stream": false}' http://genai.one-cx.org:8000/v1/chat/completions

Sagemaker