Open VincyZhang opened 9 months ago
It seems that the issue is related to the missing API key in the OpenAI client. You need to provide a valid API key to access the OpenAI API. You can obtain an API key from the OpenAI website. Once you have the API key, replace "EMPTY" in the openai.api_key = "EMPTY" line with your actual API key. If you need help, please @NeuralChatBot
@NeuralChatBot
Is there a specific version of the openai package that is aligned with the OpenAI interfaces offered by neuralchat? I am currently testing with 1.12.0 and encountering a 422 Unprocessable Entity error.
I saw that meta-llama/Llama-2-7b-chat-hf is a supported model and appears to be small enough to fit into my Intel Data Center Flex 170 XPU.
I can successfully run this model locally with the code outlined in deploy_chatbot_on_xpu.
However, when I attempt to use the OpenAI interface per the instructions at https://github.com/intel/intel-extension-for-transformers/tree/main/intel_extension_for_transformers/neural_chat, the server shows 422 Unprocessable Entity and the client gets an error about a missing value. I am assuming this relates to a mismatch between the OpenAI client and the neural_chat server in terms of the required fields. I have also included the text extracted from the tcpdump below.
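For anyone debugging a similar 422, here is a minimal sketch of how the response body could be inspected to see which field the server considers missing. It assumes the server returns a FastAPI-style validation error, that is, a JSON object whose `detail` list holds entries with `loc`, `msg` and `type` keys. The helper name `missing_fields` and the sample payload are hypothetical and for illustration only; the sample is not taken from the capture in this issue.

```python
import json


def missing_fields(body: str) -> list[str]:
    """Return dotted paths of fields reported missing in a 422 response body.

    Assumes a FastAPI-style body: {"detail": [{"loc": [...], "msg": ..., "type": ...}]}.
    Returns an empty list if the body does not have that shape.
    """
    payload = json.loads(body)
    if not isinstance(payload, dict):
        return []
    details = payload.get("detail", [])
    if not isinstance(details, list):
        return []

    missing = []
    for item in details:
        if not isinstance(item, dict):
            continue
        # pydantic v1 reports "value_error.missing", pydantic v2 reports "missing"
        if "missing" in str(item.get("type", "")):
            missing.append(".".join(str(part) for part in item.get("loc", [])))
    return missing


# Hypothetical sample body, not the actual response from this issue
sample = '{"detail": [{"loc": ["body", "messages"], "msg": "field required", "type": "value_error.missing"}]}'
print(missing_fields(sample))
```

Running the same helper on the body from the real packet capture should show which request field the client and the server disagree on.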
Following along from the notebook examples, I have prepared textbot.yaml and server.py as below.
Starting the server
Additional logs after starting the TextChatClientExecutor client - successful inference
Additional logs after connecting via OpenAI - failing access
OpenAI client contents
Aside from the shebang and the modified model string, this should be identical to the content on the webpage.
Text from packet capture of exchange
Thank you!