Closed vinayp1995 closed 4 months ago
Groq team is annoying. They had OpenAI API support. But after their beta testing was done, they continue to say they support it, but their url doesn't work anymore. I've reached out to them several times and never get any response.
f32e3ea1b28b17f8e24fb1d0c5cabca23824ba05
Please follow new instructions as in the commit above and install packages in that commit too.
Thanks for the fast turnaround, it works now.
Hi all,
I have been trying to get h2ogpt to work in remote inference server mode, but so far haven't been successful. This is how I invoke the app, as mentioned here: python generate.py --inference_server="vllm:https://api.groq.com/openai:None:/v1:$GROQ_API_KEY" --base_model='mixtral-8x7b-32768' --max_seq_len=31744 --prompt_type='plain'
And this is the output:
My system has openai 1.12.0 python package installed. Has anyone had any success doing this?