Consider supporting groq

Gene-Weaver / VoucherVision

Initiated by the University of Michigan Herbarium, VoucherVision harnesses the power of large language models (LLMs) to transform the transcription process of natural history specimen labels.

https://huggingface.co/spaces/phyloforfun/VoucherVision

GNU General Public License v3.0

18 stars 4 forks source link

Consider supporting groq #6

Open nickynicolson opened 5 months ago

nickynicolson commented 5 months ago

Groq provides an OpenAI compatible API to several LLMs e.g. LLaMA3 8b, LLaMA3 70b, Mixtral 8x7b, Gemma 7b (documented on the models page) and free access - as of 2024-06-24 usage limits are:	ID	Request per minute	Requests per day
llama3-8b-8192	30	14,400	30,000
gemma-7b-it	30	14,400	15,000
mixtral-8x7b-32768	30	14,400	5,000
llama3-70b-8192	30	14,400	6,000

Is it worth considering supporting this as an LLM API provider?

Gene-Weaver commented 5 months ago

Absolutely! This is certainly on the todo list. Mixtral-8x7b and llama3-70b are both good candidates. This might also help you scale up significantly. I will make sure it happens soon. Groq would also be good for the agents workflow given the huge speed advantage.

I also have a sandbox script that calls 256 OpenAI API calls simultaneously, so that can achieve more or less 1,000 transcriptions per minute too (assuming we want to pay for it!).