huggingface / dataspeech

MIT License
313 stars 48 forks source link

LLM Swarm #8

Closed sanchit-gandhi closed 7 months ago

sanchit-gandhi commented 7 months ago

To annotate 10k hours of description data with Mistral Instruct 7B v0.2 and an 8x H100 80GB node:

=> a 2x difference, which primarily comes from the continuous batching implemented by TGI.