Open usaxena-asapp opened 2 months ago
The paper does seem interesting (like a mix of diffusion models and autoregressive models).
🚀 The feature, motivation and pitch
Consistency LLM (https://hao-ai-lab.github.io/blogs/cllm/) claims to speed up inference. Which version of this could we support in vLLM?
Alternatives
No response
Additional context
No response