Open GuoYi0 opened 8 months ago
does it support batchsize > 1 ?
No, it doesn't. CodeLlama met the same problem. I think it is an open question for the community.
May be it supports batchsize>1:https://github.com/lucidrains/speculative-decoding
does it support batchsize > 1 ?