issues
search
siyan-zhao
/
prepacking
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"
https://arxiv.org/abs/2404.09529
56
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
AMAZING WORK! 4d mask support.
#1
aldopareja
opened
6 months ago
3