Infini-AI-Lab/TriForce
[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
https://infini-ai-lab.github.io/TriForce/
230 stars
12 forks
issues
Newest
#11 Adapt to open source inference framework
    Siegfried-qgf, opened 2 months ago, 1 comment
#10 Attention Scores Matrix Visualization
    bulaikexiansheng, opened 2 months ago, 1 comment
#9 The progress bar does not reflect for a long time
    bulaikexiansheng, opened 3 months ago, 6 comments
#8 Question about graph verification
    diaoyingyu, closed 4 months ago, 2 comments
#7 Out of memory on H800
    Lucas-TY, opened 5 months ago, 8 comments
#6 how to change y1?
    Lucas-TY, opened 5 months ago, 1 comment
#5 Questions about end2end time cost of the inference request
    littletomatodonkey, closed 6 months ago, 2 comments
#4 Does Retrieval w/o Hierarchy test with spec decoding?
    bxyb, closed 7 months ago, 1 comment
#3 Example code to run batched inference?
    learning-chip, opened 7 months ago, 2 comments
#2 Update README.md
    eltociear, closed 7 months ago, 1 comment
#1 Update README.md
    AwaitFuture, closed 7 months ago, 1 comment