triton-inference-server/fastertransformer_backend
BSD 3-Clause "New" or "Revised" License
411 stars, 134 forks
issues
#12 use nvidia-smi to track mem usage (yuanzhedong, closed, 3 years ago, 0 comments)
#11 Refine benchmark script with mem usage (yuanzhedong, closed, 3 years ago, 0 comments)
#10 refine benchmark script (yuanzhedong, closed, 3 years ago, 0 comments)
#9 add script to benchmark latency on single node (yuanzhedong, closed, 3 years ago, 0 comments)
#8 add more params to identity_test.py (yuanzhedong, closed, 3 years ago, 0 comments)
#7 feat: Support multi-node serving (byshiue, closed, 3 years ago, 0 comments)
#6 V1.1 dev - Add Multi-Node Support (PerkzZheng, closed, 3 years ago, 0 comments)
#5 Fix backend naming to use root 'fastertransformer' instead of 'transformer' (deadeyegoodwin, closed, 3 years ago, 0 comments)
#4 Triton backend API version issue (GwangsooHong, closed, 3 years ago, 2 comments)
#3 Triton backend API version issue (GwangsooHong, closed, 3 years ago, 0 comments)
#2 V1.0 dev (byshiue, closed, 3 years ago, 0 comments)
#1 feat: Add v1.0 codes (byshiue, closed, 3 years ago, 0 comments)