triton-inference-server/client
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
BSD 3-Clause "New" or "Revised" License
521 stars, 225 forks
#745 Move GenAI-Perf profiling to its own subcommand (dyastremsky, opened, 1 hour ago, 0 comments)
#744 Update the name of Hugging Face TEI (dyastremsky, closed, 4 hours ago, 0 comments)
#743 Update GAP tutorial of vllm backend (AndyDai-nv, opened, 8 hours ago, 0 comments)
#742 feat: Client input byte size checks (yinggeh, opened, 1 day ago, 0 comments)
#741 Update version genai-perf to 0.0.4 (fpetrini15, closed, 1 day ago, 0 comments)
#740 Revert "Update version 0.0.4 (#739)" (fpetrini15, closed, 1 day ago, 0 comments)
#739 Update genai-perf to version 0.0.4 (fpetrini15, closed, 1 day ago, 0 comments)
#738 Decreased Accuracy in Text Detection and Recognition Models after Upgrading to tritonclient 23.04-py3 (ashlinghosh, opened, 1 day ago, 0 comments)
#737 Update default behavior for max threads (lkomali, opened, 4 days ago, 0 comments)
#736 Benchmarking VQA Model with Large Base64-Encoded Input Using perf_analyzer (pigeonsoup, opened, 4 days ago, 0 comments)
#735 Enable client-side batching for OpenAI (dyastremsky, closed, 6 days ago, 1 comment)
#734 Remove unnecessary OpenAI rankings parsing branch (dyastremsky, closed, 6 days ago, 0 comments)
#733 Delayed aio infer during burst requests (evanarlian, opened, 6 days ago, 0 comments)
#732 Guard GenAI-Perf plot generation (dyastremsky, closed, 1 week ago, 1 comment)
#731 Memory leak from grpcio (AlexanderKomarov, opened, 1 week ago, 0 comments)
#730 Can not import GRPC Tritonclient in Seldon MLServer (haiminh2001, closed, 1 week ago, 1 comment)
#729 Fix filepath (debermudez, closed, 1 week ago, 0 comments)
#728 Add support for Hugging Face Text Embeddings Interface's re-ranker API (dyastremsky, closed, 6 days ago, 0 comments)
#727 Add testing for DataLoader (AndyDai-nv, opened, 1 week ago, 2 comments)
#726 Bug in grpc client (gerasim13, opened, 1 week ago, 0 comments)
#725 Update tutorial to be two commands each (tgerdesnv, opened, 1 week ago, 0 comments)
#724 Add rankings support (debermudez, closed, 1 week ago, 1 comment)
#723 Support ranking API profile data parsing and metrics calculation (nv-hwoo, closed, 1 week ago, 1 comment)
#722 Minimum Impact PA Migration Changes (fpetrini15, opened, 1 week ago, 0 comments)
#721 Update GenAI-Perf metric unit assignment to avoid overwrites (dyastremsky, closed, 1 week ago, 0 comments)
#720 [WIP] LLaVA support (mwawrzos, opened, 1 week ago, 0 comments)
#719 Support embedding metrics and output export (nv-hwoo, closed, 1 week ago, 1 comment)
#718 Update GenAI-Perf development version (mc-nv, closed, 1 week ago, 0 comments)
#717 Document how to profile embeddings models (dyastremsky, closed, 1 week ago, 0 comments)
#716 Migrate PA repo (fpetrini15, opened, 1 week ago, 0 comments)
#715 Add embedding and ranking support (dyastremsky, closed, 1 week ago, 0 comments)
#714 How to write cmakelist to use grpcclient? (lyp741, opened, 1 week ago, 0 comments)
#713 Add embedding support (dyastremsky, closed, 1 week ago, 0 comments)
#712 Reorganize metrics and data parser (nv-hwoo, closed, 2 weeks ago, 2 comments)
#711 [QUESTION] - Tensorflow python infer to triton client (eyalhir74, opened, 2 weeks ago, 0 comments)
#710 tritonclient.utils.InferenceServerException: [StatusCode.INTERNAL] Unable to open shared memory region: '/output_simple' (tangxueduo, opened, 2 weeks ago, 0 comments)
#709 Generate LLM inputs for embeddings endpoint (dyastremsky, closed, 2 weeks ago, 1 comment)
#708 Support embeddings and rankings in GenAI-Perf (dyastremsky, closed, 3 weeks ago, 1 comment)
#707 Piotrm request client (piotrm-nvidia, opened, 3 weeks ago, 0 comments)
#706 Remove ITL from console when non-streaming (nv-hwoo, closed, 3 weeks ago, 1 comment)
#705 Update input data docs to include allowing outputs in provided directory (matthewkotila, closed, 2 weeks ago, 0 comments)
#704 Fix tutorials (tgerdesnv, closed, 3 weeks ago, 0 comments)
#703 genai-perf KeyError: 'service_kind' (highheart, closed, 1 week ago, 6 comments)
#702 Allow multiple prompts to be supplied via --input-file to GenAI-Perf (dyastremsky, closed, 3 weeks ago, 3 comments)
#701 Update metric name from input/output tokens to input/output sequence lengths (nv-hwoo, closed, 3 weeks ago, 4 comments)
#700 Add tensorrtllm_engine option to service-kind and update testing (debermudez, closed, 3 weeks ago, 0 comments)
#699 How do I get genai-perf to analyze my defined data set (highheart, closed, 1 week ago, 2 comments)
#698 Calculate total output token from full text (IzzyPutterman, closed, 3 weeks ago, 4 comments)
#697 fix: Get validation outputs by name rather than index (krishung5, closed, 4 weeks ago, 2 comments)
#696 Add documentation on installing PA dependencies on Ubuntu (matthewkotila, closed, 1 month ago, 0 comments)