-
I see that the Triton backend creates a [GptManager object](https://github.com/triton-inference-server/tensorrtllm_backend/blob/bf5e9007a3f16c7fc76cb156a3362d1caae198dd/inflight_batcher_llm/src/model_…
-
Hi everyone,
I am running Triton Server with vLLM and want to use dynamic batching, but I encountered an error. It seems to have something to do with my input.
Inference with curl:
curl -X POST loca…
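For reference, Triton's dynamic batcher is normally enabled through the model's `config.pbtxt`. A minimal sketch follows; the model name and parameter values are placeholders, and note that the vLLM backend performs its own continuous batching, so for vLLM models the Triton-side dynamic batcher is usually left disabled:

```
# Hypothetical config.pbtxt fragment for a conventional (non-vLLM) model.
name: "my_model"            # placeholder model name
max_batch_size: 8
dynamic_batching {
  preferred_batch_size: [ 4, 8 ]
  max_queue_delay_microseconds: 100
}
```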
-
### **Feature Area**
/area backend
/area sdk
The examples for nvidia-resnet cannot be built using existing scripts.
### **What feature would you like to see?**
Update existing nvidia-resnet o…
-
I cannot start the whisperfile even though ffmpeg is definitely in the PATH. I have tried running cmd as administrator, as well as copying ffmpeg.exe into the same directory, with the same results ever…
-
The spatial detailing code supports only 6 region types, but the user can select any of the terriaJS region types. If the selected region type is not supported, the inference server throws an error…
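One way to avoid the server-side failure is to reject unsupported region types on the client side before the request is sent. A minimal sketch, assuming a validation helper; the set of supported region types below is purely illustrative (the real list comes from the spatial detailing code):

```python
# Hypothetical guard: fail fast with a clear message instead of letting
# the inference server throw. The region-type names here are placeholders.
SUPPORTED_REGION_TYPES = {"SA1", "SA2", "SA3", "SA4", "STE", "LGA"}

def validate_region_type(region_type: str) -> None:
    """Raise ValueError for region types the spatial detailing code cannot handle."""
    if region_type not in SUPPORTED_REGION_TYPES:
        raise ValueError(
            f"Region type {region_type!r} is not supported; "
            f"choose one of {sorted(SUPPORTED_REGION_TYPES)}"
        )
```

A guard like this could also drive the UI, so that unsupported terriaJS region types are never offered for selection in the first place.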
-
Looking at the release of TensorRT 9.1.0, I am very happy to see the integration of openai-triton with TensorRT plugins.
However, [one limitation of this integration is that python must be availabl…
-
➜ aiac --version
aiac version 5.2.1
We are using local backends provided by Hugging Face TGI:
```toml
[backends.phi3]
type = "openai"
default_model = "Phi-3"
url = "https://phi3.ourcluster/…
-
## Description
Currently our example evaluation scripts require TGIS docker images to be available locally. This procedure is undocumented and somewhat undefined unless someone already knows how to build TGIS loc…
-
We have a streaming service that uses gRPC over Unix sockets.
gRPC performs far better over Unix sockets than over a TCP port. I saw that you can only change the port in the Triton server…
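The performance gap comes from Unix domain sockets skipping the TCP/IP stack entirely for local traffic. A minimal stdlib sketch of the two transports side by side (an echo round trip, not Triton or gRPC code; gRPC clients can target such sockets via `unix:///path` addresses where the server supports them):

```python
import os
import socket
import tempfile
import threading

def _echo_once(server_sock):
    """Accept one connection and echo back whatever it receives."""
    conn, _ = server_sock.accept()
    with conn:
        conn.sendall(conn.recv(1024))

def round_trip(family, addr):
    """Send b'ping' over a fresh socket of the given family and return the reply."""
    srv = socket.socket(family, socket.SOCK_STREAM)
    srv.bind(addr)
    srv.listen(1)
    t = threading.Thread(target=_echo_once, args=(srv,))
    t.start()
    cli = socket.socket(family, socket.SOCK_STREAM)
    cli.connect(srv.getsockname())
    cli.sendall(b"ping")
    reply = cli.recv(1024)
    cli.close()
    t.join()
    srv.close()
    return reply

# TCP loopback: traffic goes through the full TCP/IP stack.
tcp_reply = round_trip(socket.AF_INET, ("127.0.0.1", 0))

# Unix domain socket: kernel-local, no TCP/IP overhead (POSIX only).
uds_path = os.path.join(tempfile.mkdtemp(), "echo.sock")
uds_reply = round_trip(socket.AF_UNIX, uds_path)
```

Timing many such round trips typically shows the Unix-socket path winning on latency, which is why exposing a socket path (rather than only a port) in the server would help here.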
-
I am conveniently running inference tasks with CodeGen models, thanks to the FauxPilot community. Thank you again.
Additionally, I wonder whether it is possible to run multiple models on a single GPU.
Bel…