-
@csukuangfj
The code for this client is in
https://github.com/k2-fsa/sherpa/tree/master/triton/client/client.py
In my tests, if you do not use multi-process mode, that is, send data to the …
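For context, a minimal sketch of what a multi-process client test could look like: several worker processes, each with its own connection to the Triton server, sending requests concurrently. The server URL, model name, and `WAV` input tensor here are hypothetical placeholders, not taken from client.py.
```python
import multiprocessing as mp

import numpy as np
import tritonclient.http as httpclient

SERVER_URL = "localhost:8000"  # placeholder address
MODEL_NAME = "transducer"      # placeholder model name

def send_requests(num_requests: int) -> None:
    # Each process opens its own connection to the server.
    client = httpclient.InferenceServerClient(url=SERVER_URL)
    for _ in range(num_requests):
        # Dummy 1-second, 16 kHz audio payload just to exercise the server.
        audio = np.random.randn(1, 16000).astype(np.float32)
        inp = httpclient.InferInput("WAV", list(audio.shape), "FP32")
        inp.set_data_from_numpy(audio)
        client.infer(MODEL_NAME, inputs=[inp])

if __name__ == "__main__":
    # Multi-process mode: N processes sending requests concurrently,
    # rather than one process sending them sequentially.
    procs = [mp.Process(target=send_requests, args=(10,)) for _ in range(4)]
    for p in procs:
        p.start()
    for p in procs:
        p.join()
```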
-
This is not identical to the other post with the same title.
I get this error when running any Stable Diffusion XL model, and a similar (but shorter) error when I run any 1.0 model. It throws the e…
-
# KServe: A Robust and Extensible Cloud Native Model Server
## Related Issues
* #21
## Article Source
* [KServe: A Robust and Extensible Cloud Native Model Server](https://thenewstack.io/kser…
-
**Description**
When cycling through the `load model` -> `infer` -> `unload model` scenario, we observe a GPU memory leak.
This only happens when models are in TorchScript format. There is no leak…
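For reference, a minimal sketch of the cycle described above, using the Triton Python HTTP client's explicit model-control calls. It assumes the server was started with `--model-control-mode=explicit`; the model name and the `INPUT__0` tensor (the PyTorch backend's default naming convention) are placeholders for the actual deployment.
```python
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

for i in range(100):
    # Load the TorchScript model, run one inference, then unload it.
    client.load_model("my_torchscript_model")  # placeholder name

    data = np.random.randn(1, 3, 224, 224).astype(np.float32)
    inp = httpclient.InferInput("INPUT__0", list(data.shape), "FP32")
    inp.set_data_from_numpy(data)
    client.infer("my_torchscript_model", inputs=[inp])

    client.unload_model("my_torchscript_model")
    # GPU memory usage reportedly grows across iterations of this loop.
```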
-
At the moment the rocm sdk builder stack provides the base for integrating and using many nice ML projects, but does not itself include them.
Some of the projects, like OpenAI Whisper, are…
-
```
#include <onnxruntime_cxx_api.h>
#include <string>

int main()
{
    for (size_t i = 0; i < 10; i++)
    {
        std::string modelPath = std::string("./model/model.onnx");
        Ort::Env env;
        Ort::Session session = O…
-
One very large Triton kernel cannot be loaded correctly through the L0 (Level Zero) API.
We get the error code `0x78000011` from the L0 API `zeKernelCreate`.
```
ZE_RESULT_ERROR_INVALID_KERNEL_NAME = 0x78000011, ///< [Va…
-
It seems that the OpenAI server no longer provides the pre-built LLVM package previously used on CentOS. As a result, the current CI complains with the following:
```
# Build Triton Wheel
downloading and extracting https://githu…
-
### System Info
I've converted Llama 3 using TensorRT-LLM's convert_checkpoint script, and am serving it with the inflight_batcher_llm template. I'm trying to get diverse samples for a fixed input,…
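For context, getting diverse samples from the inflight_batcher_llm template generally comes down to varying the sampling inputs per request. Below is a rough sketch; the tensor names (`text_input`, `max_tokens`, `temperature`, `top_p`, `random_seed`, `text_output`) are assumptions based on the template's ensemble config, and the model name and URL are placeholders.
```python
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

def build_inputs(prompt: str, seed: int):
    # Assumed tensor names from the inflight_batcher_llm ensemble config.
    inputs = []

    def add(name, arr, dtype):
        t = httpclient.InferInput(name, list(arr.shape), dtype)
        t.set_data_from_numpy(arr)
        inputs.append(t)

    add("text_input", np.array([[prompt.encode()]], dtype=object), "BYTES")
    add("max_tokens", np.array([[64]], dtype=np.int32), "INT32")
    add("temperature", np.array([[0.9]], dtype=np.float32), "FP32")
    add("top_p", np.array([[0.95]], dtype=np.float32), "FP32")
    # Varying the seed per request should yield different samples
    # for the same fixed input.
    add("random_seed", np.array([[seed]], dtype=np.uint64), "UINT64")
    return inputs

for seed in range(4):
    result = client.infer("ensemble", inputs=build_inputs("Hello", seed))
    print(result.as_numpy("text_output"))
```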
-
### Issue type
Bug
### Have you reproduced the bug with TensorFlow Nightly?
Yes
### Source
binary
### TensorFlow version
tf 2.16.2
### Custom code
No
### OS platform and …