triton-server Search Results

1000+ results
for triton-server


triton-inference-server/redis_cache #12

HMSET is deprecated as of Redis 4.0.0

https://github.com/triton-inference-server/redis_cache/blob/437e7e214ff446e6bc12febb44872547a1988fe1/src/redis_cache.cc#L220 Triton inference server is failing to write to Redis on versions past 4.…

zbloss updated 9 months ago
5
unslothai/unsloth #583

Triton 3.0.0 does not work

Hello, I've been playing around with the Colab examples on both AMD and NVDA sides and noticed that when using Triton's latest (main branch) it seems to break down at Unsloth's init step in the __…

jhu960213 updated 1 month ago
16
fegler/triton_server_example #1

How to load/unload model to release GPU memory using you cod…

Hello, this is nice work on Triton Server. I would like to ask how to load/unload a model in Triton's explicit mode. Do you have any relevant code or ideas? Looking forward to your reply.

HLH13297997663 updated 9 months ago
1
triton-inference-server/server #5391

Pass a python dict to triton server python backend

@Tabrizian In [this](https://github.com/triton-inference-server/client/blob/main/src/python/examples/simple_grpc_infer_client.py) example (line 131) they pass a python dict in the `headers` arg of `t…

ukemamaster updated 1 year ago
1
marcoslucianops/DeepStream-Yolo #288

Triton server fails to infer on yolo 7 through deepstream

WARNING: Num classes mismatch. Configured: 80, detected by network: 0 python3: nvdsparsebbox_Yolo.cpp:137: bool NvDsInferParseCustomYolo(const std::vector&, const NvDsInferNetworkInfo&, const NvDsInf…

madisi98 updated 1 year ago
1
triton-inference-server/tensorrtllm_backend #367

streaming mode doesn't work

### System Info V100*2 nvcr.io/nvidia/tritonserver:24.01-trtllm-python-py3 tensorrt-llm 0.7.0 ### Who can help? _No response_ ### Information - [X] The official example scripts - [ ] My own mo…

dongteng updated 3 months ago
2
triton-inference-server/client #777

Failing with Generic Error message: Failed to obtain stable …

I am testing on the basic models. The model takes an input and returns the same output with the same datatype. Inference is happening: 2024-08-20 09:35:15,923 - INFO - array_final: array([[103]], dtype=uint8) a…

Kanupriyagoyal updated 2 weeks ago
11
triton-inference-server/server #7308

triton malloc fail

**Description** Triton crashes during runtime. ``` (gdb) info stack #0 0x00007ffff64e4d8b in _int_malloc (av=av@entry=0x7ffe30000020, bytes=bytes@entry=24) at malloc.c:3608 #1 0x00007ffff64e729…

MouseSun846 updated 3 months ago
9
triton-inference-server/server #5622

triton inference client pinned to geventhttpclient==2.0.2, c…

**Description** PR [185](https://github.com/triton-inference-server/client/pull/185) pinned `geventhttpclient==2.0.2` due to a potential change in ssl_context_factory handling. The geventhttpcli…

brightsparc updated 2 months ago
5
fauxpilot/fauxpilot #162

I get an error at ./launch it starts the triton server and t…

I have nvidia and nvidia-docker installed. I have a 1060 with 6GB VRAM; it has compute 6.0. How can I troubleshoot this? When I run other nvidia containers on my PC I have to use --privileged to get …

billyblackburn updated 1 year ago
2
