-
While looking into why one of the pipelines was stuck in the `LABEL_INFERENCE` stage, I found that we appear to re-load the full trip model before each trip inference.
This is:
- unnecessary, si…
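A minimal sketch of the obvious fix: cache the loaded model at process level so it is deserialized once rather than before every trip inference. The names `get_trip_model` and `_expensive_load` are placeholders, not the pipeline's actual API.

```python
import functools

def _expensive_load(path):
    # Stand-in for the real model deserialization (an assumption).
    return {"path": path, "loaded": True}

@functools.lru_cache(maxsize=1)
def get_trip_model(path="trip_model.pkl"):
    # First call loads the model; later calls return the cached object.
    return _expensive_load(path)

model_a = get_trip_model()
model_b = get_trip_model()
assert model_a is model_b  # loaded once, reused for every inference
```

Any equivalent memoization (a module-level singleton, an instance attribute on the pipeline object) would serve the same purpose.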
-
Instead of pressing a key, continuously listen until the wake word is announced (i.e., "Hey Ross")
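The requested loop can be sketched as follows. This is only an illustration of the control flow: `listen_for_wake_word` and the snippet stream are hypothetical, and a real implementation would feed transcribed audio from a microphone rather than an in-memory list.

```python
WAKE_WORD = "hey ross"

def listen_for_wake_word(snippets, wake_word=WAKE_WORD):
    # Consume transcribed snippets until one contains the wake word;
    # everything before that point is ignored instead of requiring a
    # key press to start listening.
    for text in snippets:
        if wake_word in text.lower():
            return text
    return None

# Simulated transcription stream standing in for live audio.
stream = iter(["turn on the lights", "Hey Ross, what's the weather?"])
assert listen_for_wake_word(stream) == "Hey Ross, what's the weather?"
```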
-
### 🐛 Describe the bug
This config works:
```
vmargs=-XX:+UseContainerSupport -XX:InitialRAMPercentage=25.0 -XX:MaxRAMPercentage=100.0 -XX:-UseLargePages -XX:+UseG1GC -XX:+ExitOnOutOfMemoryError
i…
-
There is no error locally or in a test submit (`nsml submit -t ~~`), but in the actual submission an error occurs after inference finishes, as shown below.
Could you tell me why? Access to the infer side is restricted, so it is hard to locate the error or find its cause.
Building docker image. It may take a while
.........lo…
-
### System Info
V100*2
nvcr.io/nvidia/tritonserver:24.01-trtllm-python-py3
tensorrt-llm 0.7.0
### Who can help?
_No response_
### Information
- [X] The official example scripts
- [ ] My own mo…
-
Hi,
I am very interested in the distributed inference of Colossal AI. Since we have pre-trained NLP models from PyTorch or JAX, I wonder if it is possible, or what should be done, to use EnergonAI for infere…
-
**Is this a BUG REPORT or FEATURE REQUEST?**:
> Uncomment only one, leave it on its own line:
>
> /kind bug
> /kind feature
**What happened**:
Investigate if we can use https://github.…
-
## Bug description
In README.md, it's stated that the prompts used in production for HuggingChat can be found in PROMPTS.md.
However, PROMPTS.md has not been updated for 7 months and there are s…
-
## Is your feature request related to a problem? Please describe.
Currently, TorchServe's sanity suite, regression suite, and the recent changes related to logging [GPU info in the model description]…
-
Whenever I try to run `script.py` or follow the instructions here: https://blog.roboflow.com/how-to-deploy-cogvlm/
I always get this result: `{'message': 'Internal error.'}`
Using Gradio also return…