-
### Motivation
Currently, if you pass the model name to lmdeploy:
```
docker run -d --runtime nvidia --gpus '"device=0"' \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HUGGING…
-
### System Info / 系統信息
Package Version
--------------------------------- --------------
absl-py 2.1.0
accelerate 0.33.0
…
-
Could you tell me which package versions are required to deploy this model? My deployment succeeds, but as soon as I call the API it returns a 500 error.
The versions in use are as follows:
sh-4.2$ pip list | grep -P "vllm|torch|cuda"
nvidia-cuda-cupti-cu12 12.1.105
nvidia-cuda-nvrtc-cu12 12.1.105
nvidia-cuda-r…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related issue y…
-
Make Triton an *optional* model-serving component in VDP; users can enable it depending on whether they want to self-host their models on VDP via Triton.
For each model that is deployed via T…
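One way to make a service opt-in like this is Docker Compose service profiles, which keep a service out of the default stack until a flag enables it. The fragment below is a hypothetical sketch, not VDP's actual configuration; the service name, image tag, model path, and port are all assumptions:

```yaml
# Hypothetical compose fragment: Triton only starts when the
# "triton" profile is explicitly enabled, e.g.
#   docker compose --profile triton up -d
services:
  triton:
    profiles: ["triton"]        # opt-in: skipped by a plain `docker compose up`
    image: nvcr.io/nvidia/tritonserver:23.10-py3   # assumed tag
    command: tritonserver --model-repository=/models
    volumes:
      - ./model-repository:/models   # assumed host path for self-hosted models
    ports:
      - "8001:8001"             # gRPC endpoint other services would call
```

With profiles, `docker compose up` without `--profile triton` simply omits the service, so the rest of the stack runs without any Triton dependency.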
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related issue y…
-
The latest public Triton now supports dispatching the compilation flow to a third-party plug-in.
However, some miscellaneous changes are still required to make the XPU backend work.
1. Add XPU …
-
OpenAI upstream commit https://github.com/intel/intel-xpu-backend-for-triton/commit/2dd9d74527f431e5e822b8e67c01900e4d0bfef3 removes `TritonGPUToLLVMBase.h`; we added `Target target` in `ConvertTriton…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related iss…
-
I tried to follow the steps on my Windows PC, but I'm running into the following issue:
```
(myenv) PS C:\Users\AI-Install\Documents\transcribe\whisper-diarization> pip install -r requirements.txt
Col…