ai-inference Search Results

1000+ results
for ai-inference

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

containers/podman-desktop-extension-ai-lab #1592

Adjust Inference Server details page UX

### Is your enhancement related to a problem? Please describe While the inference server page is listing the information, those are not easy to decipher. And we would like to introduce more sections …

slemeur updated 3 weeks ago
3
roboflow/inference #316

GPU Acceleration Doesn't Work with CUDA 12

### Search before asking - [X] I have searched the Inference [issues](https://github.com/roboflow/inference/issues) and found no similar bug report. ### Bug If you install `inference-gpu` on a mac…

yeldarby updated 1 month ago
2
elastic/kibana #192962

[openAI] use usage stats for token counts

OpenAI can now exposes usage stats for the stream completion APIs https://community.openai.com/t/usage-stats-now-available-when-using-streaming-with-the-chat-completions-api-or-completions-api/738156…

pgayvallet updated 3 weeks ago
2
microsoft/autogen #2989

[Issue][Discussion]: Use of "name" field for messages

### Describe the issue This issue is a place to discuss the impact of not being able to rely on the `name` field on messages and existing, or proposed, solutions to cater for this. --- The `n…

marklysze updated 4 days ago
2
janhq/jan #3737

bug: Jan is not using GPU

### Jan version 0.5.4 ### Describe the Bug Jan is not using GPU ### Steps to Reproduce Model: Bielik-11B-v2.3-Instruct.Q8_0.gguf GPU: NVIDIA RTX 4070 SUPER ### Screenshots / Logs ![obraz_2024-…

majsterkovic updated 3 days ago
7
kserve/kserve #3736

add Xinfernece ( an inference platform which integrated tran…

/kind feature **Describe the solution you'd like** Hope add [https://github.com/xorbitsai/inference](https://github.com/xorbitsai/inference) as the kserve huggingface LLMs serving runtime Xor…

jaffe-fly updated 1 month ago
5
opensearch-project/ml-commons #2823

[BUG] Failed to deploy model

### Describe the bug [2024-08-08T07:17:44,731][WARN ][a.d.h.t.HuggingFaceTokenizer] [opensearch-ml-node] maxLength is not explicitly specified, use modelMaxLength: 512 [2024-08-08T07:17:44,737][ER…

jlibx updated 1 week ago
3
irthomasthomas/undecidability #886

Announcing Together Inference Engine 2.0 with new Turbo and …

- [ ] [Announcing Together Inference Engine 2.0 with new Turbo and Lite endpoints](https://www.together.ai/blog/together-inference-engine-2) # Announcing Together Inference Engine 2.0 with new Turbo …

ShellLM updated 1 month ago
1
xihajun/test-comments #19

Benchmarking - Qualcomm Cloud AI - MLPerf Inference

http://127.0.0.1:8000/krai_qaic_task/benchmark/QuickBenchmarking

xihajun updated 1 year ago
1
xihajun/test-comments #23

License - Qualcomm Cloud AI - MLPerf Inference

http://127.0.0.1:8000/tmp/License/#device-details QAIC

xihajun updated 1 year ago
1

上一页 1...5 6 7 8 9 10 11...100 下一页

1000+ results for ai-inference

1000+ results
for ai-inference