-
Running inference on the same image with multiple questions from the command line is not very convenient; a web UI would make this easier. I made a simple Gradio web UI demo for this, as follows:
```
impor…
```
-
I have an ollama container running the stable-code:3b-code-q4_0 model. I'm able to interact with the model via curl:
`curl -d '{"model":"stable-code:3b-code-q4_0", "prompt": "c++"}' https://notarea…
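For reference, the same request can be made from Python with only the standard library. The sketch below mirrors the JSON body passed to curl's `-d` flag and posts it to Ollama's standard `/api/generate` route; the `localhost:11434` host is a placeholder for wherever the container is reachable.

```python
import json
import urllib.request

def build_generate_payload(model, prompt, stream=False):
    # Same JSON body as the curl -d argument; stream=False asks Ollama
    # for a single JSON response instead of streamed chunks.
    return json.dumps({"model": model, "prompt": prompt, "stream": stream}).encode("utf-8")

def generate(host, model, prompt):
    # POST the payload to Ollama's /api/generate endpoint.
    req = urllib.request.Request(
        url=f"{host}/api/generate",
        data=build_generate_payload(model, prompt),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

if __name__ == "__main__":
    print(generate("http://localhost:11434", "stable-code:3b-code-q4_0", "c++"))
```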
-
### Describe the issue
I'm encountering a "memory access out of bounds" error when attempting to run inference using onnxruntime-web within a custom npm package. The inference process works flawles…
-
This is a placeholder for the task that will enable use of Intel GPUs in Triton via OpenVINO.
-
### Problem
The application currently relies on a server endpoint for spoken-language to SignWriting and SignWriting to spoken-language text-to-text translation.
This prevents us from performing…
-
@LukeForeverYoung Hey! Thanks for sharing this amazing work!
Are the model weights and inference code available?
I would be happy to test them locally.
-
When I try to load and run the ONNX model, I get the following error message. I ran the code from https://github.com/urchade/GLiNER/blob/main/examples/convert_to_onnx.ipynb to save as onnx mod…
-
Looking at MinerU's underlying code, each PDF page appears to be an independent processing unit, handled sequentially in a simple for-loop; there is no step that stitches blocks together across pages.
Would you consider adding a parallel-processing mechanism in the future: after splitting the document into pages, process different page objects concurrently according to available resources, then reassemble the results by page_index?
In theory this is feasible, but looking at the model loading and invocation logic, coroutines, multithreading, and multiprocessing all run into problems when calling the paddle models.
…
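The scheduling-and-reassembly structure proposed above can be sketched as follows. `process_page` is a hypothetical stand-in for MinerU's per-page pipeline; as noted above, the real paddle model calls are problematic under threads or processes, so this only illustrates the dispatch and page_index reassembly, not a working integration.

```python
from concurrent.futures import ThreadPoolExecutor

def process_page(page):
    # Hypothetical stand-in for MinerU's per-page pipeline; the real code
    # would run layout analysis and OCR models on the page image.
    page_index, text = page
    return page_index, text.upper()

def process_document(pages, max_workers=4):
    # Dispatch pages to a worker pool, then reassemble the results by
    # page_index so the output order matches the original document.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(process_page, pages))
    return [text for _, text in sorted(results)]

if __name__ == "__main__":
    pages = [(1, "page one"), (0, "page zero"), (2, "page two")]
    print(process_document(pages))
```

A process pool (or one model instance per worker) could be swapped in the same way, which is where the model-loading issues mentioned above would have to be solved.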
-
### Voice Changer Version
MMVCServerSIO_win_onxxgpu-cuda_v.1.5.3.17b
### Operational System
Windows 11 Home 64-bit (10.0, Build 22621)
### GPU
NVIDIA GeForce RTX 2050
### Read carefully and chec…
-
- [x] Use `llama_decode` instead of deprecated `llama_eval` in `Llama` class
- [ ] Implement batched inference support for `generate` and `create_completion` methods in `Llama` class
- [ ] Add suppo…