-
**Describe the bug**
When compiling to NPU, a runtime error is raised.
```
--> compiled_model = ov.compile_model(converted_model, device_name='NPU')
RuntimeError: Exception from src/inference/src/c…
```
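A hedged workaround sketch (not from the report): when a device-specific compile fails, fall back to another device. The helper below is hypothetical; with OpenVINO it would wrap `ov.compile_model`.

```python
def compile_with_fallback(compile_fn, model, devices=("NPU", "CPU")):
    """Try compiling for each device in order; return (device, compiled_model).

    compile_fn is expected to match the signature of ov.compile_model,
    i.e. compile_fn(model, device_name=...). The helper name is hypothetical.
    """
    last_err = None
    for device in devices:
        try:
            return device, compile_fn(model, device_name=device)
        except RuntimeError as err:
            last_err = err
    raise RuntimeError(f"Compilation failed on all devices: {last_err}")

# With OpenVINO this would be used as (an assumption, not verified on NPU):
#   import openvino as ov
#   device, compiled_model = compile_with_fallback(ov.compile_model, converted_model)
```

This keeps the NPU-specific failure visible in logs while still producing a usable compiled model on a supported device.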
-
### Issue type
Support
### Have you reproduced the bug with TensorFlow Nightly?
Yes
### Source
source
### TensorFlow version
2.13
### Custom code
Yes
### OS platform and distribution
_No re…
-
## Reporting a bug
When I called the LLVM package from tinygrad, I got the following error:
`Aborted (core dumped)` with `Symbol not found: __gnu_f2h_ieee`.
This issue seems to have been resolved in ear…
-
0x416
Medium
# Lack of error handling when making blockless API call
## Summary
Lack of error handling when making blockless API call
## Vulnerability Detail
Error handling when making blockless…
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/ray-project/kuberay/issues) and found no similar feature requirement.
/cc Bytedancer @Basasuya @Yicheng-Lu-llll
…
-
For now I am doing inference on images following the code in the inference.ipynb file. However, I have realised that it uses image paths to make the inference. Due to limitatio…
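A minimal sketch (my assumption, not the notebook's code) of decoding an image from in-memory bytes instead of a file path, using Pillow and NumPy; `model.predict` is a hypothetical stand-in for the notebook's inference call.

```python
import io

import numpy as np
from PIL import Image


def image_bytes_to_array(image_bytes: bytes) -> np.ndarray:
    """Decode raw image bytes (e.g. from an upload or socket) without writing to disk."""
    img = Image.open(io.BytesIO(image_bytes)).convert("RGB")
    return np.asarray(img)  # shape: (height, width, 3)


# Hypothetical usage, if the model accepts arrays instead of paths:
#   arr = image_bytes_to_array(payload)
#   result = model.predict(arr)
```

Any inference code that opens the path with Pillow internally can usually be adapted this way, since `Image.open` accepts any file-like object.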
-
### Description
Customer is interested in using the Elasticsearch inference API with text generation models on Hugging Face, whereas as of 8.15 we are limited to supporting only text_embedding.
-
While we support batched inference like other constrained decoding libraries, the current implementation can be parallelized further. In particular, we can mask logits in batch and run several `kbnf` …
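The batch masking step mentioned above can be sketched as a single vectorized operation (NumPy here for illustration; the function name is mine, not kbnf's API):

```python
import numpy as np


def mask_logits_batch(logits: np.ndarray, allowed: np.ndarray) -> np.ndarray:
    """Mask disallowed tokens for the whole batch at once.

    logits:  float array of shape (batch, vocab)
    allowed: bool array of shape (batch, vocab), True where the grammar permits a token
    Disallowed positions are set to -inf so softmax assigns them zero probability.
    """
    return np.where(allowed, logits, -np.inf)
```

One call over the `(batch, vocab)` arrays replaces a per-sequence Python loop, which is where the parallelization win comes from.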
-
### 🚀 The feature, motivation and pitch
I launched an LLM service with vLLM, and I use the AsyncOpenAI client for high-throughput output, like this:
```
async def async_llm_infer_sampling(prompt, a…
```
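The fan-out pattern behind that snippet can be sketched with plain asyncio (names are mine; `call_fn` stands in for the AsyncOpenAI completion call, which is not shown in the truncated issue):

```python
import asyncio


async def infer_many(call_fn, prompts, limit=8):
    """Run one async inference per prompt, with at most `limit` in flight at once.

    call_fn: an async function taking a prompt and returning a completion;
    with vLLM's OpenAI-compatible server it would wrap
    client.chat.completions.create (an assumption, not the issue's code).
    """
    sem = asyncio.Semaphore(limit)

    async def bounded(prompt):
        async with sem:
            return await call_fn(prompt)

    return await asyncio.gather(*(bounded(p) for p in prompts))
```

The semaphore bounds concurrency so a large prompt list does not flood the server, while `asyncio.gather` preserves the input order in the results.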
-
### Bug Description
The `e2e-wine-kfp-mlflow-kserve` test fails in Azure one-click deployment with the KServe `InferenceService` not being found. The pipeline run succeeds, then when the lightkube cl…
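If the failure is a race between the pipeline finishing and the `InferenceService` appearing, a polling helper like the hypothetical sketch below (not the test's actual code) is one way to wait before querying with lightkube:

```python
import time


def wait_until(check_fn, timeout=300.0, interval=5.0, clock=time.monotonic, sleep=time.sleep):
    """Poll check_fn until it returns truthy or the timeout elapses.

    check_fn would wrap the lightkube lookup of the InferenceService
    (an assumption about the test's structure, not its actual code).
    clock and sleep are injectable to make the helper testable.
    """
    deadline = clock() + timeout
    while clock() < deadline:
        if check_fn():
            return True
        sleep(interval)
    return False
```

Returning `False` instead of raising lets the test decide whether a missing `InferenceService` is a hard failure or a retryable condition.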