-
### Describe the issue
With the QNN execution provider, we see that ~800 MB of memory is allocated when loading the first model, and roughly another 100 MB is allocated after each additional model is loaded. When destroyin…
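For context, a minimal sketch of the kind of repro this describes (not the reporter's actual code): create and release several sessions on the QNN EP and watch process memory. The model path and the `QnnHtp.dll` backend path are placeholders.

```python
import gc
import onnxruntime as ort

def make_session(model_path: str) -> ort.InferenceSession:
    # Assumed provider options: HTP backend on Windows.
    qnn_options = {"backend_path": "QnnHtp.dll"}
    return ort.InferenceSession(
        model_path,
        providers=["QNNExecutionProvider"],
        provider_options=[qnn_options],
    )

# Memory reportedly grows with each session created...
sessions = [make_session("model.onnx") for _ in range(5)]

# ...and is reportedly not returned after the sessions are destroyed.
del sessions
gc.collect()
```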
-
A recurring issue I keep seeing is people being unable to run models on their hardware for any number of reasons, one of the biggest being that llama.cpp has not incorporated …
-
I am currently using a Surface Pro 11 to reproduce [AIPC_Inference.md#2-use-directml--onnx-runtime-to-run-phi-3-model](https://github.com/microsoft/Phi-3CookBook/blob/main/md/03.Inference/AIPC_Inference…
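For illustration, a minimal sketch of the DirectML-plus-ONNX-Runtime setup that step refers to, assuming the `onnxruntime-directml` package is installed; the model path is a placeholder, not the cookbook's exact Phi-3 artifact.

```python
import onnxruntime as ort

session = ort.InferenceSession(
    "phi-3-mini.onnx",                   # placeholder model path
    providers=["DmlExecutionProvider"],  # DirectML EP from onnxruntime-directml
)
print(session.get_providers())           # confirm DML is actually being used
```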
-
**System Information (please complete the following information):**
- Model Builder Version: 17.18.4.2425601
- Visual Studio Version: 17.11.4
**Describe the bug**
Starting today I get an excep…
-
I am trying to export a custom GeoCalib model to ONNX.
The model uses LMOptimizer, which accepts a custom class Pinhole (a subclass of BaseCamera) as input. When I attempt to export the model, I en…
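For illustration, a sketch of the usual workaround for custom-class inputs to `torch.onnx.export`: wrap the model so the exported `forward` takes only tensors and rebuilds the custom object inside. Only the LMOptimizer/Pinhole names come from the report; the wrapper, the `from_params` constructor, and the shapes are hypothetical.

```python
import torch

class ExportWrapper(torch.nn.Module):
    def __init__(self, optimizer):
        super().__init__()
        self.optimizer = optimizer

    def forward(self, image: torch.Tensor, intrinsics: torch.Tensor):
        # Hypothetical: rebuild the camera object from a plain tensor of intrinsics
        # so the exported graph only sees tensor inputs.
        camera = Pinhole.from_params(intrinsics)
        return self.optimizer(image, camera)

wrapper = ExportWrapper(lm_optimizer)  # lm_optimizer: the GeoCalib LMOptimizer instance
torch.onnx.export(
    wrapper,
    (torch.randn(1, 3, 224, 224), torch.randn(1, 4)),  # placeholder shapes
    "geocalib.onnx",
    opset_version=17,
)
```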
-
### Describe the issue
The EP_CTX_BLOB (the compiled model saved as an ONNX blob on disk)
seems to have WRITE and EXECUTE permissions enabled.
Since a compiled blob is only meant to be read, I…
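A quick way to inspect the permission bits the report is describing; the context-blob filename is a placeholder.

```python
import os
import stat

mode = os.stat("model_ctx.onnx").st_mode
print(oct(mode & 0o777))  # full permission bits, e.g. 0o777 would include write+execute
print("owner write:", bool(mode & stat.S_IWUSR),
      "owner execute:", bool(mode & stat.S_IXUSR))
```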
-
### Describe the issue
I have an FP16 (half-precision floating point) ONNX model. When I load and execute this model using the onnxruntime library in Python, the first execution is successful and pro…
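For context, a minimal sketch of the load-and-run pattern being described, with a placeholder model path, input name, and shape; the actual model comes from the report.

```python
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("model_fp16.onnx", providers=["CPUExecutionProvider"])
input_meta = session.get_inputs()[0]
x = np.random.rand(1, 3, 224, 224).astype(np.float16)  # FP16 input for the FP16 model

out_first = session.run(None, {input_meta.name: x})   # first execution: reported to succeed
out_second = session.run(None, {input_meta.name: x})  # subsequent executions: where the issue appears
```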
-
### Describe the issue
The preprocess step for quantization does not work with the latest onnxruntime version:
```shell
python -m onnxruntime.quantization.preprocess --input image_resize.onnx --outp…
```
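A roughly equivalent call through the Python API may help isolate whether the CLI entry point or the preprocessing itself is failing; this assumes `quant_pre_process` lives in `onnxruntime.quantization.shape_inference` in the installed version, and the output filename is a placeholder.

```python
from onnxruntime.quantization.shape_inference import quant_pre_process

quant_pre_process(
    "image_resize.onnx",               # input model from the report
    "image_resize_preprocessed.onnx",  # placeholder output path
)
```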
-
**Description**
When deploying an ONNX model using the Triton Inference Server's ONNX runtime backend, the inference performance on the CPU is noticeably slower compared to running the same model usi…
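For illustration, a sketch of the standalone baseline that comparison implies: time the same model with onnxruntime directly on CPU, to establish the number the Triton ONNX backend is being measured against. The model path, input name, and shape are placeholders.

```python
import time
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
name = session.get_inputs()[0].name
x = np.random.rand(1, 3, 224, 224).astype(np.float32)

session.run(None, {name: x})  # warm-up run
start = time.perf_counter()
for _ in range(100):
    session.run(None, {name: x})
print((time.perf_counter() - start) / 100 * 1e3, "ms per inference")
```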
-
### Describe the issue
I encountered the following error when loading a model with dynamic shapes using the QNN Provider as the backend acceleration setting.
![Image](https://github.com/user-attach…
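A common workaround, since the QNN EP generally requires static shapes, is to pin the dynamic dimensions to concrete values before handing the model to the provider. This is a generic onnx-protobuf sketch, not the reporter's model; file names and the chosen size are placeholders.

```python
import onnx

model = onnx.load("model_dynamic.onnx")
for inp in model.graph.input:
    for dim in inp.type.tensor_type.shape.dim:
        if dim.dim_param:             # symbolic dimension, e.g. "batch"
            dim.ClearField("dim_param")
            dim.dim_value = 1         # placeholder static size
onnx.save(model, "model_static.onnx")
```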