-
### Description of the bug:
which are supported ops in PT2E model, e.g. SiLU or GELU for conversion doesn't work at the moment, even though some are supported by tf-lite runtime e.g. GELU is support…
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### Ultralytics YOLO Component
Pred…
-
```
hi
Email Address:::: rafe.torabi@gmail.com
help me ; help me plz I write code for Inference but it have error below:::
??? Error using ==> subsindex Function 'subsindex' is not defined for …
-
is it possible to release the model as serialized onnx file probably it's a good idea to release some sample code with onnx Inference engine with public restful API
-
In prune.py, find out if W_metric is missing X,should it be engine.forward(W,G,X)?
![1722996516687](https://github.com/user-attachments/assets/12fb8cbe-9f2b-45ce-aa79-2d4d38adfe53)
cquxl updated
2 months ago
-
## Bug report
**Describe the bug**
LLM Engine failed in ValidatedGraphConfig Initialization step.
### Steps to reproduce
Steps to reproduce the behavior:
1. Download gemma-2b-it-gpu-int8.…
-
**Describe the bug**
I am tryting to do batch inference, so the inputs needs padding. When using `replace_with_kernel_inject=True`, the engine output is incorrect. setting `replace_with_kernel_inject…
-
```
This will make the generated UI module more suitable for dynamic web such
as data grid and list
```
Original issue reported on code.google.com by `John.Jian.Fang@gmail.com` on 8 Feb 2009 at 7:24…
-
# Reference
- [ ] [inference-engine/tools/benchmark_tool](https://github.com/opencv/dldt/blob/2020/inference-engine/tools/benchmark_tool/benchmark_app.py)
# Brief
## [Calculate Latency & Th…
-
**Describe the bug**
When using pipelining (with or without `LayerSpec` inside `PipelineModule`), the first GPU seems to have a considerably higher memory consumption, compared to the other ones. T…