-
[04/10/2024-16:11:31] [W] [TRT] onnx2trt_utils.cpp:366: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
[04/10…
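For reference, a small diagnostic sketch (the model path is a placeholder, not taken from this log): it lists the INT64 initializers in the ONNX model and checks whether their values fit in INT32, which is exactly what the parser's cast-down warning is about. If everything fits, the automatic cast is lossless and the warning can be treated as informational.

```
import numpy as np
import onnx
from onnx import numpy_helper

INT32_MIN, INT32_MAX = np.iinfo(np.int32).min, np.iinfo(np.int32).max

# "model.onnx" is a placeholder path for the exported model.
model = onnx.load("model.onnx")

for init in model.graph.initializer:
    if init.data_type == onnx.TensorProto.INT64:
        values = numpy_helper.to_array(init)
        # An empty tensor trivially fits; otherwise check the value range.
        fits = values.size == 0 or (
            values.min() >= INT32_MIN and values.max() <= INT32_MAX
        )
        print(f"{init.name}: shape={values.shape}, fits_in_int32={fits}")
```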
-
In this code base, direct `.mnn` weight files are provided, which are parsed by the custom C++ module and then run through the MNN inference engine, but in one of your comments you mentioned first co…
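For comparison, a minimal sketch of driving a `.mnn` file through MNN's Python session API instead of the custom C++ module; the model path and the input/output shapes below are placeholders, not values from this code base.

```
import MNN
import numpy as np

# Placeholder path and shapes: adjust to the actual .mnn model.
interpreter = MNN.Interpreter("model.mnn")
session = interpreter.createSession()

# Copy a host tensor into the session's input tensor.
input_tensor = interpreter.getSessionInput(session)
data = np.random.rand(1, 3, 224, 224).astype(np.float32)
tmp_input = MNN.Tensor((1, 3, 224, 224), MNN.Halide_Type_Float,
                       data, MNN.Tensor_DimensionType_Caffe)
input_tensor.copyFrom(tmp_input)

interpreter.runSession(session)

# Copy the output back to a host tensor before reading it.
output_tensor = interpreter.getSessionOutput(session)
tmp_output = MNN.Tensor((1, 1000), MNN.Halide_Type_Float,
                        np.zeros((1, 1000), dtype=np.float32),
                        MNN.Tensor_DimensionType_Caffe)
output_tensor.copyToHostTensor(tmp_output)
print(tmp_output.getData()[:5])
```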
-
## Bug report
**Describe the bug**
Unable to distribute the example app to TestFlight for internal testing. Error:
Runner.app/Frameworks/arm64_libllm_inference_engine.framework does not support t…
-
Hi,
I'm working on OpenVINO with FPGA support.
I can run the example as shown in the [Run a Sample Application] section of "https://software.intel.com/en-us/articles/OpenVINO-Install-Linux-FPGA", and …
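For context, a minimal sketch of what such a sample boils down to in the Python Inference Engine API, assuming an OpenVINO 2020.x release where both `IECore.read_network` and the FPGA plugin are still available; the model paths, input dtype, and the `HETERO:FPGA,CPU` device string are placeholders for this setup.

```
import numpy as np
from openvino.inference_engine import IECore

ie = IECore()
# Placeholder IR paths.
net = ie.read_network(model="model.xml", weights="model.bin")
# HETERO falls back to CPU for layers the FPGA plugin does not support.
exec_net = ie.load_network(network=net, device_name="HETERO:FPGA,CPU")

input_name = next(iter(net.inputs))
output_name = next(iter(net.outputs))

# Dummy input with the network's declared shape, just to exercise inference.
dummy = np.zeros(net.inputs[input_name].shape, dtype=np.float32)
result = exec_net.infer({input_name: dummy})
print(result[output_name].shape)
```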
-
I want to know why the centerhead ONNX does not contain the decode part. If I build the centerhead decode part into the TensorRT *.engine, how would it influence the inference speed?
-
Hi,
I have fine-tuned Qwen2-VL using Llama-Factory.
I successfully quantized the fine-tuned model as shown below:
```
from transformers import Qwen2VLProcessor
from auto_gptq import BaseQuantizeC…
-
### Feature request
Allow passing `torch_dtype` in `model_kwargs`, as supported by sentence_transformers, when specifying the dtype in the infinity_emb v2 CLI and the InferenceEngine type is torch.
This would all…
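For illustration, a hedged sketch of what this request would map to on the sentence_transformers side, assuming sentence-transformers >= 2.3 where `model_kwargs` is forwarded to the underlying transformers model; the model name and dtype below are examples only, not infinity_emb defaults.

```
import torch
from sentence_transformers import SentenceTransformer

# torch_dtype inside model_kwargs is passed through to the transformers model,
# so the weights are loaded directly in the requested precision.
model = SentenceTransformer(
    "BAAI/bge-small-en-v1.5",          # example model, not a default
    model_kwargs={"torch_dtype": torch.float16},
)

embeddings = model.encode(["hello world"], convert_to_numpy=True)
print(embeddings.shape, embeddings.dtype)
```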
-
### Your current environment
Running isolated in a Docker container.
### How would you like to use Aphrodite?
I have the following question:
currently, NVLink support on new motherboards that …
-
Hi, when I use Medusa decoding on trtllm-090 with profiling, an error occurred as follows. Could you please help take a look? Thanks!
If I do not use `--run_profiling`, the inference process is nor…
-
## Description
Hi, I'm using multiple streams to improve TensorRT inference latency & throughput. Here is the inference code I modified from the TensorRT repo's example: [common_runtime.py](https://githu…
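For reference, a rough sketch of the multi-stream pattern (not the modified script itself), assuming TensorRT 8.x with `execute_async_v2` and pycuda for stream and memory management; the engine path, tensor shapes, and stream count are placeholders, and a single input/output binding is assumed. One engine is shared, while each stream gets its own execution context and device buffers so the enqueued work can overlap.

```
import numpy as np
import pycuda.autoinit  # noqa: F401  (creates a CUDA context)
import pycuda.driver as cuda
import tensorrt as trt

NUM_STREAMS = 2
in_shape, out_shape = (1, 3, 224, 224), (1, 1000)  # placeholder shapes

logger = trt.Logger(trt.Logger.WARNING)
with open("model.engine", "rb") as f:          # placeholder engine path
    engine = trt.Runtime(logger).deserialize_cuda_engine(f.read())

streams, contexts = [], []
host_in, host_out, dev_in, dev_out = [], [], [], []

for _ in range(NUM_STREAMS):
    streams.append(cuda.Stream())
    contexts.append(engine.create_execution_context())
    # Pinned host buffers so async copies can actually overlap.
    h_in = cuda.pagelocked_empty(int(np.prod(in_shape)), dtype=np.float32)
    h_out = cuda.pagelocked_empty(int(np.prod(out_shape)), dtype=np.float32)
    host_in.append(h_in)
    host_out.append(h_out)
    dev_in.append(cuda.mem_alloc(h_in.nbytes))
    dev_out.append(cuda.mem_alloc(h_out.nbytes))

# Enqueue all streams first, synchronize afterwards, so H2D copies,
# kernels, and D2H copies from different streams can overlap.
for i in range(NUM_STREAMS):
    cuda.memcpy_htod_async(dev_in[i], host_in[i], streams[i])
    contexts[i].execute_async_v2(
        bindings=[int(dev_in[i]), int(dev_out[i])],
        stream_handle=streams[i].handle,
    )
    cuda.memcpy_dtoh_async(host_out[i], dev_out[i], streams[i])

for s in streams:
    s.synchronize()
```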