-
# Bug Report
I am referring to [https://github.com/microsoft/onnxruntime-inference-examples/tree/main/quantization/language_model/llama/smooth_quant](https://github.com/microsoft/onnxruntime-inference…
-
I am getting 8.9 fps on YOLOv8n with TensorRT + C++.
-
Definition of the request payload for the chat completions endpoint.
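For context, such a request payload commonly follows the OpenAI-style chat completions schema. The sketch below is a minimal pydantic model; the class and field names are assumptions for illustration, not this project's actual definition.
```python
# Hypothetical sketch of an OpenAI-style chat completions request payload;
# field names and defaults are assumptions, not the project's actual schema.
from typing import List, Optional
from pydantic import BaseModel


class ChatMessage(BaseModel):
    role: str            # e.g. "system", "user", "assistant"
    content: str


class ChatCompletionRequest(BaseModel):
    model: str
    messages: List[ChatMessage]
    temperature: Optional[float] = None   # server default applies when omitted
    top_p: Optional[float] = None
    max_tokens: Optional[int] = None
    stream: bool = False
```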
-
When I try to evaluate my model or make predictions on the val set, I get the following error:
```
---------------------------------------------------------------------------
ValueError …
```
-
Dynamic LoRA (Low-Rank Adaptation) switching functionality, allowing users to change LoRA models on-the-fly during inference without reloading the entire model.
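One way this could look in practice is sketched below, using Hugging Face PEFT as an assumed backend; the model name and adapter paths are placeholders, and the actual implementation may differ.
```python
# Sketch of on-the-fly LoRA switching with Hugging Face PEFT (assumed backend);
# the model name and adapter paths below are placeholders.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")

# Load the base model once, then attach multiple adapters by name.
model = PeftModel.from_pretrained(base, "path/to/lora-adapter-a", adapter_name="a")
model.load_adapter("path/to/lora-adapter-b", adapter_name="b")


def generate_with(adapter: str, prompt: str) -> str:
    # Switching adapters only swaps the small LoRA weights;
    # the base model stays loaded in memory.
    model.set_adapter(adapter)
    inputs = tokenizer(prompt, return_tensors="pt")
    output = model.generate(**inputs, max_new_tokens=64)
    return tokenizer.decode(output[0], skip_special_tokens=True)


print(generate_with("a", "Summarize LoRA in one sentence."))
print(generate_with("b", "Summarize LoRA in one sentence."))
```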
-
### Request Description
I was trying to run inference with a CatBoost model via ONNX and ran into this error:
```
RuntimeError: Exception from src/inference/src/cpp/core.cpp:92:
Check 'error_…
```
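For reproduction, the export-and-run path would look roughly like the sketch below, here using onnxruntime only as a sanity check (the stack trace above appears to come from OpenVINO); the file name and dummy data are placeholders.
```python
# Minimal CatBoost -> ONNX export and inference sketch.
# Assumption: onnxruntime is used here only as a sanity check;
# the error above appears to come from the OpenVINO runtime instead.
import numpy as np
import onnxruntime as ort
from catboost import CatBoostClassifier

# Train a tiny throwaway model just to have something to export.
X = np.random.rand(100, 4).astype(np.float32)
y = (X[:, 0] > 0.5).astype(int)
model = CatBoostClassifier(iterations=50, verbose=False)
model.fit(X, y)

# CatBoost supports direct ONNX export for classification/regression models.
model.save_model("catboost_model.onnx", format="onnx")

sess = ort.InferenceSession("catboost_model.onnx")
input_name = sess.get_inputs()[0].name
outputs = sess.run(None, {input_name: X[:5]})
print(outputs)
```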
-
Definition of the response payload for the chat completions endpoint.
-
### Feature request
In the documentation, there is not enough information about the default values TGI enforces when a client request does not contain parameters such as `temperature`, `top_p`, `presence_frequency` …
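As a workaround until the defaults are documented, a client can pin these values explicitly on every call. A minimal sketch against a locally running TGI instance follows; the endpoint URL and the parameter values are assumptions, not recommended defaults.
```python
# Sketch: pin sampling parameters explicitly so TGI's undocumented defaults never apply.
# The endpoint URL and parameter values below are assumptions for a local TGI instance.
import requests

resp = requests.post(
    "http://localhost:8080/generate",
    json={
        "inputs": "What is Deep Learning?",
        "parameters": {
            "temperature": 0.7,
            "top_p": 0.9,
            "max_new_tokens": 128,
            "do_sample": True,
        },
    },
    timeout=60,
)
print(resp.json()["generated_text"])
```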
-
https://github.com/PaddlePaddle/PaddleSlim/tree/develop/example/auto_compression/ocr
Following this recipe with the ICDAR2015 dataset and a pretrained ResNet50 model (only the model configuration needs changing) runs successfully: accuracy is essentially unchanged, the speed is reduced to 1/4, and an Inference model is obtained. However, converting this model to ONNX fails with an error about a missing quantization configuration file (cali…
-
***Under Construction***
The Answer Engine, released in version 0.13, provides a Q&A interface for Tabby's users to interact with the LLM, optionally within the context of a connected repository. T…