-
### Feature request
This is a BERT-based model; however, when trying to run it, the message says the model is not supported: https://huggingface.co/meta-llama/Prompt-Guard-86M/tree/main
### Motivation
LLM-pow…
-
I have experimented with multiple models using ARM-NN on a Cortex-A53 (mostly int8-quantized models with latency < 200 ms), and I found that XNNPACK generally gives better latency results than ARM-NN. So I am tryin…
-
### OpenVINO Version
https://github.com/openvinotoolkit/openvino/tree/2d8ac08bf1f87f8ac455eae381213b52e781fe8c
### Operating System
Windows System
### Device used for inference
CPU
### Framework…
-
It looks like inference is not working for non-tree structures.
For example, consider the following simple factor graph with nodes x1, x2, x3 and factors fa, fb, fc.
```python
from fglib import grap…
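# A minimal illustration (plain Python, hypothetical edge list -- not fglib's
# API) of why such a graph can be non-tree, assuming fc connects x3 back to
# x1: a connected graph with as many edges as nodes must contain a cycle, and
# plain sum-product message passing has no leaf-to-root schedule on a cycle.
import itertools

edges = [("x1", "fa"), ("fa", "x2"), ("x2", "fb"),
         ("fb", "x3"), ("x3", "fc"), ("fc", "x1")]
node_names = set(itertools.chain.from_iterable(edges))
print(len(edges) >= len(node_names))  # True -> at least one cycle, not a tree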
-
It is expected that (at least) 3 openEO processes will become available which should be integrated into GTIF.
The integration concept would be similar to the one done for ship detections in the RACE …
-
### Describe the issue
Issue:
How to run inference for llava-next-72b/llava-next-110b?
There are many versions of your LLaVA, the code does not seem to be compatible across them, and there are mul…
-
# Bug Report
I am referring to [https://github.com/microsoft/onnxruntime-inference-examples/tree/main/quantization/language_model/llama/smooth_quant](https://github.com/microsoft/onnxruntime-inference…
-
### Motivation
The latest release of the Microsoft Phi-3 4.2B 128k-context vision model looks promising in both performance and resource savings, as it boasts just 4.2B parameters. So it would be a great f…
-
I'm not quite familiar with the Transformer model. There are more steps involved than in other models, with both an encoder and a decoder. For example, the last encoder block's output needs to serve as the input for the nex…
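A minimal sketch (plain NumPy, hypothetical shapes and a simplified single-head attention) of the data flow being described: the final encoder block's output is reused by the decoder as keys and values for cross-attention.

```python
import numpy as np

def attention(q, k, v):
    # Scaled dot-product attention with a numerically stable softmax.
    scores = q @ k.T / np.sqrt(k.shape[-1])
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ v

rng = np.random.default_rng(0)
d = 8
enc_out = rng.standard_normal((5, d))  # final encoder block output (5 source tokens)
dec_in = rng.standard_normal((3, d))   # decoder hidden states (3 target tokens)

# In each decoder block, cross-attention takes the decoder states as queries
# and the same final encoder output as both keys and values:
cross = attention(dec_in, enc_out, enc_out)
print(cross.shape)  # (3, 8)
```

Real implementations add learned projection matrices, multiple heads, masking, and residual connections, but the routing of the encoder output into every decoder block is the same.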
-
Hello mlcommons team,
I want to run the "Automated command to run the benchmark via MLCommons CM" (from the example: https://github.com/mlcommons/inference/tree/master/language/llama2-70b), but I d…