-
Hi, thank you for your open-source work. I have a few questions about inference with quantized models.
(1) For a model with only W8A8 quantization, where the KV cache is not quantized, whether the fo…
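To make the W8A8 setting concrete: weights and activations are each mapped to int8 with a per-tensor scale, while the KV cache stays in floating point. The sketch below is a minimal pure-Python illustration of symmetric int8 round-trip quantization; the function names are illustrative and not from any particular project.

```python
# Illustrative sketch of symmetric per-tensor int8 quantization, the
# building block of W8A8 schemes. Names here are hypothetical.

def quantize_int8(values):
    """Map floats to int8 codes with one symmetric per-tensor scale."""
    scale = max(abs(v) for v in values) / 127.0 or 1.0  # avoid scale == 0
    q = [max(-128, min(127, round(v / scale))) for v in values]
    return q, scale

def dequantize(q, scale):
    return [v * scale for v in q]

weights = [0.5, -1.27, 0.03, 1.0]
q, scale = quantize_int8(weights)
approx = dequantize(q, scale)

# Round-trip error is bounded by half the quantization step (scale / 2).
assert all(abs(a - b) <= scale / 2 + 1e-9 for a, b in zip(weights, approx))
```

The same mapping is applied to activations at runtime; with an unquantized KV cache, the attention inputs read from the cache skip this step entirely.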
-
### What version of `drizzle-orm` are you using?
0.27.2
### What version of `drizzle-kit` are you using?
0.19.12
### Describe the Bug
My prepared statement is correctly returning the data I would…
-
RNNs cannot be JIT-compiled; see the error below:
```
Detected unsupported operations when trying to compile graph __inference_one_step_on_data_993[] on XLA_GPU_JIT: CudnnRNN (No registered 'CudnnRNN' Op…
```
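The `CudnnRNN` op has no XLA kernel, so one common workaround is to opt the model out of XLA compilation entirely. A minimal sketch (the model architecture here is a placeholder, not the reporter's model):

```python
import tensorflow as tf

# Hypothetical model; the point is the compile() flag below.
model = tf.keras.Sequential([
    tf.keras.layers.LSTM(32, input_shape=(None, 8)),
    tf.keras.layers.Dense(1),
])

# jit_compile=False keeps the train/predict steps off XLA, avoiding the
# unsupported CudnnRNN op at the cost of losing XLA fusion elsewhere.
model.compile(optimizer="adam", loss="mse", jit_compile=False)
```

Alternatively, RNN configurations that fall off the cuDNN fast path (e.g. non-default activations or `unroll=True`) use plain ops and may compile under XLA, though usually more slowly.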
-
## ❓ Questions and Help
Hi,
Please, I cannot find where the inference weights used for Mask R-CNN in demo.ipynb are stored. Also, if I do a training run on COCO, where are my weights saved and where is …
-
Now we have a TFLite model without any optimizations. Please add some optimizations and corresponding benchmarks for it.
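For reference, the simplest optimization to benchmark first is post-training dynamic-range quantization via the converter's `optimizations` flag. A sketch, assuming the model is exported as a SavedModel (the paths are placeholders):

```python
import tensorflow as tf

# Post-training dynamic-range quantization: weights stored as int8,
# activations quantized dynamically at runtime. Paths are placeholders.
converter = tf.lite.TFLiteConverter.from_saved_model("path/to/saved_model")
converter.optimizations = [tf.lite.Optimize.DEFAULT]
tflite_model = converter.convert()

with open("model_optimized.tflite", "wb") as f:
    f.write(tflite_model)
```

The resulting file can then be compared against the unoptimized model for size and latency (e.g. with TFLite's `benchmark_model` tool) to produce the requested benchmarks.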
-
### Issue confirmation / Search before asking
- [x] I have searched the [issues](https://github.com/PaddlePaddle/PaddleDetection/issues) and the same bug has not been reported. I have searched the [issues](https://github.com/PaddlePaddle/PaddleDetection/issue…
-
Thanks, I wanted to try your Triton version, but I only have 8 GB RAM.
The GPTQ CUDA version works (7B model); your version (the ppl script) crashes with a CUDA OOM.
Is that to be expected or c…
-
```
import threading

import torch

def foo(x, y):
    a = torch.sin(x)
    b = torch.cos(y)
    return a + b

opt_foo1 = torch.compile(foo, mode="max-autotune")

threads = []
for _ in rang…
```
-
### Describe the issue
I build a class to create the model and run inference. In initialization, I create random data and run it once.
But when I run other data, the first inference is still very slow. Why?
If I wait…
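One common cause of this pattern is that dynamic-shape backends pay a setup cost (kernel selection, memory allocation) per input shape, so a warm-up run only helps later inputs with the same shape. The pure-Python sketch below uses a hypothetical stand-in session to illustrate the behavior; it is not the runtime's actual implementation.

```python
import time

class DummySession:
    """Stand-in for a runtime that compiles a kernel per input shape,
    paying the cost on first use of each shape (hypothetical)."""
    def __init__(self):
        self._compiled_shapes = set()

    def run(self, data):
        shape = len(data)
        if shape not in self._compiled_shapes:
            time.sleep(0.05)          # simulate one-time per-shape setup
            self._compiled_shapes.add(shape)
        return [x * 2 for x in data]

session = DummySession()
session.run([0.0])                    # warm-up with shape 1

start = time.perf_counter()
session.run([1.0, 2.0, 3.0])          # new shape: pays setup again
first = time.perf_counter() - start

start = time.perf_counter()
out = session.run([4.0, 5.0, 6.0])    # same shape as previous: fast now
second = time.perf_counter() - start

assert out == [8.0, 10.0, 12.0]
assert second < first                 # setup cost already paid for shape 3
```

The practical takeaway: warm up with dummy data of the same shape(s) as the real inputs, not just any random tensor.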
-
When I increase the batch size to 2, I receive an error in boxlist_ops.py.
This is for batch = 2:
This is the shape of proposals before going into the cat_boxlist method:
len - 2
[BoxList(num_boxe…