-
The RNN cannot be JIT-compiled; see the error below:
```
Detected unsupported operations when trying to compile graph __inference_one_step_on_data_993[] on XLA_GPU_JIT: CudnnRNN (No registered 'CudnnRNN' Op…
```
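For reference, a minimal sketch of the kind of setup that can raise this error, assuming a `tf.keras` LSTM trained with `jit_compile=True`; the shapes and layer sizes are placeholders, not taken from the report above:

```python
import numpy as np
from tensorflow import keras

# Hypothetical minimal model: a single LSTM layer.
# With jit_compile=True the whole train step is lowered to XLA, but the fused
# GPU implementation of the recurrent layer uses the CudnnRNN op, which XLA
# cannot compile, producing an error like the one shown above.
model = keras.Sequential([
    keras.Input(shape=(20, 8)),
    keras.layers.LSTM(32),
    keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse", jit_compile=True)

x = np.random.rand(64, 20, 8).astype("float32")
y = np.random.rand(64, 1).astype("float32")
model.fit(x, y, epochs=1)  # fails on GPU with the CudnnRNN error
```

If the fused cuDNN kernel is the culprit, a configuration that falls back to the generic RNN kernel (for example a non-default activation) avoids the `CudnnRNN` op, at the cost of speed.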
-
When I increase the batch size to 2, I get an error in boxlist_ops.py.
This is for batch = 2:
This is the shape of the proposals before they go into the cat_boxlist method:
len - 2
[BoxList(num_boxe…
-
I tried to run YOLO models in Caffe (tools are available online to do the conversion). However, I noticed that the inference time is much longer in Caffe compared to the Darknet framework. On my Quadro …
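A rough timing sketch for the Caffe side, assuming pycaffe and the prototxt/caffemodel produced by the converter (the file names and the `data` blob name are placeholders):

```python
import time

import numpy as np
import caffe

caffe.set_mode_gpu()
caffe.set_device(0)

# Hypothetical file names from the conversion step.
net = caffe.Net("yolo.prototxt", "yolo.caffemodel", caffe.TEST)

# Dummy input matching the network's input blob shape.
net.blobs["data"].data[...] = np.random.rand(*net.blobs["data"].data.shape)

net.forward()  # warm-up, excluded from timing
start = time.time()
for _ in range(100):
    net.forward()
print("mean forward time: %.2f ms" % ((time.time() - start) / 100 * 1000))
```

Darknet's "Predicted in … seconds" output is roughly the forward pass only, so timing just `net.forward()` after a warm-up is the closest like-for-like comparison.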
-
#### Tested on CF versions `3.34.0` and `3.42.0`
----
I executed `WPI` on the [NJR](https://zenodo.org/records/6314162) benchmarks to infer annotations for the [Nullness Checker](https://checkerframewor…
-
```
import threading
import torch

def foo(x, y):
    a = torch.sin(x)
    b = torch.cos(y)
    return a + b

opt_foo1 = torch.compile(foo, mode="max-autotune")
threads = []
for _ in rang…
```
-
Thanks, I wanted to try your Triton version, but I only have 8 GB of RAM.
The GPTQ CUDA version works (7B model). Your version (the ppl script) crashes with a CUDA OOM.
Is that to be expected or c…
-
Thanks for participating in the TVM community! We use https://discuss.tvm.ai for any general usage questions and discussions. The issue tracker is used for actionable items such as feature proposals, d…
-
### Describe the issue
I built a class that creates the model and runs inference. In initialization, it creates random data and runs one inference.
But when I run other data, the first inference is very slow. Why?
If I wait…
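A minimal sketch of the pattern described, assuming this is ONNX Runtime with the CUDA provider; the model path, input dtype, and warm-up shape are placeholders:

```python
import numpy as np
import onnxruntime as ort

class Model:
    """Hypothetical wrapper mirroring the class described above."""

    def __init__(self, model_path="model.onnx"):
        self.session = ort.InferenceSession(
            model_path,
            providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
        )
        inp = self.session.get_inputs()[0]
        self.input_name = inp.name
        # Replace any dynamic dimensions with 1 for the warm-up tensor (assumption).
        shape = [d if isinstance(d, int) else 1 for d in inp.shape]
        # Warm-up run in __init__ with random data, as in the report above.
        self.session.run(None, {self.input_name: np.random.rand(*shape).astype(np.float32)})

    def infer(self, data):
        return self.session.run(None, {self.input_name: data})
```

One thing worth checking: if the real data's shape differs from the warm-up shape, execution providers that specialize work per shape (for example cuDNN convolution algorithm search, or a TensorRT engine build) redo that work on the first run with the new shape, which would make the first "real" inference slow even after a warm-up.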
-
I have issues with the NPU for dlstreamer in a Docker container. The build with the NPU driver installation completed without issue. But when I ran the dlstreamer pipeline, there was an error and the pipeline un…
-
Hi,
I tried AWQ quantization on codellama-13b following https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples/llama. After testing, it was very slow: 1.5 times slower than the floa…