-
**Describe the Issue**
Mistral/NVIDIA recently released [Nemo 12B](https://mistral.ai/news/mistral-nemo/) and llama.cpp has [added support](https://github.com/ggerganov/llama.cpp/pull/8579) for its …
-
Migrate the Caffe2/MKL-DNN int8 operations to support the ATen/JIT backend and align with the qint8 direction in PyTorch/ATen.
**Motivation**
With Cascade Lake/VNNI, MKL-DNN int8 functions can speed up DL m…
-
### 🐛 Describe the bug
When trying to `torch.compile` a module that contains `torch.clear_autocast_cache`, we get the attached error. I believe this is expected, but am wondering if there is an establishe…
-
When running newer versions (from 3.3.0 and higher) with any model, the JVM crashes:
Extracted 'ggml.dll' to 'C:\Users\user\AppData\Local\Temp\ggml.dll'
Extracted 'llama.dll' to 'C:\Users\user\AppDat…
-
# Ask a Question
Since the GPU machines of CI have been upgraded from NV6 to T4, it looks like quantized models on GPU should be added too.
`Hardware support is required to achieve better performance with…
-
### 🐛 Describe the bug
Reproducing step:
1. enable `test/inductor/test_torchinductor_opinfo.py` with this PR:
https://github.com/pytorch/pytorch/pull/134556
2. `python test/inductor/test_torchin…
-
I’ve discovered a performance gap between the Neural Speed MatMul operator and the llama.cpp operator in the Neural-Speed repository. This issue was identified while running a benchmark with the ONNXR…
-
Add the following:
```python
import os
os.environ['TF_CPP_MIN_LOG_LEVEL'] = '2'
```
to reduce some of these warnings, according to [Stack Overflow](https://stackoverflow.com/questions/66092421/how-to-rebui…
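One caveat worth noting: `TF_CPP_MIN_LOG_LEVEL` is read by TensorFlow's C++ backend when it first initializes, so the assignment must happen before the first `import tensorflow`. A minimal sketch (assuming TensorFlow is installed; the import itself is left commented out):

```python
import os

# The C++ log filter is read once, at TensorFlow initialization, so this
# assignment must come before the first `import tensorflow`.
# '0' = all messages, '1' = hide INFO, '2' = hide INFO and WARNING,
# '3' = hide everything except FATAL.
os.environ['TF_CPP_MIN_LOG_LEVEL'] = '2'

# import tensorflow as tf  # import only after the variable is set
```

Setting the variable in the shell (`export TF_CPP_MIN_LOG_LEVEL=2`) before launching Python achieves the same effect without relying on import order.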
-
### Describe the bug
To fine-tune a model on a Xeon CPU, we are following the [ai-reference-models/models_v2/pytorch/llama/training/cpu at main · intel/ai-reference-models (github.com)](https://github.com…
-
### Question
Hello, I have two questions:
**1. I used the same jsonl results, but the evaluated scores were different; the results are shown below.**
`
2023-05-29 09:07:24.546966: I tensorflow/c…