-
**Problem**
Lack of support for VNNI
**Success Criteria**
**Additional context**
-
### Describe the feature request
Wasm Relaxed SIMD includes integer dot product instructions, which will map to VNNI instructions on X86-64 platforms with AVX-VNNI (on ARM maybe SDOT, but I haven't t…
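For reference, a scalar sketch of what these dot-product instructions compute. This mirrors the per-lane behavior of x86's VPDPBUSD: in each 32-bit lane, four u8 × i8 products are summed and added to an i32 accumulator without saturation (ARM's SDOT is the analogous i8 × i8 form). The function name is illustrative, not from any library:

```rust
/// Scalar model of a 128-bit u8 x i8 dot-product-accumulate,
/// the per-lane behavior of x86 VPDPBUSD. Illustrative only.
fn dot_u8i8_accum(acc: [i32; 4], a: [u8; 16], b: [i8; 16]) -> [i32; 4] {
    let mut out = acc;
    for lane in 0..4 {
        let mut sum: i32 = 0;
        for k in 0..4 {
            let i = lane * 4 + k;
            // widen both bytes to i32, multiply, sum the four products
            sum += i32::from(a[i]) * i32::from(b[i]);
        }
        // VPDPBUSD accumulates without saturation
        out[lane] = out[lane].wrapping_add(sum);
    }
    out
}

fn main() {
    // each lane sums four 3 * 2 products -> 24, added to acc lane 100
    let r = dot_u8i8_accum([100; 4], [3; 16], [2; 16]);
    println!("{:?}", r); // [124, 124, 124, 124]
}
```

A hardware instruction does all lanes in one step; the relaxed-SIMD wording leaves the exact signedness of the operands implementation-defined so it can lower to either VPDPBUSD or SDOT.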
-
I've been working on securing the user input, escaping invalid characters; however, I've encountered a few prompts which cause llama-cli to abruptly halt:
```
.\llama-cli.exe --model "..\..\..\mod…
-
### What happened?
I use `-i` `-if` and the flags are ignored, and it exits with "input is empty"
llama_new_context_with_model: graph nodes = 2246
llama_new_context_with_model: graph splits = 1
co…
-
Some Intel Xeon server CPUs (for example _Xeon Platinum 8171M_ or _Xeon Platinum 8272CL_) support the VNNI instructions. Is this something which could be used for better performance, or is it not suited fo…
-
### Background and motivation
There already is support for the AVX-VNNI hardware instruction set with 128-/256-bit vectors, and it would be good to have the same support for 512-bit vectors. (ve…
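As a width-agnostic sketch of what the 512-bit form adds: the operation is the same per-lane u8 × i8 multiply-accumulate, only the i32 lane count grows from 4 (128-bit) or 8 (256-bit) to 16 (512-bit). Names below are illustrative, not the proposed API:

```rust
/// Scalar model of a VNNI-style u8 x i8 dot-product-accumulate over an
/// arbitrary vector width: `acc` holds width/32 lanes, `a`/`b` hold
/// width/8 bytes. 128-bit => 4 lanes, 256-bit => 8, 512-bit => 16.
fn dot_accum_widened(acc: &mut [i32], a: &[u8], b: &[i8]) {
    assert_eq!(a.len(), acc.len() * 4);
    assert_eq!(b.len(), acc.len() * 4);
    for (lane, slot) in acc.iter_mut().enumerate() {
        let mut sum: i32 = 0;
        for k in 0..4 {
            let i = lane * 4 + k;
            sum += i32::from(a[i]) * i32::from(b[i]);
        }
        // accumulate without saturation, as in the non-saturating form
        *slot = slot.wrapping_add(sum);
    }
}

fn main() {
    // 512-bit shape: 16 i32 lanes over 64 bytes of input
    let mut acc = [0i32; 16];
    dot_accum_widened(&mut acc, &[1u8; 64], &[5i8; 64]);
    println!("{:?}", &acc[..4]); // first lanes: [20, 20, 20, 20]
}
```

The 512-bit variant is attractive for int8 GEMM kernels, where one instruction replaces a widen-multiply-add sequence per lane.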
-
The WASM [Relaxed SIMD](https://github.com/WebAssembly/relaxed-simd) instructions were stabilized in [Rust v1.82](https://blog.rust-lang.org/2024/10/17/Rust-1.82.0.html#stabilized-apis).
This inclu…
-
@tomaarsen
Just wanted to know if CLIP (text + image) embedding models will have an ONNX quantized model? I tried finding it everywhere but had no luck. If it is there, can you please point me to it?…
-
I am running ollama on an i7-14700K, which supports AVX2 and AVX_VNNI, and a GeForce GTX 1060.
After reading #2205, I enabled `OLLAMA_DEBUG=1` to check whether ollama utilizes AVX2 on this CPU. But unlike th…
-
I have an Intel CPU that supports a number of AVX features, but most of them are not picked up when using ollama. Below is the llama.log file:
system info: AVX = 1 | AVX2 = 0 | AVX512 = 0 | AVX512_…