-
I manually downloaded the model and set it up with the command `python setup_env.py -md .\models\Llama3-8B-1.58-100B-tokens -q i2_s` on Windows 11. The result shows:
"ERROR:root:Error occurred…
-
When the whisper model is loaded, it prints a lot of initialization information to the console. I'd like to be able to redirect this to a separate log file and silence the console output.
`llama-c…
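For illustration, a minimal sketch of one way to do this via the logging callback APIs (whisper.cpp exposes `whisper_log_set`, and llama.cpp has the analogous `llama_log_set`). The file name, callback, and model path below are illustrative assumptions, not the project's actual solution:

```cpp
// Sketch: route whisper.cpp initialization logging to a file instead of the console.
// Assumes whisper.h's whisper_log_set() and the ggml_log_callback signature.
#include <cstdio>
#include "whisper.h"

static FILE * g_log_file = nullptr;

static void log_to_file(enum ggml_log_level level, const char * text, void * /*user_data*/) {
    (void) level;
    if (g_log_file) {
        fputs(text, g_log_file);   // everything goes to the log file, nothing to stderr
        fflush(g_log_file);
    }
}

int main() {
    g_log_file = fopen("whisper_init.log", "w");
    whisper_log_set(log_to_file, nullptr);   // install the callback before loading the model

    struct whisper_context_params cparams = whisper_context_default_params();
    struct whisper_context * ctx = whisper_init_from_file_with_params("ggml-base.en.bin", cparams);

    // ... run inference with ctx ...

    whisper_free(ctx);
    fclose(g_log_file);
    return 0;
}
```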
-
Following the build instructions in the readme,
```
cmake .. -G "Ninja" -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ -DSD_HIPBLAS=ON -DCMAKE_BUILD_TYPE=Release -DAMDGPU_TARGETS=gfx1100
…
-
When running rkllm, after the user content is entered, the robot's reply reports an error:
robot: :0: GGML_ASSERT(view_src == NULL || data_size == 0 || data_size + view_offs
-
As per recent discussions (e.g. https://github.com/ggerganov/llama.cpp/pull/10144#pullrequestreview-2411814357), we should split the large `ggml-cpu.c` implementation into smaller modules - similar to…
-
I downloaded the weights from https://huggingface.co/shuttleai/shuttle-3-diffusion; the program loaded the weights and then exited with no error message.
I debugged the program, and it seems that the problem i…
-
The usual behavior for the "mean" operation in numerical frameworks is a reduction of a tensor to a single value. However, in GGML this operation instead calculates the mean *per row*. This is I think…
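For illustration, a minimal sketch of the per-row behavior described above, assuming the `ggml_mean` API and the CPU graph-compute helper; the memory size and tensor values are arbitrary:

```cpp
// Sketch: ggml_mean reduces along ne0 only, producing one mean per row.
#include <cstdio>
#include "ggml.h"
// Note: newer ggml trees may also need #include "ggml-cpu.h" for ggml_graph_compute_with_ctx.

int main() {
    struct ggml_init_params params = { /*.mem_size =*/ 16 * 1024 * 1024, /*.mem_buffer =*/ NULL, /*.no_alloc =*/ false };
    struct ggml_context * ctx = ggml_init(params);

    // 2 rows x 3 columns (ne0 = 3, ne1 = 2)
    struct ggml_tensor * a = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 3, 2);
    float * d = (float *) a->data;
    for (int i = 0; i < 6; ++i) d[i] = (float) i;   // rows: {0,1,2} and {3,4,5}

    struct ggml_tensor * m = ggml_mean(ctx, a);     // result shape [1, 2]: one mean per row, not a scalar

    struct ggml_cgraph * gf = ggml_new_graph(ctx);
    ggml_build_forward_expand(gf, m);
    ggml_graph_compute_with_ctx(ctx, gf, 1);

    printf("row means: %f %f\n", ((float *) m->data)[0], ((float *) m->data)[1]);   // 1.0 and 4.0

    ggml_free(ctx);
    return 0;
}
```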
-
**Is your feature request related to a problem? Please describe.**
Currently, binary TTNN operators follow the NumPy broadcasting rules, i.e. only dimensions of (implied) 1 can be broadcast. E.g. the…
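For reference, a small sketch of the NumPy-style compatibility rule referred to here: shapes are aligned from the trailing dimension, and two sizes are compatible only if they are equal or one of them is 1. The helper name and example shapes are hypothetical:

```cpp
// Sketch of the NumPy broadcasting compatibility check (not TTNN code).
#include <algorithm>
#include <cstdint>
#include <cstdio>
#include <vector>

static bool numpy_broadcastable(const std::vector<int64_t> & a, const std::vector<int64_t> & b) {
    size_t na = a.size(), nb = b.size(), n = std::max(na, nb);
    for (size_t i = 0; i < n; ++i) {
        int64_t da = i < na ? a[na - 1 - i] : 1;   // missing leading dims count as 1
        int64_t db = i < nb ? b[nb - 1 - i] : 1;
        if (da != db && da != 1 && db != 1) return false;
    }
    return true;
}

int main() {
    printf("%d\n", numpy_broadcastable({32, 1, 64}, {1, 128, 64}));   // 1: each pair is equal or 1
    printf("%d\n", numpy_broadcastable({32, 2, 64}, {32, 3, 64}));    // 0: 2 vs 3, neither is 1
    return 0;
}
```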
-
~~When cross-compiling for Android using the NDK toolchain, Flash Attention fails to build in CPU-only mode but succeeds when the Vulkan backend is enabled, despite being documented as a CPU-only feature.~~
…
-
### Problem Description
Adding the new gfx model gfx1151 on Linux: it builds on Linux, and I can also build llama.cpp with the rocWMMA patch
https://github.com/ggerganov/llama.cpp/pull/7011/commits to …