Open CallMeAl1973 opened 1 year ago
Probably it's the temperature throttle.
Any way to turn it off or adjust parameters? The fans aren’t even noticeably running. Will need to figure out how to get a temperature report from the Linux CLI.
At Windows I use ThrottleStop to configure parameters like PROCHOT.
I don't know a Linux similar.
https://www.techpowerup.com/download/techpowerup-throttlestop/
You can also try adjusting the process priority to see if it makes a difference.
I’ve tried the –highpriority parameter and have also tried to renice the process to -20. Neither of those settings seem to have any significant effect. The CPUs clock down to 400 MHz either way. The best speed I’ve gotten was by actually disabling Speed Shift in the bios. Doing that locks my CPU clock speed at 1.4 GHz. That takes my performance on the airoboros-33b-gpt4-1.4.ggmlv3.q3_K_S.bin model from about 0.8T/s to 1.2T/s.
Another thing to try is messing with the thread counts. You can also check out the latest version which fixed a few earlier speed regressions.
So this is getting really strange. I’ve played with this some more and I’m actually seeing better performance when I use the powersave governor than when I use the performance governor. With the performance governor, I’m seeing about 0.8 T/s on 30b models, while with the powersave governor, I’m seeing about 1.3 t/s with the same models. Is there any way to roughly estimate how many t/s I should be expecting from my system? This is a Lenovo P360 tiny with an i9-12900T, 64 gigs ofDDR5,NVIDIA T1000, and an NVMe SSD.
I can only compare against my own machine. For 30b, I get about 900ms per token, I am offloading 10 layers to my RTX2060 and the rest is on Intel 9th gen i7 (cant remember exact model)
Hi all, I just noticed something interesting. For some reason, when koboldcpp is generating tokens, the clock speeds of my CPUs are actually slowing way down. This is a Lenovo P360 Tiny. The computer has a 12th gen i9, an NVME SSD, 64 gigs of ram and is running Debian 12. I used the cpufreq-info utility twice to gather the below info. The first snapshot was with Koboldcpp running but not generating tokens. The second snapshot was during the token generation process. Is this behavior expected? If not, how can I help to gather more useful info? Thanks,
Before
cpufrequtils 008: cpufreq-info (C) Dominik Brodowski 2004-2009 Report errors and bugs to cpufreq@vger.kernel.org, please. analyzing CPU 0: driver: intel_pstate CPUs which run at the same hardware frequency: 0 CPUs which need to have their frequency coordinated by software: 0 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 1: driver: intel_pstate CPUs which run at the same hardware frequency: 1 CPUs which need to have their frequency coordinated by software: 1 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 2: driver: intel_pstate CPUs which run at the same hardware frequency: 2 CPUs which need to have their frequency coordinated by software: 2 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 3: driver: intel_pstate CPUs which run at the same hardware frequency: 3 CPUs which need to have their frequency coordinated by software: 3 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 4: driver: intel_pstate CPUs which run at the same hardware frequency: 4 CPUs which need to have their frequency coordinated by software: 4 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 5: driver: intel_pstate CPUs which run at the same hardware frequency: 5 CPUs which need to have their frequency coordinated by software: 5 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 6: driver: intel_pstate CPUs which run at the same hardware frequency: 6 CPUs which need to have their frequency coordinated by software: 6 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.29 GHz. analyzing CPU 7: driver: intel_pstate CPUs which run at the same hardware frequency: 7 CPUs which need to have their frequency coordinated by software: 7 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 800 MHz. analyzing CPU 8: driver: intel_pstate CPUs which run at the same hardware frequency: 8 CPUs which need to have their frequency coordinated by software: 8 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.90 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.90 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 9: driver: intel_pstate CPUs which run at the same hardware frequency: 9 CPUs which need to have their frequency coordinated by software: 9 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.90 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.90 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 10: driver: intel_pstate CPUs which run at the same hardware frequency: 10 CPUs which need to have their frequency coordinated by software: 10 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.90 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.90 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 11: driver: intel_pstate CPUs which run at the same hardware frequency: 11 CPUs which need to have their frequency coordinated by software: 11 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.90 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.90 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 12: driver: intel_pstate CPUs which run at the same hardware frequency: 12 CPUs which need to have their frequency coordinated by software: 12 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 13: driver: intel_pstate CPUs which run at the same hardware frequency: 13 CPUs which need to have their frequency coordinated by software: 13 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 14: driver: intel_pstate CPUs which run at the same hardware frequency: 14 CPUs which need to have their frequency coordinated by software: 14 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 15: driver: intel_pstate CPUs which run at the same hardware frequency: 15 CPUs which need to have their frequency coordinated by software: 15 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 799 MHz. analyzing CPU 16: driver: intel_pstate CPUs which run at the same hardware frequency: 16 CPUs which need to have their frequency coordinated by software: 16 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 17: driver: intel_pstate CPUs which run at the same hardware frequency: 17 CPUs which need to have their frequency coordinated by software: 17 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 18: driver: intel_pstate CPUs which run at the same hardware frequency: 18 CPUs which need to have their frequency coordinated by software: 18 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 19: driver: intel_pstate CPUs which run at the same hardware frequency: 19 CPUs which need to have their frequency coordinated by software: 19 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 20: driver: intel_pstate CPUs which run at the same hardware frequency: 20 CPUs which need to have their frequency coordinated by software: 20 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 21: driver: intel_pstate CPUs which run at the same hardware frequency: 21 CPUs which need to have their frequency coordinated by software: 21 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 22: driver: intel_pstate CPUs which run at the same hardware frequency: 22 CPUs which need to have their frequency coordinated by software: 22 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 23: driver: intel_pstate CPUs which run at the same hardware frequency: 23 CPUs which need to have their frequency coordinated by software: 23 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz.
While
cpufrequtils 008: cpufreq-info (C) Dominik Brodowski 2004-2009 Report errors and bugs to cpufreq@vger.kernel.org, please. analyzing CPU 0: driver: intel_pstate CPUs which run at the same hardware frequency: 0 CPUs which need to have their frequency coordinated by software: 0 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 1: driver: intel_pstate CPUs which run at the same hardware frequency: 1 CPUs which need to have their frequency coordinated by software: 1 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 2: driver: intel_pstate CPUs which run at the same hardware frequency: 2 CPUs which need to have their frequency coordinated by software: 2 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 3: driver: intel_pstate CPUs which run at the same hardware frequency: 3 CPUs which need to have their frequency coordinated by software: 3 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 471 MHz. analyzing CPU 4: driver: intel_pstate CPUs which run at the same hardware frequency: 4 CPUs which need to have their frequency coordinated by software: 4 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 5: driver: intel_pstate CPUs which run at the same hardware frequency: 5 CPUs which need to have their frequency coordinated by software: 5 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 6: driver: intel_pstate CPUs which run at the same hardware frequency: 6 CPUs which need to have their frequency coordinated by software: 6 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 7: driver: intel_pstate CPUs which run at the same hardware frequency: 7 CPUs which need to have their frequency coordinated by software: 7 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 8: driver: intel_pstate CPUs which run at the same hardware frequency: 8 CPUs which need to have their frequency coordinated by software: 8 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.90 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.90 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 9: driver: intel_pstate CPUs which run at the same hardware frequency: 9 CPUs which need to have their frequency coordinated by software: 9 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.90 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.90 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 10: driver: intel_pstate CPUs which run at the same hardware frequency: 10 CPUs which need to have their frequency coordinated by software: 10 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.90 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.90 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 11: driver: intel_pstate CPUs which run at the same hardware frequency: 11 CPUs which need to have their frequency coordinated by software: 11 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.90 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.90 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 1.40 GHz. analyzing CPU 12: driver: intel_pstate CPUs which run at the same hardware frequency: 12 CPUs which need to have their frequency coordinated by software: 12 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 13: driver: intel_pstate CPUs which run at the same hardware frequency: 13 CPUs which need to have their frequency coordinated by software: 13 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 14: driver: intel_pstate CPUs which run at the same hardware frequency: 14 CPUs which need to have their frequency coordinated by software: 14 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 15: driver: intel_pstate CPUs which run at the same hardware frequency: 15 CPUs which need to have their frequency coordinated by software: 15 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 4.80 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 4.80 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 480 MHz. analyzing CPU 16: driver: intel_pstate CPUs which run at the same hardware frequency: 16 CPUs which need to have their frequency coordinated by software: 16 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 17: driver: intel_pstate CPUs which run at the same hardware frequency: 17 CPUs which need to have their frequency coordinated by software: 17 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 18: driver: intel_pstate CPUs which run at the same hardware frequency: 18 CPUs which need to have their frequency coordinated by software: 18 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 19: driver: intel_pstate CPUs which run at the same hardware frequency: 19 CPUs which need to have their frequency coordinated by software: 19 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 20: driver: intel_pstate CPUs which run at the same hardware frequency: 20 CPUs which need to have their frequency coordinated by software: 20 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 21: driver: intel_pstate CPUs which run at the same hardware frequency: 21 CPUs which need to have their frequency coordinated by software: 21 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 22: driver: intel_pstate CPUs which run at the same hardware frequency: 22 CPUs which need to have their frequency coordinated by software: 22 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz. analyzing CPU 23: driver: intel_pstate CPUs which run at the same hardware frequency: 23 CPUs which need to have their frequency coordinated by software: 23 maximum transition latency: 4294.55 ms. hardware limits: 800 MHz - 3.60 GHz available cpufreq governors: performance, powersave current policy: frequency should be within 800 MHz and 3.60 GHz. The governor "performance" may decide which speed to use within this range. current CPU frequency is 400 MHz.
Prerequisites
Please answer the following questions for yourself before submitting an issue.
Expected Behavior
Please provide a detailed written description of what you were trying to do, and what you expected
llama.cpp
to do.Current Behavior
Please provide a detailed written description of what
llama.cpp
did, instead.Environment and Context
Please provide detailed information about your computer setup. This is important in case the issue is not reproducible except for under certain specific conditions.
$ lscpu
$ uname -a
Failure Information (for bugs)
Please help provide information about the failure if this is a bug. If it is not a bug, please remove the rest of this template.
Steps to Reproduce
Please provide detailed steps for reproducing the issue. We are not sitting in front of your screen, so the more detail the better.
Failure Logs
Please include any relevant log snippets or files. If it works under one configuration but not under another, please provide logs for both configurations and their corresponding outputs so it is easy to see where behavior changes.
Also, please try to avoid using screenshots if at all possible. Instead, copy/paste the console output and use Github's markdown to cleanly format your logs for easy readability.
Example environment info:
Example run with the Linux command perf