-
### Your current environment
```text
The output of `python collect_env.py`
```
```
Versions of relevant libraries:
[pip3] mypy-extensions==0.4.3
[pip3] numpy==1.23.5
[pip3] torch==2.0.1+cu11…
-
Now Llama 3.1 is out, but sadly it is not loadable with the current text-generation-webui. I tried updating the transformers library, which makes the model loadable, but I then get an error when trying to use …
-
### What are you trying to do?
I want to know whether or not my computer can support the model, but currently there is no way to find out.
### How should we solve this?
Add the memory needed for each model tag i…
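Until per-tag memory metadata exists, a rough back-of-the-envelope estimate can be made from the parameter count and weight dtype alone. The sketch below is illustrative (the function name and dtype table are assumptions, not an existing API) and covers only the weights themselves, ignoring KV-cache and activation overhead, which add more at runtime:

```python
# Rough VRAM estimate for loading a model's weights.
# Illustrative sketch only -- not an existing text-generation-webui API.

BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2, "int8": 1, "int4": 0.5}

def estimate_weight_memory_gb(num_params: float, dtype: str = "fp16") -> float:
    """Approximate GiB needed just for the weights.

    Excludes KV cache and activation memory, which grow with
    context length and batch size at inference time.
    """
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3

# e.g. a 7B-parameter model in fp16:
print(round(estimate_weight_memory_gb(7e9, "fp16"), 1))  # → 13.0
```

A table of such estimates per model tag (plus a stated KV-cache margin) would let users compare against their available VRAM before downloading.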
-
When trying to run the talk-llama example code with OpenCL enabled using a NVIDIA GeForce GT 755M, I get the following crash:
```
% LC_ALL=C ./obj-x86_64-linux-gnu/bin/talk-llama -mw ../nb-large-g…
-
The Coqui engine pauses mid-sentence to load. It sometimes pauses between words or even in the middle of saying a word. I tried adjusting the settings, but nothing works. I use a 10th-gen i7 and RTX 3060 computer.
-
### Bug report info
```text
➜ llm_playground git:(main) act --bug-report
act version: 0.2.60
GOOS: darwin
GOARCH: arm64
NumCPU: …
-
Judging from this code, can multi-GPU, fastllm, and quantization not be used at the same time?
```
def get_model(args):
    if not args.cpu:
        if torch.cuda.is_available():
            device = f"cuda:{args.gpu}"
        elif torch.backends.mps.is_bui…
-
### Your current environment
Libraries installed:
```
"vllm==0.5.5",
"torch==2.4.0",
"transformers==4.44.2",
"ray",
"hf-transfer",
"huggingface_hub"
```
### How would you like to u…
-
**LocalAI version:**
v2.4.1
**Environment, CPU architecture, OS, and Version:**
MBP 14 M1 PRO
**Describe the bug**
Both `make build` and `make BUILD_TYPE=metal build` fail.
**To Reproduce…
-
Hi,
I'm trying to run rayllm following the tutorial in the README.
But my deployment seems to be stuck at the replica stage. It looks like this:
![image](https://github.com/ray-project/ray-llm/assets/101038773/…