-
I tried to do it on a VPS, but it doesn't work.
Should I consider updating the source code?
Appreciate your help.
-
Hi there, I am able to download the model from HF using `VideoLlavaForConditionalGeneration.from_pretrained` and optimize it using `ipex-llm.optimize_model()`, but the process fails on `generate()` with …
-
T2V is planned to enable inference for models like Stable Diffusion on CPU/GPU and training on Habana Gaudi/DG2, as well as to improve generated-video quality, e.g. more realistic frames and better coherency…
-
### Your current environment
Collecting environment information...
PyTorch version: N/A
Is debug build: N/A
CUDA used to build PyTorch: N/A
ROCM used to build PyTorch: N/A
OS: Debian GNU/Lin…
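A "PyTorch version: N/A" line in this environment dump usually means `torch` could not be imported at all in that Python environment. As a minimal sketch (the helper name is my own, not from the report), one quick way to confirm:

```python
import importlib.util

def is_installed(pkg: str) -> bool:
    # find_spec returns None when the package cannot be located on sys.path
    return importlib.util.find_spec(pkg) is not None

print(is_installed("torch"))
```

If this prints `False`, the collect-env script will report N/A for every PyTorch field, and the fix is an install/environment problem rather than a library bug.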
-
I only have a 16 GB graphics card, so I used the CPU to run it. My code is:
```python
import torch
from PIL import Image
from lavis.models import load_model_and_preprocess

device = "cpu"
raw_ima…
```
-
Hello!
Does TensorRT-LLM support Medusa with Mixtral 8x7B?
My understanding is that right now the Medusa [convert_checkpoint.py](https://github.com/NVIDIA/TensorRT-LLM/blob/main/examples/medusa/c…
-
Hi,
I am using `Python 3.9` and `CUDA 12.2`. I installed the required packages listed in the README. I also changed `model_name_or_path` to `--model_name_or_path google/flan-t5-xl \` and `C…
-
### What is the issue?
**I got this error:**
```
root@bccf6f1eb00f:/data/models# ollama create gte_qwen2:7b -f Modelfile
transferring model data
Error: invalid file magic
```
**This is my Modelfile:**
…
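"invalid file magic" from `ollama create` generally indicates that the model file referenced in the Modelfile is not actually in GGUF format (e.g. it is a `.safetensors` or raw PyTorch checkpoint instead). A GGUF file begins with the four ASCII bytes `GGUF`; a minimal sketch to check that (the function name is my own):

```python
def has_gguf_magic(path: str) -> bool:
    # GGUF files begin with the 4-byte magic b"GGUF"
    with open(path, "rb") as f:
        return f.read(4) == b"GGUF"
```

If this returns `False` for the file named in the Modelfile's `FROM` line, the importer is likely to reject it with this error, and the file needs to be converted to GGUF first.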
-
**Describe the current behavior**
I'm using a T4 runtime to do some work with LLMs, and after a few hours the runtime just says "Connecting" and the bottom of the screen says "waiting to finish th…
-
On a powerful GPU like the 4090, is it normal for a single generation with a 7B model to take about 40 seconds?
That seems too slow.
model:
```python
MODEL_ID = "TheBloke/Llama-2-7b-Chat-GGUF"
MODEL_BASENAME = "llam…
```
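For scale: 40 s per generation implies single-digit tokens per second, which is characteristic of CPU-only inference; with the GGUF layers actually offloaded to the 4090, a 7B model typically runs much faster. A quick wall-clock throughput calculation (the token count below is a made-up example, not from the report):

```python
def tokens_per_second(n_tokens: int, seconds: float) -> float:
    # Simple wall-clock throughput: generated tokens divided by elapsed time.
    return n_tokens / seconds

# Hypothetical: a 256-token completion taking 40 s
print(tokens_per_second(256, 40.0))  # 6.4
```

Comparing this number against the speed reported by the backend's own logs helps confirm whether generation is running on the GPU at all.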