-
I see that Runpod has a serverless option. Rather than stopping and starting these instances, is it possible to run these models serverless? It looks like you can modify TheBloke's Dockerfile and conf…
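For reference, a minimal sketch of what a RunPod serverless worker might look like, assuming the `runpod` Python SDK and a GGUF model served via llama-cpp-python; the model path and generation parameters are placeholders, not TheBloke's actual setup:

```python
# Hypothetical RunPod serverless worker sketch (not TheBloke's image).
# Assumes the `runpod` SDK and llama-cpp-python are installed in the container.
import runpod
from llama_cpp import Llama

# Load once at container start so warm invocations reuse the model.
llm = Llama(model_path="/models/codellama-34b-instruct.Q4_K_M.gguf")

def handler(job):
    """Take a prompt from the serverless request and return a completion."""
    prompt = job["input"]["prompt"]
    out = llm(prompt, max_tokens=256)
    return {"text": out["choices"][0]["text"]}

runpod.serverless.start({"handler": handler})
```

The Dockerfile change would then amount to installing these dependencies and making this script the container entrypoint.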
-
Hi,
Thanks for open-sourcing this!
How did you overcome the catastrophic forgetting problem in LoRA finetuning?
Performance on the HumanEval dataset dropped a lot after LoRA finetuning on my own …
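Not this repo's recipe, but a common mitigation is a conservative LoRA configuration (low rank, few target modules, small learning rate) plus replaying some general-domain data; a sketch with the `peft` library, where every hyperparameter value is an illustrative assumption:

```python
# Illustrative conservative LoRA setup intended to limit catastrophic
# forgetting; ranks, target modules, and LR are assumptions, not values
# taken from this repository.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("codellama/CodeLlama-7b-hf")

config = LoraConfig(
    r=8,                                  # low rank: fewer trainable params, less drift
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # touch only attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()
# During training: keep the learning rate small (e.g. 1e-4) and mix a
# fraction of general-purpose data into the finetuning set as replay.
```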
-
Does anyone else have the problem that the CPU load does not decrease after a chat request?
I'm using [CodeLlama-34B-Instruct-GGUF](https://huggingface.co/TheBloke/CodeLlama-34B-Instruct-GGUF/blob…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
## QWEN2-1.5B(0.5B)
Works fine.
## QWEN2-7B(MoE)
Requires bf16 (#4278).
Works fine.
## QWEN2-72B
Works fine, with a minor issue: it can only be launched on 8 GPUs (s…
-
Do you have plans to support other LLM models like Llama 3?
Or would it be easy to modify the code that implements the interface to OpenAI? I would like an interface using Ollama.
Any hints would be appreciated…
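One possible route, since Ollama exposes an OpenAI-compatible endpoint: point the standard `openai` client at the local Ollama server. A sketch, assuming `ollama serve` is running and a `llama3` model has been pulled:

```python
# Sketch: talk to a local Ollama server through its OpenAI-compatible API.
# Assumes `ollama serve` is running and `ollama pull llama3` has been done.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:11434/v1",  # Ollama's OpenAI-compatible endpoint
    api_key="ollama",                      # required by the client, ignored by Ollama
)

resp = client.chat.completions.create(
    model="llama3",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(resp.choices[0].message.content)
```

If the project already speaks the OpenAI API, swapping the base URL like this may be all the modification needed.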
-
### Before submitting your bug report
- [X] I believe this is a bug. I'll try to join the [Continue Discord](https://discord.gg/NWtdYexhMs) for questions
- [ ] I'm not able to find an [open issue](ht…
-
### Summary
# Motivation
WasmEdge is a lightweight inference runtime for AI and LLM applications. We want to build specialized and finetuned models for the WasmEdge community. The model should be supported by Wa…
-
**Please describe the feature you want**
I've been using a large completion model with my GPU. I'd like to add a chat model as well, but there's not enough GPU memory for the large completion model…
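One possible stopgap while such a feature is missing (a sketch, not something this project necessarily supports) is to keep the completion model on the GPU and run the chat model entirely on CPU, e.g. with llama-cpp-python and `n_gpu_layers=0`; the model path here is a placeholder:

```python
# Sketch: run the chat model CPU-only so it does not compete with the
# GPU-resident completion model. Path and context size are placeholders.
from llama_cpp import Llama

chat_llm = Llama(
    model_path="/models/chat-model.Q4_K_M.gguf",
    n_gpu_layers=0,   # keep every layer on the CPU
    n_ctx=4096,
)

out = chat_llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain this function to me."}]
)
print(out["choices"][0]["message"]["content"])
```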
-
### Your current environment
How do I get vLLM to serve CodeLlama-34B in the OpenAI format?
I run TheBloke/CodeLlama-34B-Instruct-AWQ in vLLM, but it shows 'No chat template provided. Chat API will n…
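That warning usually means the model's tokenizer config ships without a chat template, so one has to be passed to the server explicitly. A sketch, assuming the server was launched with vLLM's `--chat-template` flag pointing at a CodeLlama-Instruct Jinja template (the template file name is a placeholder):

```python
# Sketch: query a vLLM OpenAI-compatible server launched with an explicit
# chat template, e.g.:
#   python -m vllm.entrypoints.openai.api_server \
#     --model TheBloke/CodeLlama-34B-Instruct-AWQ \
#     --quantization awq \
#     --chat-template ./codellama_instruct.jinja   # placeholder template file
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")
resp = client.chat.completions.create(
    model="TheBloke/CodeLlama-34B-Instruct-AWQ",
    messages=[{"role": "user", "content": "Write a function that checks for palindromes."}],
)
print(resp.choices[0].message.content)
```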
-
### System Info
- CPU architecture (x86_64)
- CPU/Host memory size (64GB)
- GPU properties
  - GPU name (1x NVIDIA V100)
  - GPU memory size (32GB)
- Libraries
  - TensorRT-LLM branch or tag …