-
Hi,
I have finished converting and exporting the model with SmoothQuant, but when I load the model with vLLM and run inference, I get the following error:
INFO 12-05 09:00:58 tok…
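For reference, the load-and-generate path in vLLM normally looks like the sketch below (the checkpoint path is a hypothetical placeholder; the SmoothQuant export has to be in a format vLLM recognizes, otherwise loading fails with an error like the one described above):
```python
from vllm import LLM, SamplingParams

# Load the SmoothQuant-exported checkpoint (path is hypothetical).
llm = LLM(model="./llama-7b-smoothquant")

sampling = SamplingParams(temperature=0.8, max_tokens=64)
outputs = llm.generate(["Hello, my name is"], sampling)
print(outputs[0].outputs[0].text)
```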
-
Quantized weights, scales, and metadata can be serialized into a state_dict that can later be reloaded and applied to a quantized model.
The process is a bit convoluted, as it requires the target mod…
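A minimal sketch of that flow in PyTorch (the `QuantLinear` module and its buffer names are illustrative assumptions, not from the source): int8 weights and their scales are registered as buffers so that an ordinary `state_dict` captures them, and reloading requires constructing the target module in its quantized form before the state_dict can be applied.
```python
import torch

class QuantLinear(torch.nn.Module):
    """Hypothetical quantized linear layer holding int8 weights plus scales."""
    def __init__(self, in_features, out_features):
        super().__init__()
        # Buffers (not parameters) so state_dict() captures them directly.
        self.register_buffer(
            "weight_int8",
            torch.zeros(out_features, in_features, dtype=torch.int8))
        self.register_buffer("scale", torch.ones(out_features, 1))

    def forward(self, x):
        # Naive reference path: dequantize on the fly.
        return x @ (self.weight_int8.float() * self.scale).t()

# Save: quantized weights, scales, and metadata land in a plain state_dict.
layer = QuantLinear(4, 8)
torch.save(layer.state_dict(), "quant_layer.pt")

# Reload: the target module must already exist in quantized form before the
# state_dict is applied -- this is the convoluted part.
restored = QuantLinear(4, 8)
restored.load_state_dict(torch.load("quant_layer.pt"))
```
Keeping the tensors as buffers rather than parameters keeps them out of the optimizer while still round-tripping through `state_dict()`.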
-
### Before submitting your bug report
- [X] I believe this is a bug. I'll try to join the [Continue Discord](https://discord.gg/NWtdYexhMs) for questions
- [X] I'm not able to find an [open issue](ht…
-
This worked in the Oct 15 jlama build:
```
$ ./run-cli.sh complete -p "def fib(" -t 0.2 -tc 24 -n 100 models/CodeLlama-7b-hf
```
Now it OOMs (note that I have doubled the default Xmx, which was not nece…
-
The context size given to llama.cpp to load this model is 4096, which requires around 10 GB of memory for the context alone; if we add the 4.5 GB required for the 7B model itself, it's infeasible to use it…
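For a rough sense of where that memory goes, here is a back-of-the-envelope KV-cache estimate (a sketch assuming LLaMA-7B dimensions and an fp32 cache; the remainder of the reported ~10 GB would be scratch and compute buffers, which vary by build):
```python
# KV cache: two tensors (K and V) per layer, each n_ctx x n_embd elements.
n_layers, n_embd, n_ctx = 32, 4096, 4096  # LLaMA-7B dimensions (assumed)
bytes_per_elem = 4                        # fp32 cache; fp16 would halve this

kv_bytes = 2 * n_layers * n_ctx * n_embd * bytes_per_elem
print(f"KV cache: {kv_bytes / 2**30:.1f} GiB")  # -> 4.0 GiB
```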
-
### Version
Visual Studio Code extension
### Operating System
Windows 10
### What happened?
`ENDPOINT=OPENROUTER`
`MODEL_NAME=anthropic/claude-3-opus`
I create a new app and it see…
-
## Description
In FIM mode, an extra space is added at the beginning of the first line if it ends with `\n`.
## How to repeat
```python
from transformers import AutoTokenizer, AutoModelForCausalL…
-
Things are changing at a breakneck pace. There is already a Llama 13B PyTorch model with 32k context. I figure it would be appropriate to ask for compatibility to be added to Kobold, when time permits…
-
### Which Cloudflare product does this pertain to?
Workers AI
### Existing documentation URL(s)
https://developers.cloudflare.com/workers-ai/models/llm/
### What changes are you suggesting?
Pleas…
-
# Weekly GitHub Trending! (2024/04/15 ~ 2024/04/22)
## Python trending: 11 repos
### [1Panel-dev](https://github.com/1Panel-dev) / [MaxKB](https://github.com/1Panel-dev/MaxKB)
💬 A knowledge base built on LLM large language models…