-
### Is your feature request related to a problem?
There are more models in [LMSYS Chatbot Arena](https://huggingface.co/spaces/lmsys/chatbot-arena) / [HuggingChat](https://huggingf…
-
Thanks to the project team for the model; it is excellent, and I would therefore like to fine-tune it further for my own downstream use.
I ran into two problems while using it.
1> Model loading: the top of https://huggingface.co/FlagAlpha/Atom-7B-Chat mentions Atom-7B-32k-Chat. Does the model itself already support 32K? Can it simply be loaded as-is, without extra changes to files or parameters, and used at a 32k context length? (A quick way to check is sketched after these questions.)
2>…
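One quick way to check this (a minimal sketch on my side, assuming the checkpoint ships a standard Llama-style `transformers` config) is to inspect `max_position_embeddings`:
```
from transformers import AutoConfig

# Load only the config; trust_remote_code in case the repo ships custom code.
config = AutoConfig.from_pretrained("FlagAlpha/Atom-7B-Chat", trust_remote_code=True)

# If the checkpoint natively supports 32k, this should report 32768;
# a smaller value would mean extra RoPE-scaling parameters are needed.
print(config.max_position_embeddings)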
-
As of now, there is no way to modify RoPE Frequency Base and RoPE Frequency Scale.
We would need to edit `rope.cu` to support parameters for frequency and scale: https://github.com/turboderp/exlla…
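For reference, here is a minimal sketch (in Python rather than CUDA, with hypothetical parameter names `freq_base` / `freq_scale`) of where such parameters enter a rotary-embedding computation:
```
import torch

def rope_angles(head_dim, positions, freq_base=10000.0, freq_scale=1.0):
    """Rotation angles for rotary position embeddings.

    freq_base  -- base of the inverse-frequency series (10000 in the RoPE paper).
    freq_scale -- linear scaling applied to positions, as in linear RoPE scaling.
    """
    # Inverse frequencies: base^(-2i/d) for each channel pair.
    inv_freq = 1.0 / (freq_base ** (torch.arange(0, head_dim, 2).float() / head_dim))
    # Scaling positions down (freq_scale < 1) stretches the effective context.
    scaled_pos = positions.float() * freq_scale
    # One angle per (position, frequency) pair.
    return torch.outer(scaled_pos, inv_freq)

# e.g. freq_scale=0.25 maps 4x-longer sequences into the trained position range
angles = rope_angles(128, torch.arange(4096), freq_base=10000.0, freq_scale=0.25)
cos, sin = angles.cos(), angles.sin()  # applied to query/key channel pairs downstream
```
With `freq_scale < 1`, positions are compressed into the trained range, which is how linear RoPE scaling extends the usable context.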
-
There's a bug in attack_manager.py:
```
if self.conv_template.name == 'llama-2':
    self.conv_template.messages = []
    self.conv_template.append_message(self.conv_template.roles…
```
-
When I try to run the following command,
> `accelerate launch --num_processes=4 big_model_quantized_probing.py scripts/configs/probe_quantized_codellama-34b-4bit-unfreeze.yaml`
I got the followi…
-
We've recently broken logging and tracing to disk when run via `podman compose up`.
We are NOT writing to the shared directory which is accessible by both the container AND the host.
Below is a snipp…
-
### Feature request
Flash Attention 2 is a library that provides attention operation kernels for faster and more memory efficient inference and training: https://github.com/Dao-AILab/flash-attentio…
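For context, recent `transformers` releases expose this kind of integration through an `attn_implementation` argument; here is a minimal sketch (the model id is a placeholder, and it assumes the `flash-attn` package and a supported GPU):
```
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-hf"  # placeholder; any supported architecture

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,                 # FA2 kernels need fp16/bf16
    attn_implementation="flash_attention_2",   # requires the flash-attn package
    device_map="auto",
)

inputs = tokenizer("def fib(n):", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```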
-
### 🐛 Describe the bug
From the README, it's not very clear how to download different flavors/sizes of the models from HF, unless someone goes to the next section and finds the inventory list https://gi…
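As a workaround, here is a minimal sketch using `huggingface_hub` (the repo id below is a placeholder; substitute the flavor/size from the inventory list):
```
from huggingface_hub import snapshot_download

# Placeholder repo id -- substitute the flavor/size from the inventory list.
local_dir = snapshot_download(
    repo_id="meta-llama/CodeLlama-7b-hf",
    # Optionally restrict to one weight format to avoid duplicate downloads.
    allow_patterns=["*.json", "*.safetensors", "tokenizer*"],
)
print(local_dir)  # local path containing the downloaded files
```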
-
For me, no model is able to load anymore with versions higher than 2.x.
-
In the GMC e2e tests, some cases fail because of response timeouts:
tgi2.2.0 + meta-llama/CodeLlama-7b-hf on Xeon times out in the codegen test
tgi2.2.0 + meta-llama/CodeLlama-7b-hf on Gaudi is ok
s…