-
Hi Simon, the responses from llama2 are being truncated. What is a good way for llm to handle this? See:
% llm -m l2c "give me 20 good names for avatars" --system "you are a creator"
Sure, here…
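One possible workaround, assuming the model plugin exposes a max_tokens generation option via -o (the options a plugin actually supports can be listed with `llm models list --options`; the option name and value here are assumptions):
% llm -m l2c -o max_tokens 1024 "give me 20 good names for avatars" --system "you are a creator"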
-
Hi,
Thanks very much for your work and for publishing your code. I am currently working on integrating SpinQuant into [torch/ao](https://github.com/pytorch/ao/pull/983/), and I would like to cla…
-
Thank you for your wonderful work!
Have you ever experimented with Llama2-7B as the model for C-RLFT? How was the performance? Because OpenChat-3.5-0106 is based on Mistral, performance is real…
-
Traceback (most recent call last):
File "/home/m00830934/code/LongRoPE/evolution/evaluate.py", line 110, in
main(args)
File "/home/m00830934/code/LongRoPE/evolution/evaluate.py", line 52, …
-
Hi all, thanks for this great inference framework. We enjoy the speedups it provides, but we are concerned about the high sampling variance.
Setting:
**Model**: llama2 70b model finetuned on…
-
Ollama makes it easy to run models such as llama2 locally on macOS:
https://ollama.ai/
The user runs a server on localhost, so the architecture of the plugin could likely follow the exist…
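As a rough sketch of what such a plugin would talk to (not the plugin itself), Ollama serves a simple HTTP API on localhost, port 11434 by default; the model name and prompt below are placeholders:

```python
# Minimal sketch: call a locally running Ollama server's generate endpoint.
# Assumes Ollama is running and the "llama2" model has already been pulled.
import json
import urllib.request

payload = {
    "model": "llama2",
    "prompt": "Say hello in one sentence.",
    "stream": False,  # ask for one JSON response instead of a stream
}
req = urllib.request.Request(
    "http://localhost:11434/api/generate",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["response"])
```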
-
Hello,
Thank you for your very interesting work! When I run the llama2 experiment with "cluster_activate" and "random_update", there is the following error when calculating the gradient. Could you…
-
👋 Hello Neural Magic community developers,
I encountered an issue while calculating the perplexity for a locally converted Llama3-8B sparse model using the llm-compressor library. I'm referring to the spars…
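For reference, a minimal sketch of a plain Hugging Face perplexity computation (this is not the llm-compressor evaluation path; the model path, text, and context length are placeholders) that can serve as a sanity check against the sparse model:

```python
# Sketch: next-token perplexity over non-overlapping chunks of a text.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "path/to/converted-llama3-8b-sparse"  # placeholder
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path, torch_dtype=torch.float16, device_map="auto"
)
model.eval()

text = "\n\n".join(["example document one", "example document two"])  # placeholder corpus
input_ids = tokenizer(text, return_tensors="pt").input_ids.to(model.device)

max_len = 2048  # evaluation context length (assumption)
nlls, n_tokens = [], 0
with torch.no_grad():
    for start in range(0, input_ids.size(1), max_len):
        chunk = input_ids[:, start : start + max_len]
        if chunk.size(1) < 2:
            continue  # nothing to predict in a 1-token chunk
        # labels == inputs: the model shifts them internally for next-token loss
        out = model(chunk, labels=chunk)
        n = chunk.size(1) - 1  # number of predicted tokens in this chunk
        nlls.append(out.loss * n)
        n_tokens += n

print(f"perplexity: {torch.exp(torch.stack(nlls).sum() / n_tokens).item():.2f}")
```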
-
I'm fine-tuning Llama3-8B on the C4 dataset (en subset) for 2000 steps using the `full_finetune_distributed` recipe. I find that the loss does not go down at all and the quantized accuracy is very low.…
-
I would like to build a chatbot with a long context. However, to avoid exceeding the model's context limit when the conversation gets too long, I want to be able to delete old messages. I would also lik…
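A minimal sketch of one way to do this, assuming an OpenAI-style list of {"role", "content"} messages and a placeholder token counter (swap in the real tokenizer for whichever model is used):

```python
# Sketch: keep system messages, drop the oldest turns until the history fits.
from typing import Dict, List

def count_tokens(text: str) -> int:
    # Placeholder: whitespace word count stands in for a real tokenizer.
    return len(text.split())

def trim_history(messages: List[Dict[str, str]], max_tokens: int) -> List[Dict[str, str]]:
    """Delete the oldest non-system messages until the total fits the budget."""
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]

    def total(msgs: List[Dict[str, str]]) -> int:
        return sum(count_tokens(m["content"]) for m in msgs)

    while rest and total(system + rest) > max_tokens:
        rest.pop(0)  # drop the oldest user/assistant turn first
    return system + rest
```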