-
SPMD training speed is normal with eight GPUs on a single machine, but the communication overhead increases rapidly in the multi-machine case.
The devices are:
GPU: A100 * 8 * 2 (eight A100s per machine, two machines)
SPMD strategy …
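The post does not say which SPMD stack is in use; as a point of reference, here is a minimal sketch assuming the PyTorch-native DeviceMesh API, laying the 2 × 8 topology out as a 2-D mesh so the heavy collectives stay on the fast intra-node links and only one mesh dimension crosses the slow inter-node network:
```
# Minimal sketch (assumes PyTorch >= 2.2).
# Launch with: torchrun --nnodes 2 --nproc_per_node 8 mesh_sketch.py
from torch.distributed.device_mesh import init_device_mesh

# Two machines x eight A100s each: keep the model-parallel dimension inside
# a node so only the data-parallel reduction crosses machines.
mesh = init_device_mesh("cuda", (2, 8), mesh_dim_names=("inter_node", "intra_node"))
print(mesh["intra_node"])  # sub-mesh of the 8 GPUs local to this machine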
-
I got approval from Meta, then I downloaded all the Meta Llama 2 models locally (I followed all the steps and everything was fine).
I tried to run the 7B model using this command: “torchrun --nproc_per_node 1 …
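For reference, the truncated command matches the example scripts in the meta llama repo; a minimal Python sketch of what example_text_completion.py does (paths below are placeholders, run it under torchrun as in the README) looks like this:
```
# Minimal sketch of the meta llama repo's text-completion entry point.
# Run under: torchrun --nproc_per_node 1 run_7b.py
from llama import Llama

generator = Llama.build(
    ckpt_dir="llama-2-7b/",            # directory holding the downloaded weights
    tokenizer_path="tokenizer.model",  # tokenizer file shipped with the weights
    max_seq_len=512,
    max_batch_size=4,
)
results = generator.text_completion(
    ["I believe the meaning of life is"],
    max_gen_len=64,
    temperature=0.6,
    top_p=0.9,
)
print(results[0]["generation"])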
-
https://github.com/michael-wzhu/Chinese-LlaMA2
-
Hi,
I would like to merge:
https://huggingface.co/Unbabel/TowerInstruct-7B-v0.1
with
https://huggingface.co/haoranxu/ALMA-7B-R
But the Tower one has 7 special tokens, hence a vocab size of 32007, and t…
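A minimal sketch of one way to reconcile the vocab sizes before merging, assuming both checkpoints load as standard Hugging Face LlamaForCausalLM models (the naive 50/50 average is just for illustration; tools like mergekit offer more principled schemes such as SLERP or TIES):
```
# Minimal sketch: pad ALMA's vocab (32000) to Tower's (32007) before merging.
from transformers import AutoModelForCausalLM

tower = AutoModelForCausalLM.from_pretrained("Unbabel/TowerInstruct-7B-v0.1")
alma = AutoModelForCausalLM.from_pretrained("haoranxu/ALMA-7B-R")

# Grow ALMA's input/output embedding matrices to 32007 rows; the 7 new rows
# are freshly initialized, the original 32000 are kept as-is.
alma.resize_token_embeddings(tower.get_input_embeddings().weight.shape[0])

# Naive 50/50 linear average of all parameters, purely for illustration.
merged = tower.state_dict()
for name, param in alma.state_dict().items():
    merged[name] = 0.5 * merged[name] + 0.5 * param
tower.load_state_dict(merged)
tower.save_pretrained("tower-alma-merged")  # hypothetical output directory
```
Note that the 7 rows added to ALMA are freshly initialized, so after averaging, Tower's special-token embeddings are diluted with noise; copying Tower's rows verbatim for those 7 ids is the safer choice.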
-
Hello! Is there a recommended configuration?
-
https://huggingface.co/docs/trl/main/en/lora_tuning_peft#finetuning-llama2-model
-
The current benchmark is a bit too simple; we need some practical grammars. Vocabularies other than the RWKV vocabulary should be benchmarked as well (a vocabulary-loading sketch follows the checklist).
- [ ] JSON
- [ ] *OT chains
- [ ] Llama2 vocabular…
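As a starting point, a minimal sketch (the model id is just the obvious candidate, not a fixed choice) for pulling one of the non-RWKV vocabularies to benchmark against:
```
# Minimal sketch: load an alternative vocabulary via the Hugging Face
# tokenizer (the Llama 2 repo is gated, so HF access must be approved).
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
vocab = tok.get_vocab()  # dict of token string -> token id
print(len(vocab))        # 32000 for Llama 2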
-
Hi,
I wanted to give this a try and installed ollama locally. I am able to use the ollama API at http://localhost:11434/api/generate with curl.
I then set `export OLLAMA_API_BASE=http://localhost:…
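For reference, the same call without curl, assuming the default port (the model name here is a placeholder for whatever was pulled):
```
# Minimal sketch: hit the local ollama generate endpoint directly.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "llama2", "prompt": "Why is the sky blue?", "stream": False},
)
print(resp.json()["response"])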
-
There is a bug where it keeps printing blank lines in a loop. I was not able to discover the reason; it only happens on some prompts. Here is an example in which it happens (7B):
```
./llama…
```
-
```
# find query FFN neurons activating attn neurons
curfile_ffn_score_dict = {}
for l_h_n_p, increase_score in cur_file_attn_neuron_list_sort[:30]:
    attn_layer, attn_head, attn_neuron, attn_pos = l…
```