-
size mismatch for model.layers.78.self_attn.k_proj.weight: copying a param with shape torch.Size([1024, 8192]) from checkpoint, the shape in current model is torch.Size([8192, 8192]).
…
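For context, a minimal sketch of how the two shapes can arise, assuming the checkpoint is Llama-2 70B with grouped-query attention (hidden_size 8192, 64 attention heads, 8 KV heads, head_dim 128) and the receiving model was built without `num_key_value_heads` set; the Hugging Face `LlamaConfig` usage here is an illustration, not taken from the report:

```python
# Sketch: k_proj.weight has shape [num_key_value_heads * head_dim, hidden_size].
# Llama-2 70B uses grouped-query attention (8 KV heads, head_dim 128), so the
# checkpoint stores k_proj as [8 * 128, 8192] = [1024, 8192]; a model built with
# num_key_value_heads == num_attention_heads (64) expects [64 * 128, 8192] = [8192, 8192].
from transformers import LlamaConfig, LlamaForCausalLM

cfg = LlamaConfig(
    hidden_size=8192,
    num_attention_heads=64,
    num_key_value_heads=8,   # grouped-query attention; leaving this at 64 reproduces the mismatch
    num_hidden_layers=1,     # one layer is enough to inspect the shape
    intermediate_size=28672,
)
model = LlamaForCausalLM(cfg)
print(model.model.layers[0].self_attn.k_proj.weight.shape)  # torch.Size([1024, 8192])
```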
-
Dear author, thank you for your work! I would like to know the performance of LLoVi on NExT-QA, NExT-GQA, and IntentQA when using the 7B Llama2 as the LLM.
For larger models like GPT-3.5 and GPT-4, they ar…
-
Hi,
I am trying to fine-tune a Llama2 model with sequence parallelism using Megatron-DS. Is there any documentation for this?
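A minimal sketch of one possible launch, assuming the DeepSpeed-Ulysses sequence-parallelism path in Megatron-DS; the flag names (in particular `--ds-sequence-parallel-size`), model sizes, and paths are assumptions and may not match the repository version in use:

```python
# Sketch (not official documentation): launch pretrain_gpt.py with an assumed
# sequence-parallel degree of 4, splitting each sequence across 4 GPUs.
import subprocess

cmd = [
    "deepspeed", "pretrain_gpt.py",
    "--tensor-model-parallel-size", "1",
    "--ds-sequence-parallel-size", "4",   # assumed flag for DeepSpeed-Ulysses sequence parallelism
    "--num-layers", "32",
    "--hidden-size", "4096",
    "--num-attention-heads", "32",
    "--seq-length", "4096",
    "--max-position-embeddings", "4096",
    "--micro-batch-size", "1",
    "--deepspeed", "--deepspeed_config", "ds_config.json",  # placeholder config path
]
subprocess.run(cmd, check=True)
```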
-
![image](https://github.com/microsoft/Megatron-DeepSpeed/assets/33349843/c1a12cf3-3a2e-496b-ba53-e652f2d773ee)
-
I am trying to use the model_name_or_path parameter in this project, but I am unsure where I can find the relevant model links or resources. Could you please provide some guidance on where to download…
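A minimal sketch of how `model_name_or_path` is commonly resolved, assuming the project follows the Hugging Face convention of accepting either a Hub model id or a local directory; the model id below is only an example, not necessarily the checkpoint this project expects:

```python
# Sketch: model_name_or_path as either a Hugging Face Hub id (downloaded on first use)
# or a local directory that already contains the weights and tokenizer files.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name_or_path = "meta-llama/Llama-2-7b-hf"   # or e.g. "/path/to/local/checkpoint"
tokenizer = AutoTokenizer.from_pretrained(model_name_or_path)
model = AutoModelForCausalLM.from_pretrained(model_name_or_path)
```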
-
I am following the steps (https://github.com/aws-neuron/aws-neuron-samples/blob/master/torch-neuronx/transformers-neuronx/inference/meta-llama-2-13b-sampling.ipynb) to run a Llama2 quantized model (ht…
-
In the paper _Understanding Code Changes Practically with Small-Scale Language Models_ (ASE 2024) and the presentation _Application Practice of Ant CodeFuse: Code Change Understanding Techniques in Application Environments_ (ChinaSoft2024), you mention the dataset HQCM, …
-
I was following the Llama2 7B guide; the consensus was not enough RAM, among other issues.
I then tried the stories110M guide, which worked all the way until I went to test it.
I vaguely remember lm_eval not being installed (its…
-
Llama2 (and Llama-based models) time out. Other chat models (Mistral and Mixtral tested) respond fine. Below is a snippet of the Docker container log captured when the request is sent from the Refact exte…