-
- [ ] FP8 KV-cache
- [ ] KV-cache prefix reuse
- [ ] Grammar-constrained decoding speedup
- [ ] `torch.compile`-like speedups
- [ ] Simple one-liner `pip install`
- [ ] Multi-LoRA support (LoRAX-style)
…
-
Hi,
I was really impressed by SPHINX's capabilities.
However, is it possible to do in-context learning with it?
Something similar to your example for Multimodal LLaMA2 https://alpha-vllm.github.i…
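In case it helps make the question concrete: a hypothetical sketch of how a few-shot multimodal prompt could be assembled, interleaving (image, answer) demonstrations before the query image. The `<image:…>` placeholder, the function, and the file names are illustrative assumptions, not SPHINX's actual API.

```python
def build_icl_prompt(demos, instruction, query_image):
    """Interleave (image, answer) demonstrations before the query image."""
    parts = [f"<image:{img}> {instruction} {ans}" for img, ans in demos]
    parts.append(f"<image:{query_image}> {instruction}")  # query has no answer yet
    return "\n".join(parts)

prompt = build_icl_prompt(
    demos=[("cat.jpg", "A cat on a sofa."), ("dog.jpg", "A dog in a park.")],
    instruction="Describe the image.",
    query_image="query.jpg",
)
print(prompt)
```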
-
![image](https://github.com/yangjianxin1/Firefly/assets/57835580/29229f58-1897-4c71-aa61-355f846e2946)
The above error is raised when loading YeungNLP/firefly-llama2-13b.
-
```
Traceback (most recent call last):
  File "/home/hope/work/baby-llama2-chinese/eval_hope.py", line 67, in
    model.load_state_dict(state_dict, strict=False)
  File "/home/hope/miniconda3/envs/lla…
```
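For context, `load_state_dict(strict=False)` only suppresses the key-mismatch error; the keys it would otherwise complain about can still be inspected. A torch-free sketch of that key check, with plain dicts mapping parameter names to shapes standing in for real tensors:

```python
def diff_state_dicts(model_params, checkpoint):
    """Return (missing, unexpected) keys, as load_state_dict reports them."""
    missing = sorted(set(model_params) - set(checkpoint))      # model has, file lacks
    unexpected = sorted(set(checkpoint) - set(model_params))   # file has, model lacks
    return missing, unexpected

# Toy stand-ins; a real check would compare model.state_dict() to the loaded file.
model_params = {"embed.weight": (32000, 4096), "lm_head.weight": (32000, 4096)}
checkpoint = {"embed.weight": (32000, 4096), "rotary.inv_freq": (64,)}

missing, unexpected = diff_state_dicts(model_params, checkpoint)
print("missing:", missing)        # ['lm_head.weight']
print("unexpected:", unexpected)  # ['rotary.inv_freq']
```

With `strict=False`, parameters in `missing` silently keep their initial values, which is a common source of bad eval results after loading.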
-
Wondering what a decent calibration dataset size would be for quantizing a model; I am looking at a model like LLaMA2-7B. Any help is appreciated.
-
- Use different datasets for calibration (dummy, Pile, gsm8k, triviaqa, and so on)
- Use llama2-7b with different int8 quantization types
- Use alpha in the range (0, 1)
- Use lm-evaluation-harness to accu…
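To make the calibration step above concrete, here is a hedged sketch of symmetric absmax int8 calibration: the scale is derived from the calibration samples and then reused to quantize new values. The function names are illustrative, not tied to any particular quantization toolkit.

```python
def calibrate_scale(samples):
    """Symmetric int8 scale from the calibration set's absolute maximum."""
    absmax = max(abs(x) for x in samples)
    return absmax / 127.0

def quantize(x, scale):
    """Round to the nearest int8 step and clamp to [-128, 127]."""
    return max(-128, min(127, round(x / scale)))

def dequantize(q, scale):
    return q * scale

scale = calibrate_scale([100.0, -127.0, 50.0])   # absmax 127 -> scale 1.0
print(quantize(63.4, scale))                     # 63
print(quantize(300.0, scale))                    # 127: outliers are clamped
print(dequantize(quantize(63.4, scale), scale))  # 63.0
```

The clamping line is why calibration set size and coverage matter: values larger than anything seen during calibration saturate and lose information.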
-
If practical, the LLMs might be useful for a variety of tasks:
- Quality evaluation
- Data augmentation (including back-translation for low-resource languages)
- Use as a teacher model
As a fi…
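For the teacher-model use, a minimal sketch of the usual distillation objective: the student is trained to match the teacher's soft next-token distribution by minimizing KL divergence. The probabilities here are toy values, not real model outputs.

```python
import math

def kl_divergence(teacher, student):
    """KL(teacher || student) over aligned token probabilities."""
    return sum(t * math.log(t / s) for t, s in zip(teacher, student) if t > 0)

teacher = [0.7, 0.2, 0.1]   # soft labels from the teacher model
student = [0.5, 0.3, 0.2]   # current student predictions
print(kl_divergence(teacher, student))  # small positive value; 0 only if they match
```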
-
Trying to use perf_analyzer as follows to deploy LLaMA2-13B with Triton:
```
python scripts/launch_triton_server.py --world_size 2 --model_repo triton_model_repo
perf_analyzer -m ensemble -i grpc --shape…
```
-
I fine-tuned llama2 on the full dataset, ran gradient ascent on forget05, and then evaluated the unlearned model on forget05. Surprisingly, when I looked at the eval_log_forget.json file, all I could se…
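For reference, gradient ascent on the forget split is typically just the ordinary update with the gradient direction flipped, so a step that would normally reduce the loss instead increases it. A scalar toy sketch, assuming plain SGD (not the actual unlearning codebase):

```python
def sgd_step(w, grad, lr, ascend=False):
    """One SGD update; with ascend=True the step follows the gradient uphill."""
    return w + lr * grad if ascend else w - lr * grad

# Toy loss(w) = (w - 3)^2 stands in for the LM loss on the forget set.
w = 1.0
grad = 2 * (w - 3)                          # -4.0
print(sgd_step(w, grad, 0.1))               # 1.4: descent moves toward the minimum
print(sgd_step(w, grad, 0.1, ascend=True))  # 0.6: ascent moves away, i.e. "forgetting"
```

Because ascent deliberately degrades the model on the forget set, very poor (or degenerate) generations in the forget-set eval log are the expected outcome rather than a bug in the eval itself.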
-
Hi, I'm getting the error below when trying to start the vidur simulator on Ubuntu 20.04 in a Python 3.10 venv; I also tested with mamba.
INFO 07-09 16:17:21 config.py:21] trace_request_length_generator_deco…