-
### Describe the feature
Why does the dataset you provide have so many versions? Has the content been updated, or is there another reason? How do I decide which version to use when evaluating a model? Thanks
![image](https://github.com/user-attachments/assets/e1107a05-5add-4a63-8680-7e8e3496720d)
### Would you like to implement this feature yourself?
- [ ] I would like to implement this feature myself…
-
In the `compute_accuracy` function in eval_mmlu.py, line 86 reads `if pred_answer is None: return 1`. However, if `pred_answer` is None, shouldn't the function return 0 ins…
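The suggested fix can be sketched as follows. This is a minimal illustration, not the actual `compute_accuracy` implementation; the helper name and the `gold_answer` parameter are hypothetical.

```python
def score_prediction(pred_answer, gold_answer):
    """Score a single prediction: 1 for a match, 0 otherwise.

    A prediction that could not be parsed (pred_answer is None)
    cannot be correct, so it should count as 0 rather than 1.
    """
    if pred_answer is None:
        return 0  # unparseable prediction counts as wrong
    return 1 if pred_answer == gold_answer else 0
```

With this change, failed answer extraction lowers accuracy instead of inflating it.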
-
**❗BEFORE YOU BEGIN❗**
Are you on discord? 🤗 We'd love to have you asking questions on discord instead: https://discord.com/invite/a3K9c8GRGt
**Describe the bug**
I have followed the page of "htt…
-
command:
```
accelerate launch run_evals_accelerate.py --model_args="Llama-2-7b-chat-hf-8bit,quantization_config="load_in_8bit=True"" --tasks "helm|hellaswag|1|0" -- --output_dir ./evalscratch
```
Resul…
-
Hi guys, thanks for sharing this high-quality, hackable codebase!
I was just wondering how a 7B LLaMA trained on 1T DCLM tokens can achieve a 60+ MMLU score.
To my best knowledge, it should consume enough FLOPs (like …
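The compute budget the question alludes to can be roughed out with the common 6·N·D approximation (an assumption on my part; the thread does not state which rule it uses):

```python
# Rough training-compute estimate using the common 6 * N * D
# approximation (~6 FLOPs per parameter per training token).
# Numbers below are the ones from the question: a 7B-parameter
# model trained on 1T tokens.

def training_flops(n_params: float, n_tokens: float) -> float:
    """Approximate total training FLOPs via the 6*N*D rule of thumb."""
    return 6.0 * n_params * n_tokens

flops = training_flops(7e9, 1e12)
print(f"{flops:.2e}")  # roughly 4.2e22 FLOPs
```

That puts the 7B/1T run at about 4.2e22 training FLOPs, which is the scale the question is asking about.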
-
Hi,
1.
When I use the command on 8 gpus:
```
python3 qalora.py --model_path $llama_7b_4bit_g32
```
it will show the error:
```
File "/home/shawn/anaconda3/envs/qalora/lib/python3.8/site-pa…
-
### System Info
GPUs: A100, 4 GPUs (40 GB memory)
Release: tensorrt-llm 0.9.0
### Who can help?
@Tracin
### Information
- [X] The official example scripts
- [ ] My own modified scrip…
-
Currently it seems that to run MMLU with the `lighteval` suite, one needs to specify all the subsets individually as is done for leaderboard task set [here](https://github.com/huggingface/lighteval/bl…
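Until the suite exposes an aggregate MMLU task, the subset list could be generated programmatically rather than typed by hand. This is a hypothetical sketch: the subset names shown and the `suite|task|fewshot|truncate` string format are assumptions based on how the leaderboard task set is written, not a documented lighteval API.

```python
# Hypothetical helper: build a comma-separated lighteval --tasks string
# covering every MMLU subset instead of listing them individually.
# Subset names and the "suite|task|fewshot|truncate" format are
# assumptions modeled on the leaderboard task set.

MMLU_SUBSETS = [
    "abstract_algebra",
    "anatomy",
    "astronomy",
    # ... remaining MMLU subsets ...
]

def mmlu_task_string(suite: str = "leaderboard", few_shot: int = 5) -> str:
    """Join one task spec per subset into a single --tasks argument."""
    return ",".join(
        f"{suite}|mmlu:{subset}|{few_shot}|0" for subset in MMLU_SUBSETS
    )

print(mmlu_task_string())
```

The resulting string can then be passed as the `--tasks` argument in one go.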
-
Hi, reading the [QLoRA paper](https://arxiv.org/pdf/2305.14314.pdf), you report results on the MMLU test set in Table 5:
![image](https://github.com/artidoro/qlora/assets/44957968/cffd7c…
-
### Describe the feature
Please share the code logic for configuring the mmlu_pro dataset~
### Would you like to implement this feature yourself?
- [ ] I would like to implement this feature and contribute the code to OpenCompass!