mmlu Search Results - Githubissues

1000+ results
for mmlu

AkihikoWatanabe/paper_notes #1406

To CoT or not to CoT? Chain-of-thought helps mainly on math …

# URL - https://arxiv.org/abs/2409.12183 # Affiliations - Zayne Sprague, N/A - Fangcong Yin, N/A - Juan Diego Rodriguez, N/A - Dongwei Jiang, N/A - Manya Wadhwa, N/A - Prasann Singhal, N/A…

AkihikoWatanabe updated 1 month ago
1
punith300i/nlp-vlm-project #11

Evaluation Framework

- [ ] Shortlist metrics - [ ] Shortlist datasets - eval scripts available - [ ] Write Evaluation script

zvovov updated 11 months ago
2
google-research/FLAN #83

Could you share the training loss to improve reproducibility…

Hi, thanks for sharing the datasets! I'm trying to train a FLAN model using T5 and other backbone models. However, I'm not confident about how well I reproduced your results. Specifically I got muc…

xuanqing94 updated 1 year ago
4
CarperAI/trlx #601

OOM error with PEFT LoRA on Llama2-7B

### 🐛 Describe the bug I'm trying to finetune Llama2-7B (to reproduce the experiments in a paper) using PEFT LoRA (0.124% of trainable params). However, this results in an out-of-memory (OOM) error o…

arpaiva updated 1 month ago
1
huggingface/alignment-handbook #120

(QLoRA) DPO without previous SFT

Because of the following LLM-Leaderboard measurements, I want to perform QLoRA DPO without previous QLoRA SFT: ``` alignment-handbook/zephyr-7b-dpo-qlora: +Average: 63.51; +ARC 63.65; +HSwag …

DavidFarago updated 9 months ago
1
BunsenFeng/Knowledge_Card #1

reproduce scores

Hi, Thanks for the interesting paper. I was wondering how to reproduce the scores reported in Table 1. I would appreciate it very much if you could provide the evaluation script. Many thanks.

tigerchen52 updated 3 months ago
1
artidoro/qlora #264

Could not reproduce the results listed in your paper using a…

**Details:** **Here is your result :** I used the following commands to reproduce the results of using the LLaMA 7B model on the Guanaco (OASST1) dataset: **CUDA_VISIBLE_DEVICES=2 sh scripts/…

LiZhangMing updated 5 months ago
6
EleutherAI/lm-evaluation-harness #2234

Cannot load local `mmlu` dataset

I want to load the local `mmlu` dataset. I have already loaded it from [hails/mmlu_no_train](https://huggingface.co/datasets/hails/mmlu_no_train), ![image](https://github.com/user-attachments/assets/3…

AIR-hl updated 1 day ago
3
Joshua-Stapleton/smartgpt-answers #2

Format

PDF isn't really a great format for sharing purposes. If you added a separate folder with .txt or some other simpler-to-parse format I think that'd help anyone trying to use the data. Cheers.

arthurwolf updated 1 year ago
4
jxiw/MambaInLlama #11

Why doesn’t kl_div ignore -100 in pseudo labels?

The original code looks like this: `kl_loss = F.kl_div(F.log_softmax(student_logits, dim=-1), targets, reduction='batchmean')` Although the relative loss curve is the…

yynil updated 1 month ago
7
