mmlu Search Results - Githubissues

1000+ results
for mmlu

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Lightning-AI/litgpt #1266

Problem when evaluating finetune model using adapter_v2

Hi, I just notice in the finetune with adapter_v2, we're saving the final model with the name `lit_model.pth.adapter_v2` ``` # Save the final Adapter checkpoint at the end of training sav…

TonAnh updated 6 months ago
6
artidoro/qlora #69

Model finished training, but adapter_model.bin is empty?

I started the training using: ``` python qlora.py \ --model_name_or_path /home/nap/llm_models/llamaOG-65B-hf/ \ --output_dir ./output \ --dataset alpaca \ --do_train True \ …

disarmyouwitha updated 1 year ago
4
vllm-project/vllm #5067

[Bug]: The VRAM usage of calculating log_probs is not consid…

### Your current environment ```text The output of `python collect_env.py` Collecting environment information... PyTorch version: 2.3.0+cu121 Is debug build: False CUDA used to build PyTorch: …

Conless updated 1 week ago
10
jiphyeonjeon/season3 #24

Training Compute-Optimal Large Language Models

## 집현전 최신반 스터디 - 2022년 5월 15일 일요일 10시 - 진명훈님 전재영님 박동주님 발표 - 논문 링크: https://arxiv.org/abs/2203.15556 > ### Abstract > We investigate the optimal model size and number of tokens for training a tr…

jinmang2 updated 2 years ago
1
huggingface/candle #2031

The output diverges in comparison to the Python implementati…

I've noticed that the generation diverges after some tokens in comparison to the HF implementation. Is this expected? Here's how to reproduce: **Transformers** ```python import torch from tra…

hugoabonizio updated 7 months ago
5
EleutherAI/lm-evaluation-harness #1340

NAN value for truthfulqa_mc2 on full finetuned model TinyLla…

I checked this [issue](https://github.com/EleutherAI/lm-evaluation-harness/issues/714#top) has similar problem I have, however using the latest main branch doesn't solve the problem! ## Model: - F…

hahmad2008 updated 9 months ago
10
TIGER-AI-Lab/MMLU-Pro #26

regarding leaderboard submission

Hello, I have a set of pretrained models, and I plan to evaluate them on the MMLU-Pro benchmark without any additional training loccaly, selecting the best-performing model for submission. Is this app…

sorobedio updated 1 month ago
1
EleutherAI/lm-evaluation-harness #2318

Evaluation of MMLU tasks using the OpenAI API

"Hello, I'm trying to evaluate the GPT-4o model using the MMLU dataset, but I'm encountering an error. Could you advise me on how to proceed?" "This is the command I used: lm_eval --model openai…

Laplace888 updated 1 month ago
3
ChatGPTNextWeb/ChatGPT-Next-Web #4030

[Feature] Plans to add model provider support

There have been many discussions in the community regarding support for multiple models. - ChatGPTNextWeb#3484 - ChatGPTNextWeb#3923 - ChatGPTNextWeb#960 - ChatGPTNextWeb#3431 - ChatGPTNextWeb#…

fred-bf updated 2 months ago
20
open-compass/opencompass #819

[Bug] Partition tasks sometimes fail due to occupied ports

Hi, thanks for sharing this great open-source project! When using multiple GPUs for evaluation, I found partition tasks sometimes will fail due to occupied ports. ### Prerequisite - [X] I have s…

sdc17 updated 6 months ago
1

上一页 1...21 22 23 24 25 26 27...100 下一页

1000+ results for mmlu

1000+ results
for mmlu