stanford-crfm / helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image models in Holistic Evaluation of Text-to-Image Models (HEIM) (https://arxiv.org/abs/2311.04287).

https://crfm.stanford.edu/helm

Apache License 2.0

1.76k stars, 234 forks

issues

Set lower_is_better to false for AIR-Bench scores

#2788 yifanmai opened 3 hours ago
0
Add landing page for ThaiExam

#2787 yifanmai opened 20 hours ago
0
Release Lite v1.5.0

#2786 yifanmai closed 1 day ago
0
The paper is inaccessible...

#2785 zhimin-z opened 1 day ago
0
Add AI Singapore to partners list

#2784 yifanmai closed 1 day ago
0
Sort by correctness functionality on Predictions page in front end

#2783 farzaank opened 4 days ago
0
Fix incorrect client args for SambaLingo-Thai-Chat-70B

#2782 yifanmai closed 5 days ago
0
Fix alpha sort before proper sort on leaderboard tables

#2781 farzaank opened 5 days ago
0
Add MMLU v1.5.0 to project metadata

#2780 yifanmai closed 5 days ago
0
Are we able to feed HELM a table of LLM input/output instead of connecting to a model?

#2779 mzahorec closed 6 days ago
2
BHASA syntax prototyping

#2778 yifanmai opened 6 days ago
0
Build frontend

#2777 github-actions[bot] opened 1 week ago
0
Update image2struct schema

#2776 teetone closed 1 week ago
0
Add local tokenizer for AI21

#2775 yifanmai opened 1 week ago
0
Fix typing error in UITVSFCScenario

#2774 yifanmai closed 1 week ago
1
Add TabFact to HELM Tables run entries and schema

#2773 yifanmai closed 1 week ago
0
Improve leaderboards grid on mobile screens

#2772 yifanmai opened 1 week ago
0
Add AI Singapore to partners list

#2771 yifanmai closed 1 day ago
0
Fix backports.zoneinfo and pkg_resources breakages in main

#2770 yifanmai closed 1 week ago
0
Update requirements.txt

#2769 github-actions[bot] closed 1 week ago
0
Add Typhoon v1.5x models

#2768 yifanmai closed 1 week ago
0
Fix dependabot alerts

#2767 yifanmai closed 1 week ago
0
Add AI21 Jamba Instruct

#2766 yifanmai closed 1 week ago
0
Update requirements.txt

#2765 github-actions[bot] closed 1 week ago
0
Chart runs out of colors when there are too many organizations

#2764 yifanmai opened 1 week ago
0
Add Claude 3.5 Sonnet (20240620)

#2763 yifanmai closed 1 week ago
0
Add multi-GPU support to HuggingFaceClient

#2762 yifanmai closed 1 week ago
0
Llama3 openai on azure

#2761 abhay-shete closed 2 weeks ago
1
Fix lint error

#2760 yifanmai closed 2 weeks ago
0
Release Lite v1.4.0

#2759 yifanmai closed 2 weeks ago
0
Add SambaLingo-Thai-Base-70B and SambaLingo-Thai-Chat-70B

#2758 yifanmai closed 2 weeks ago
0
Fix bugs with sambalingo and openthaigpt models

#2757 yifanmai closed 2 weeks ago
0
How can I get efficiency metric for my huggingface local model? (llama2)

#2756 Mr-lonely0 closed 2 weeks ago
2
Bump the pip group across 1 directory with 7 updates

#2755 dependabot[bot] closed 1 week ago
1
Bump the pip group across 1 directory with 6 updates

#2754 dependabot[bot] closed 2 weeks ago
1
Bump the npm_and_yarn group across 1 directory with 2 updates

#2753 dependabot[bot] closed 2 weeks ago
0
Yi-large Model deployment not found / Qwen2-72b-instruct tokenizer issues

#2752 Joey-Ji closed 1 week ago
8
Release v0.5.2

#2751 yifanmai closed 2 weeks ago
0
Move source dataset files for entity_data_imputation to GCS

#2750 yifanmai closed 2 weeks ago
0
Add "Answer only the last question" instruction using run expander

#2749 yifanmai closed 2 weeks ago
0
Build frontend

#2748 github-actions[bot] closed 2 weeks ago
0
Add SambaLingo Thai models

#2747 yifanmai closed 2 weeks ago
0
Change frontend project ID from image2structure to image2struct

#2746 yifanmai closed 2 weeks ago
0
Add more Typhoon models

#2745 yifanmai closed 2 weeks ago
0
Add SeaLLM models

#2744 yifanmai closed 2 weeks ago
0
Add OpenThaiGPT models

#2743 yifanmai closed 2 weeks ago
0
Merging Source into Fork

#2742 jackhauGR closed 2 weeks ago
0
Redo: Add MMLU v1.4.0 to project_metadata.json

#2741 yifanmai closed 3 weeks ago
0
Add MMLU v1.4.0 to project_metadata.json

#2740 yifanmai closed 3 weeks ago
0
Change URL from image2structure to image2struct

#2739 yifanmai closed 3 weeks ago
0