-
Hi Zechun,
Great work!
Could you please share the details of the evaluation code, e.g., which codebase was used to run inference?
Thank you,
Kalyani
-
Code evaluation tasks/benchmarks such as HumanEval and MBPP are missing from **lm-evaluation-harness**, but they are present and maintained in **bigcode-evaluation-harness**.
https://github.com/bigcode-pr…
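For reference, a HumanEval run with that harness looks roughly like the sketch below; the model name is just a placeholder and the flag names follow the harness README, so verify them against the current docs.
```bash
# Rough sketch, run from the root of a bigcode-evaluation-harness checkout.
# "bigcode/starcoderbase-1b" is only a placeholder model; flags are taken from
# the harness README and should be double-checked against the current version.
accelerate launch main.py \
  --model bigcode/starcoderbase-1b \
  --tasks humaneval \
  --n_samples 20 \
  --temperature 0.2 \
  --do_sample True \
  --batch_size 10 \
  --allow_code_execution
```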
-
### User story
As an evaluator, in order to provide a fair evaluation of the submissions that public solvers have submitted, I would like to be able to indicate if I am not able to evaluate the submission I …
-
Hello authors, thank you for your great work! I was wondering when the Evaluation Pipeline for BGE-EN-ICL will be released.
-
I have downloaded the pretrained models; how do I evaluate on PeopleSnapshot? I have tried running this command: `python eval.py --cfg exps/snapshot_"$SCENE".yaml --type view`. However, the metrics are …
-
Is there an easy way to visualize the result?
-
Unity, from the Helmholtz AAI, is the OIDC AAI tool that we are evaluating.
Carmen Scheuner has an open ticket at https://support.hifis.net/#ticket/zoom/7628 (Helmholtz support - Unity providers).
…
-
The MOT merely states that the model is pending evaluation, without giving any information about what this means or what it would take to change it.
This actually appears to be rooted in the fact th…
-
We will need to test our models against common, industry-standard benchmarks. EleutherAI's lm-evaluation-harness (used, for example, for the Pythia evaluations) is what everyone uses today:
https://github.com/EleutherAI/lm-evaluation-harness
The process will involve:
-…
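For reference, a typical lm-evaluation-harness run looks roughly like the sketch below; the model and task names are placeholders, and the flags follow the harness README, so check them against the current docs.
```bash
# Rough sketch of an lm-evaluation-harness run; install the harness first
# (e.g. `pip install lm-eval` or an editable install from the repo).
# Model and tasks below are placeholders, not our own checkpoints.
lm_eval \
  --model hf \
  --model_args pretrained=EleutherAI/pythia-160m \
  --tasks lambada_openai,hellaswag \
  --device cuda:0 \
  --batch_size 8
```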
-
Currently, if you want to train and evaluate a model, you run the following script:
```bash
#!/bin/bash
set -e # Exit immediately if a command exits with a non-zero status.
source $(conda info --…