model-evaluation Search Results

1000+ results for model-evaluation


symflower/eval-dev-quality #237

Dump the assessments in the CSV files once they happen and n…

## TODO **1st iteration** - [x] Dump the assessments into the `evaluation.csv` every time a task is executed **2nd iteration** - [x] Create the other CSVs from the `evaluation.csv` - read…

ruiAzevedo19 updated 1 day ago
1
ScandEval/ScandEval #359

[MODEL EVALUATION REQUEST] Jamba

### Model ID ai21labs/Jamba-v0.1 ### Model type Decoder model (e.g., GPT) ### Model languages - [x] Danish - [x] Swedish - [x] Norwegian (Bokmål or Nynorsk) - [x] Icelandic - [x] Faroese - [x] Ge…

KennethEnevoldsen updated 2 months ago
1
RoyZhao926/CatVersion #3

model evaluation

I sincerely want to evaluate your model. How can I run it more simply?

YeLuoSuiYou updated 5 months ago
1
symflower/eval-dev-quality #205

Tool/command to combine multiple evaluations into one

If we run them on multiple machines, we want to easily combine them. ### TODO **1st iteration** - [x] Remove the `models-summed.csv` and `-summed.csv` files - [x] Remove all the evaluation CS…

bauersimon updated 8 hours ago
1
EleutherAI/lm-evaluation-harness #2076

Invalid modification to task YAML file

I am running the llama2 model on the wikitext dataset. I just want to try some other metrics, so I modified the default YAML file (`lm-evaluation-harness/lm_eval/tasks/wikitext/wikitext.yaml`) to the following, just…

duanhx1037 updated 1 day ago
2
OpenBMB/Eurus #2

Code for evaluation of Eurus Models

Can you add the code for reproducing the main results in the paper for various math and coding datasets, along with their prompts and the data splits used?

archiki updated 1 month ago
5
ScandEval/ScandEval #277

[MODEL EVALUATION REQUEST] Mamba-2.8b

### Model ID state-spaces/mamba-2.8b-hf ### Model type State space model (e.g., Mamba) ### Model languages - [x] Danish - [x] Swedish - [x] Norwegian (Bokmål or Nynorsk) - [x] Icelan…

saattrupdan updated 3 weeks ago
2
Mushroomcat9998/PaddleOCR #3

Paddle OCR Text detection Model Evaluation take too much tim…

I am training PaddleOCR for the Tamil language, with 70 images for training and 24 images for evaluation. After the 1st epoch, the model takes more time (more than 5 hrs) on Evaluat…

kokul93 updated 1 day ago
1
artc-dsc/Tasks #45

Takeover Statistical Models

### Latest Code: [model evaluation](https://github.com/artc-dsc/AI-FusionCast-Analysis/blob/dev/scripts/subprocess_model_evaluation.py) [model prediction](https://github.com/artc-dsc/AI-FusionCast-…

ZengyuCao-ARTC updated 2 weeks ago
1
mihdalal/planseqlearn #8

How can I get the evaluation results of the trained model?

Hi there! Thanks for publicizing such an awesome project! I would like to ask how I can get the results of model evaluation similar to the results shown in the paper? Because it seems like only th…

albzni updated 1 week ago
1

