model-evaluation Search Results

1000+ results
for model-evaluation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

dailenson/One-DM #23

No find code evaluation for model(FID score)

Hello author, I am not able to find the code to confirm the evaluation metric you stated in the paper(FID: 15.73) with your checkpoint. I have tried the FID evaluation codes available online but the r…

LuongTuanAnh163002 updated 3 weeks ago
2
UppuluriKalyani/ML-Nexus #714

Feature Request: Model Evaluation and Benchmarking System

I propose adding a Model Evaluation and Benchmarking System to ML Nexus to help users assess their model performance on standardized datasets and compare it against benchmarked scores. This feature wo…

snehas-05 updated 3 weeks ago
1
ScandEval/ScandEval #556

[MODEL EVALUATION REQUEST] CohereForAI/aya-expanse-32b

### Model ID CohereForAI/aya-expanse-32b ### Model type Decoder model (e.g., GPT) ### Model languages - [X] Danish - [X] Swedish - [X] Norwegian (Bokmål or Nynorsk) - [X] Icelandic - [X] Faroese …

saattrupdan updated 1 week ago
8
jkli1998/DRM #4

About model evaluation

Hi, thanks for your great work! Now I'm trying to evaluate model on VG dataset but meet some problems. 1. Only one file named `vg_stage1_predcls.zip` in the provided link in [Evaluation](https://gith…

QiueY514 updated 2 months ago
3
aioz-ai/GCD #2

Pretrained model & Evaluation code

Thx for your amazing work! I also notice that you haven't provided the **Pretrained model & Evaluation code**. Is there any possible that you would upload them? Thanks again!

Da1yuqin updated 2 weeks ago
4
fgnt/tssep_data #1

Evaluation on pretrained model

Hi, While trying to run the evaluation on the pretrained model: https://github.com/fgnt/tssep_data/blob/master/egs/libri_css/README.md#steps-to-evaluate-a-pretrained-model I got this error on t…

nikifori updated 1 month ago
1
EleutherAI/lm-evaluation-harness #2511

mmlu_flan_cot_zeroshot breaks after running the generation t…

I tried running some CoT zeroshot evaluations, but they both failed. Am I doing something wrong? ### Command for mmlu_flan_cot_zeroshot ``` accelerate launch \ --multi_gpu \ --num_p…

HideLord updated 1 day ago
1
Wang-ML-Lab/multimodal-needle-in-a-haystack #2

Evaluation script for open-source models

Dear authors, thank you for the great work in long-context multi-model evaluation. In the code base, I only saw the code for Azure, OpenAI, Gemini, and Anthropic, could you also provide the evalu…

joyolee updated 1 month ago
1
confident-ai/deepeval #1139

Error while calculating Knowledge retention ; Evaluation LLM…

**Describe the bug** while running matrices **Knowledge retention**, getting error. I ensure that this is not all of the LLMTestcases. I am getting correct knowledge retention score for many inputs. …

jaysudhakaran updated 2 weeks ago
4
albertan017/LLM4Decompile #35

Error while deserializing header: MetadataIncompleteBuffer

I am trying to evaluate llm4decompile-6.7b-v1.5 using the methods you provided. The model weights were downloaded from the Hugging Face repository of the same name. However, I keep encountering an err…

blacksunfm updated 1 week ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for model-evaluation

1000+ results
for model-evaluation