-
I was wondering how the trained models are intended to be evaluated. I don't believe that the paper states how many samples were used to compute the metrics. The code appears to give some indication b…
-
### Describe the feature
Dear OpenCompass Team,
I've encountered a challenge with OpenCompass when trying to evaluate a custom model that I developed. Currently, it seems that any action I want to…
-
Excellent work! I'm writing to inquire about the possibility of adding support for multi-GPU evaluation to your evaluation framework. Currently, it seems that the existing evaluations are only designed…
-
Hi! We tried evaluating the base models using the starting kit evaluation pipeline. Here are some points/issues:
1. For phi2 and llama models, we are getting 'prediction not found' error.
2. Could …
-
There appears to be an issue with the `state-spaces/transformerpp-2.7b` model (in the `mamba` family of models) which causes a problem when generating (`Running generate_until requests`). This doesn't…
-
# Training, evaluating, and interpreting topic models | Julia Silge
At the beginning of this year, I wrote a blog post about how to get started with the stm and tidytext packages for topic modeling. …
-
There are now so many models on HF that it would be useful to understand how they perform on specific tasks or languages.
Lately I have been trying to use https://github.com/EleutherAI/lm-evaluation…
-
Hi! I've started working on my own QG algorithm for my Master's thesis, and I'm trying to learn how to evaluate a model.
Since you've posted your metrics, I've been trying to replicate them, but I a…
-
**Summary**
We used ChemSampler for a first round of generation of dCA analogues, which were evaluated by docking scores only. We still want greater compound diversity, so we have been working …
-
I have trained a model using supervised contrastive learning. I saved the model with:
`l2v.save('/llm2vec_models/final_merged_model', merge_before_save=True, save_config=True)`
Now when I try to run m…