ai-evaluation Search Results

1000+ results
for ai-evaluation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

EleutherAI/lm-evaluation-harness #2260

Bug in Leaderboard IFEval Code

https://github.com/EleutherAI/lm-evaluation-harness/blob/8138fd52437dcd8c76ac87bdc9d684840e794c42/lm_eval/tasks/leaderboard/ifeval/instructions.py#L1384 the updated IFEval dataset (https://www.oxen…

noowad93 updated 2 months ago
1
OpenPecha/stt-wav2vec2 #9

STT0070: STT wav2vec2 finetuning on situ rinpoche training d…

### Description we need to train wav2vec2 model for specific speaker accent and compare the performance with the base model on test data of that particular speaker. ### Completion Criteria A model th…

gangagyatso4364 updated 6 days ago
3
Azure-Samples/azure-search-openai-demo #1989

o1-preview integration / testing

### This issue is for a: (mark with an `x`) ``` - [ ] bug report -> please search issues before submitting - [X] feature request - [ ] documentation issue or request - [ ] regression (a behavior …

ratkinsoncinz updated 1 month ago
3
Lightning-AI/torchmetrics #2464

Contribution: Add new audio/speech metrics for generative au…

## 🚀 Feature Add new audio metrics for generative audio processing ### Motivation The evaluation of speech processing (denoising, dereverberation and in general enhancement) highly depends o…

d-caviedes updated 3 months ago
7
JailbreakBench/jailbreakbench #34

GPT4 as judge/classifier BadRequestError.

Hi team, I have tried to use GPT4 as a classifier to classify the model responses but am getting content moderation filter trigger. Changes made -- Instead of Llama70BJudge, I have a similar cla…

NamburiSrinath updated 3 days ago
1
Lightning-AI/pytorch-lightning #16830

Support all iterator modes for fit/validate/test/predict

### Description & Motivation `trainer.fit` only works with `CombinedLoader(..., mode="max_size_cycle"|"min_size")` `trainer.{validate,test,predict}` only works with `CombinedLoader(..., mode="se…

carmocca updated 1 month ago
12
uptrain-ai/uptrain #697

Issue Creating new project

**Describe the bug** When attempting to create a new project by entering all the necessary details, an error message is displayed, indicating an issue with running the evaluation. The error seems to …

chiragksharma updated 3 months ago
2
probabl-ai/skore #492

Train_test_split assist v1 - create warnings

As a data scientist, I want to be guided in the choice of the arguments in the scikit-learn train_test_split function, without having too many warnings to avoid being over my cognitive budget charge (…

MarieS-WiMLDS updated 4 days ago
18
haotian-liu/LLaVA #1607

[Question] Unable to submit VQAv2 result file to the evaluat…

### Question When I try to upload the vqav2 result file to the evaluation server https://eval.ai/web/challenges/challenge-page/830/my-submission, after I select the phase, the page will jump to the e…

XiaoruiMaLU updated 2 weeks ago
2
ombhojane/explainableai #102

Enhancement of Model Evaluation Metrics

Currently, the evaluate_model function focuses primarily on accuracy and F1-score for classification models, and MSE and R² for regression models. We could enhance this by including additional evaluat…

Kajalkansal30 updated 3 weeks ago
3

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for ai-evaluation

1000+ results
for ai-evaluation