evaluation-datasets Search Results

1000+ results for evaluation-datasets


deepset-ai/haystack #7973

Handle errors separately in evaluators and the run results

**Context** When running the evaluators over larger datasets, depending on the model, it is very common to run into LLM errors where the output is not valid JSON. For example, while running the ben…

mrm1001 updated 1 month ago
1
voidful/Codec-SUPERB #38

Results for SemantiCodec

Here is the result for [SemantiCodec](https://haoheliu.github.io/SemantiCodec/). This is a 16 kHz codec with three different bit rates: 1. For token rate 100 with book size 16384 the bit rate is 1.35 …

yyua8222 updated 2 months ago
6
OpenMOSS/AnyGPT #31

Train_loss = 0 and Eval_loss = NaN in stage2_sft

Hello! Thank you for your work at MLLM. I had a fine-tuning bug that I couldn't fix: when I ran the `stage2_sft.sh` script and trained with speech_conv_datasets only, the logger showed that the trai…

Everglow-X updated 1 month ago
3
dynamic-superb/dynamic-superb #144

[Task] Vehicle sounds classification

# Task Name Vehicle sounds classification ## Task Objective The primary goal of this task is to evaluate the audio language model's capability to accurately recognize and classify different …

bunbun221008 updated 2 months ago
3
DepthAnything/Depth-Anything-V2 #99

Fine-tuning problems

Dear author, @LiheYoung, hello. The metric depth fine-tuning has really baffled me these days: I used my own datasets (sparse labels), and tried to lower the lr of the pretrained model vitb or vitl…

Edric-star updated 2 days ago
18
dynamic-superb/dynamic-superb #65

[Task] Covid-19 Cough Audio Classification

# Task Name Covid-19 Cough Audio Classification ## Task Objective To develop and validate a machine learning model that uses audio cough recordings to accurately **identify and differentiate betw…

Bai1026 updated 2 months ago
6
open-compass/VLMEvalKit #336

Custom dataset is not considered MCQ by models

VLMEvalKit version: commit 8e0aace0504d952a25e310a1de66a32c2c1476f1 I added a custom MCQ format dataset to LMUData directory. It is successfully loaded and shows "UserWarning: Will assume unsupport…

zodiacg updated 1 week ago
1
biomodhub/biomod2 #494

Help with BIOMOD_xxx - [short question here]

Hello, Recently, while working with this package, I encountered a problem. I randomly split the test and training data into 20% and 80%. I also generated 10,000 separate pseudo-absence data points. …

saeedbehzadifard1376 updated 1 week ago
1
zou-group/textgrad #112

No Improvement after using TextGrad for Prompt Optimization

Hello, I have currently tweaked the prompt optimization tutorial so that I can see if I can improve its ability to improve on medical multiple-choice datasets. However, the results are getting p…

nikhilk7153 updated 2 weeks ago
1
paperswithlove/papers-we-read #37

Cambrian-1: A Fully Open, Vision-Centric Exploration of Mult…

Paper : [https://arxiv.org/pdf/2406.16860](https://arxiv.org/pdf/2406.16860) Website : [https://cambrian-mllm.github.io](https://cambrian-mllm.github.io) Code : [https://github.com/cambrian-mllm/cam…

runhani updated 2 months ago
3
