huggingface / evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
https://huggingface.co/docs/evaluate
Apache License 2.0
1.96k stars 249 forks source link

Evaluation not working with `microsoft/deberta-v3-large` #553

Open msinha251 opened 6 months ago

msinha251 commented 6 months ago

Hi,

I was trying to evaluate the model trained on microsoft/deberta-v3-large but looks like it's having some issues on indexing. Any ideas or support for this one ?

Error: IndexError: list index out of range

Code to Reproduce:

import pandas as pd
from datasets import load_dataset
from evaluate import evaluator
from transformers import pipeline

models = [
    "microsoft/deberta-v3-large"
]

data = load_dataset("conll2003", split="validation").shuffle().select(range(10))
task_evaluator = evaluator("token-classification")

results = []
for model in models:
    results.append(
        task_evaluator.compute(
            model_or_pipeline=model, data=data, metric="seqeval"
            )
        )

df = pd.DataFrame(results, index=models)
df[["overall_f1", "overall_accuracy", "total_time_in_seconds", "samples_per_second", "latency_in_seconds"]]
msinha251 commented 4 months ago

Can someone please advice, it's still the same issue ?

maayanorner commented 1 month ago

Same issue, another model.