🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
1.98k
stars
255
forks
source link
LocalModuleTest.test_load_metric_code_eval fails with "The "code_eval" metric executes untrusted model-generated code in Python." #597
Open
jpodivin opened 4 months ago
Environment:
Python 3.8 evaluate: a4bdc10c48a450b978d91389a48dbb5297835c7d OS: Ubuntu 24
Trace: