Closed DeanChugall closed 1 month ago
Fixed ruff format --check . for CI.
It would be great if we used pre-commit run, but when it runs, some files do not satisfy its criteria, and I don't want to touch those files.
The files affected by pre-commit run are shown in the image below, etc.
Hmm, this should not happen. Are you sure you are running the correct versions?
Absolutely. Try checking at least one of those files manually, e.g. evaluation-task-request.md.
Let's just wait for the quality check and see if we can merge.
Serbian LLM Benchmark Task Configuration and Prompt Functions
Summary:
This pull request introduces task configurations and prompt functions for evaluating LLM models on various Serbian datasets. The module includes tasks for:
The tasks are defined using the LightevalTaskConfig class, and prompt generation is streamlined through a reusable serbian_eval_prompt function.
Changes:
Task Configurations:
LightevalTaskConfig.
HFSubsets added for dataset subset management, improving code maintainability and clarity.
create_task_config function allows dynamic task creation with dependency injection for flexibility in dataset and metric selection.
Prompt Functions:
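As a rough illustration of what a multiple-choice prompt function for this section could look like, here is a minimal sketch. It is not the code from this pull request: the `Doc` dataclass is a hypothetical stand-in for the evaluation framework's document type, the row fields (`query`, `choices`, `answer`) are an assumed dataset schema, and the Serbian labels ("Pitanje", "Odgovor") are illustrative wording only.

```python
from dataclasses import dataclass
from string import ascii_uppercase


@dataclass
class Doc:
    """Hypothetical stand-in for the framework's document type."""

    task_name: str
    query: str
    choices: list[str]
    gold_index: int


def serbian_eval_prompt_sketch(line: dict, task_name: str = "") -> Doc:
    """Build a lettered multiple-choice prompt from one dataset row.

    Assumes the row has 'query' (str), 'choices' (list of str) and
    'answer' (index of the correct choice); these names are assumptions.
    """
    letters = ascii_uppercase[: len(line["choices"])]
    # One "A. option" line per choice.
    options = "\n".join(
        f"{letter}. {choice}" for letter, choice in zip(letters, line["choices"])
    )
    query = f"Pitanje: {line['query']}\n{options}\nOdgovor:"
    return Doc(
        task_name=task_name,
        query=query,
        # The model is scored on which answer letter it continues with.
        choices=[f" {letter}" for letter in letters],
        gold_index=int(line["answer"]),
    )
```

Calling it on a row such as `{"query": "Koji je glavni grad Srbije?", "choices": ["Niš", "Beograd", "Novi Sad"], "answer": 1}` yields a prompt with options A to C and a gold index of 1.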
serbian_eval_prompt function creates a structured multiple-choice prompt in Serbian.
Logging:
hello_message banner is printed upon task initialization, listing all available tasks.
hlog_warn.
Key Features:
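The enum-plus-factory pattern behind these features can be sketched roughly as follows. This is an illustration under stated assumptions, not the pull request's code: `TaskConfigSketch` is a hypothetical stand-in for `LightevalTaskConfig` (whose real constructor arguments are not shown here), and the enum members, repository name, and metric name are placeholders.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable


class HFSubsetsSketch(Enum):
    """Placeholder subset names; the real enum members are not shown here."""

    SUBSET_A = "subset_a"
    SUBSET_B = "subset_b"


@dataclass(frozen=True)
class TaskConfigSketch:
    """Hypothetical stand-in for the framework's task configuration class."""

    name: str
    hf_repo: str
    hf_subset: str
    prompt_function: Callable[[dict, str], object]
    metrics: tuple[str, ...]


def create_task_config_sketch(
    task_name: str,
    prompt_function: Callable[[dict, str], object],
    hf_repo: str,
    hf_subset: HFSubsetsSketch,
    metrics: tuple[str, ...] = ("accuracy",),
) -> TaskConfigSketch:
    """Build one task config; dataset, prompt and metrics are all injected."""
    return TaskConfigSketch(
        name=task_name,
        hf_repo=hf_repo,
        hf_subset=hf_subset.value,  # the enum keeps subset strings in one place
        prompt_function=prompt_function,
        metrics=tuple(metrics),
    )


def demo_prompt(line: dict, task_name: str) -> str:
    """Trivial placeholder prompt function used only for this sketch."""
    return line["query"]


# One task per subset, generated from the enum instead of copy-pasted configs.
TASKS = [
    create_task_config_sketch(
        task_name=f"demo:{subset.name.lower()}",
        prompt_function=demo_prompt,
        hf_repo="example-org/example-dataset",  # placeholder repository name
        hf_subset=subset,
    )
    for subset in HFSubsetsSketch
]
```

Because every dependency is passed in as an argument, adding a dataset subset means adding one enum member rather than writing a new configuration by hand.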
HFSubsets Enum improves the readability and maintainability of the dataset subset references.
create_task_config function simplifies task creation, promoting cleaner and more maintainable code.
Future Enhancements: