This PR implements the main processes of the evaluation framework. Usage instructions are in the README file in the eval directory. Some metrics (e.g., accuracy, contextual recall) are not yet fully supported, but we plan to release them soon. The PR also includes an example evaluation dataset and an example metric to give members a preview.