Open grayJiaaoLi opened 2 weeks ago
User story
Acceptance criteria
Implement the following evaluations:
Use the same test dataset for every model
Compare the selected quantitative metrics across the different models
Document our impressions of the fine-tuned LLM's answers
Document the results and upload the comparison to GitHub
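
As a starting point for the comparison, here is a minimal sketch of how the same test set could be run through several models and the scores collected into one table. The issue does not name the metrics, so `exact_match` and `token_f1` are placeholder assumptions, and `compare_models`, `to_markdown`, and the model callables are hypothetical names for illustration, not existing project code.

```python
from collections import Counter


def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the answers match after trimming and lowercasing, else 0.0."""
    return float(prediction.strip().lower() == reference.strip().lower())


def token_f1(prediction: str, reference: str) -> float:
    """Whitespace-token F1 between a prediction and a reference answer."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


# Assumption: placeholder metrics; swap in whichever metrics the team selects.
METRICS = {"exact_match": exact_match, "token_f1": token_f1}


def compare_models(models, test_set):
    """Score every model on the same test set.

    models:   dict mapping a model name to a callable(question) -> answer
    test_set: list of (question, reference_answer) pairs
    Returns a dict: model name -> {metric name -> mean score}.
    """
    results = {}
    for name, answer_fn in models.items():
        predictions = [answer_fn(question) for question, _ in test_set]
        results[name] = {
            metric: sum(fn(pred, ref) for pred, (_, ref) in zip(predictions, test_set))
            / len(test_set)
            for metric, fn in METRICS.items()
        }
    return results


def to_markdown(results):
    """Render the comparison as a Markdown table that can be pasted on GitHub."""
    lines = [
        "| Model | " + " | ".join(METRICS) + " |",
        "|" + " --- |" * (len(METRICS) + 1),
    ]
    for name, scores in results.items():
        cells = " | ".join(f"{scores[metric]:.3f}" for metric in METRICS)
        lines.append(f"| {name} | {cells} |")
    return "\n".join(lines)
```

Each model is passed as a plain callable so that the base model and the fine-tuned model go through exactly the same loop and test data. The output of `to_markdown(compare_models(models, test_set))` could be committed alongside the written impressions.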
Definition of done (DoD)
DoD general criteria