amosproj / amos2024ss08-cloud-native-llm

MIT License

Conduct Comparison of Evaluation Between Models #91

Open grayJiaaoLi opened 2 weeks ago

User story

  1. As a Machine Learning Engineer,
  2. I want to compare evaluation results across different models,
  3. so that we can see whether our fine-tuned model improves performance on CNCF-related questions.

Acceptance criteria

Definition of done (DoD)

DoD general criteria