issues
search
amosproj
/
amos2024ss08-cloud-native-llm
MIT License
6
stars
1
forks
source link
Implement a Script for LLM fine-tuning
#98
Open
grayJiaaoLi
opened
1 week ago
grayJiaaoLi
commented
1 week ago
User story
As a ML engineer
I want / need to implement a script for LLM fine-tuning on Hetzner machine
So that the fine-tuning can easily be performed using high performance GPU
Acceptance criteria
Implement the Fine-Tuning functions
Integrate the selected fine-tuning tools or solutions
Monitor the process
GPU usage
GPU storage, catch exceptions
Errors
Hyperparameters are systematically evaluated using tools
For the evaluation the metric specified in
https://github.com/amosproj/amos2024ss08-cloud-native-llm/issues/81
is used
The results of
https://github.com/amosproj/amos2024ss08-cloud-native-llm/issues/92
are incorporated
(Enable the
checkpoints
feature)
Prevent bad situations: broken down or lost connection
Definition of done (DoD)
Bill of Materials in the planning document has been updated
All the complex logics have been tested
All feature branches have been merged and closed
New feature code has been documented
Potential new licenses have been checked
All GitHub Actions are passing
The requirement.txt is updated
DoD general criteria
Feature has been fully implemented
Feature has been merged into the mainline
All acceptance criteria were met
Product owner approved features
All tests are passing
Developers agreed to release
dominic0df
commented
2 days ago
already invested SP05, left SP05
User story
Acceptance criteria
Definition of done (DoD)
DoD general criteria