mhueppe / machineLearningProject_jaNoMi

This is a public Repository to manage the Machine Learning Project for WS 2024/25.
GNU General Public License v3.0
1 stars 0 forks source link

Idea: Look into multi-phase training approach #19

Open nllink opened 10 hours ago

nllink commented 10 hours ago

Training Stages Stage 1: Train a language model on abstracts alone, for it to learn the vocabulary of our scientific domain. Stage 2: Fine-tune on abstract-title pairs Stage 3: Fine-tune further using reinforcement learning ?? Maybe with our metrics as reward function?