YixinSong-e closed 10 months ago
I've been using https://github.com/jondurbin/qlora (train.py) for the SFT phase. For now I've copied the original over to this repo, and I'll look at minifying it again later.
Very nice work! My SFT training is now running. By the way, if I want to improve the model's MMLU performance, do you have any recommended datasets?
Good question. You would need to capture the per-question results of the MMLU benchmark, then identify the specific categories/topics where the model underperforms. Once you know which area(s) the model is lacking in, it could be as simple as including an existing dataset that covers those topic(s), or perhaps generating a synthetic Q/A pair dataset from Wikipedia articles or the like.
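For the first step, here is a minimal sketch of how you might rank subjects by accuracy once you have per-question results. The function names, the `(subject, is_correct)` record shape, and the 0.5 threshold are all assumptions for illustration, and the records are made-up placeholders, not real benchmark output. Adapt it to whatever your eval harness actually emits.

```python
from typing import Dict, Iterable, List, Tuple


def subject_accuracy(records: Iterable[Tuple[str, bool]]) -> Dict[str, float]:
    """Aggregate (subject, is_correct) records into per-subject accuracy."""
    totals: Dict[str, Tuple[int, int]] = {}
    for subject, is_correct in records:
        correct, seen = totals.get(subject, (0, 0))
        totals[subject] = (correct + int(is_correct), seen + 1)
    return {s: c / n for s, (c, n) in totals.items()}


def weakest_subjects(
    accuracy: Dict[str, float], threshold: float = 0.5
) -> List[Tuple[str, float]]:
    """Return subjects below the threshold, lowest accuracy first."""
    weak = [(s, a) for s, a in accuracy.items() if a < threshold]
    return sorted(weak, key=lambda item: item[1])


# Placeholder records (hypothetical, for illustration only).
records = [
    ("college_physics", True),
    ("college_physics", False),
    ("college_physics", False),
    ("college_physics", False),
    ("world_religions", True),
    ("world_religions", True),
    ("world_religions", True),
    ("world_religions", False),
    ("formal_logic", False),
    ("formal_logic", True),
    ("formal_logic", False),
]

accuracy = subject_accuracy(records)
weak = weakest_subjects(accuracy, threshold=0.5)
for subject, acc in weak:
    print(f"{subject}: {acc:.2f}")
```

The subjects at the top of that list are the ones to target with an existing dataset or with synthetic Q/A pairs.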
Thanks! :)