Open fredzannarbor opened 1 year ago
The training process adjusts the weights so that for every given input the desired output is generated. Here is an introductory course on how transformers work: https://huggingface.co/course/chapter1/4?fw=pt
We are currently working on cleaning up and expanding upon the training data provided by the Stanford team here, if you'd like to contribute: https://github.com/gururise/AlpacaDataCleaned
Ability to load arbitrary training datasets have been merged in https://github.com/tloen/alpaca-lora/pull/175
What is the science behind the seed questions? If you added, say, 10% poetry, would the resulting set of 57,500 instructions produce an alpaca that was better at poetry? Have people tested different groups of seed questions expanded in different ways?