Closed puraminy closed 2 years ago
This is just a test to check all model variants are instantiated without errors. They are expected to give the same output before training, as LoRAs are initialised in such a way as to not alter the pre-trained model behaviour.
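To illustrate why the outputs match before training, here is a minimal sketch of the usual LoRA convention, where the up-projection matrix starts at zero so the low-rank update contributes nothing at first. This is not the repository's code: `LoRALinear`, `matvec`, the toy weights and the 0.02 init scale are all hypothetical names and values chosen for illustration.

```python
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class LoRALinear:
    """Toy linear layer with a low-rank adapter: y = W x + B (A x).

    Hypothetical illustration only; not the implementation from the repo.
    """

    def __init__(self, weight, rank, seed=0):
        rng = random.Random(seed)
        d_out, d_in = len(weight), len(weight[0])
        self.weight = weight  # frozen pre-trained weight, shape d_out x d_in
        # A gets a small random init (assumed scale 0.02), shape rank x d_in.
        self.A = [[rng.gauss(0.0, 0.02) for _ in range(d_in)] for _ in range(rank)]
        # B starts at zero, shape d_out x rank, so B (A x) is zero before training.
        self.B = [[0.0] * rank for _ in range(d_out)]

    def forward(self, x):
        base = matvec(self.weight, x)
        update = matvec(self.B, matvec(self.A, x))
        return [b + u for b, u in zip(base, update)]


weight = [[0.5, -1.0, 2.0], [1.5, 0.25, -0.75]]
x = [1.0, 2.0, 3.0]

# Variants with different ranks and different random A matrices.
variants = [LoRALinear(weight, rank=r, seed=r) for r in (1, 2, 4)]
outputs = [variant.forward(x) for variant in variants]

base_output = matvec(weight, x)
all_identical = all(out == base_output for out in outputs)
print(all_identical)
```

Under this assumption, every variant reproduces the frozen layer's output until training moves `B` away from zero, which is why identical decoded outputs are expected at that stage.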
Well, but do you have any code from your experiments on supervised tasks? I wonder how they are applied. For example, I tried to train a model with your loss function, but the loss didn't decrease during training and the output was also wrong.
Hi, I read your paper and checked your code. I don't know what the following code does; could you please provide some explanation? What is the expected output? I decoded the output and it's the same for all skilled_variants.
Moreover, I expected input-output (label) pairs... How can I use the model for a multitask supervised input-output problem?