Right now, we are training the discriminator on the model's generated prediction for a prompt, and this is causing us to exhaust memory.
Can we instead train a discriminator to directly classify whether different writing samples come from different demographic groups, and then use it to optimize the prompt?
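
To make the question concrete, here is a minimal sketch of what training the discriminator directly on writing samples could look like, assuming a binary group label and a simple bag-of-words logistic regression. None of this comes from our current setup: the toy samples, the 0/1 labels, and the names `featurize`, `train_discriminator`, and `predict_proba` are all hypothetical placeholders. The point is only that the discriminator is fit once on a fixed set of samples, so no model generations have to be held in memory while it trains.

```python
import math
from collections import Counter


def featurize(text, vocab):
    """Bag-of-words counts over a fixed vocabulary (unknown words are ignored)."""
    counts = Counter(text.lower().split())
    return [counts[word] for word in vocab]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_discriminator(samples, labels, epochs=200, lr=0.5):
    """Fit a logistic-regression discriminator with plain per-sample gradient descent.

    samples: list of strings (fixed writing samples, not model generations)
    labels:  list of 0/1 group labels (hypothetical placeholder labels)
    """
    vocab = sorted({word for s in samples for word in s.lower().split()})
    features = [featurize(s, vocab) for s in samples]
    weights = [0.0] * len(vocab)
    bias = 0.0
    for _ in range(epochs):
        for x, y in zip(features, labels):
            p = sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))
            grad = p - y  # gradient of the log loss w.r.t. the logit
            weights = [w - lr * grad * xi for w, xi in zip(weights, x)]
            bias -= lr * grad
    return vocab, weights, bias


def predict_proba(text, model):
    """Probability that `text` belongs to the group labeled 1."""
    vocab, weights, bias = model
    x = featurize(text, vocab)
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


# Hypothetical toy data: placeholder tokens stand in for real writing samples.
samples = [
    "alpha beta alpha gamma",
    "alpha alpha beta",
    "delta epsilon delta zeta",
    "epsilon delta delta",
]
labels = [0, 0, 1, 1]

model = train_discriminator(samples, labels)
```

Once trained, the discriminator is frozen, and `predict_proba(candidate_text, model)` can score any text without further training. How that score would feed into prompt optimization is left open here, since it depends on the objective we settle on.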