Hello, how do you get the results for w/o ITI, do you manually put intervensions = {} in alt_tqa_evaluate function?
Also i have another question, how do you save the new model after changing the activations direction ? if i save the model under with TraceDict(model, layers_to_intervene, edit_output=intervene) as ret: will that produce the model with ITI used?
Hello, how do you get the results for w/o ITI, do you manually put intervensions = {} in alt_tqa_evaluate function?
Also i have another question, how do you save the new model after changing the activations direction ? if i save the model under
with TraceDict(model, layers_to_intervene, edit_output=intervene) as ret:
will that produce the model with ITI used?