The forward() and generate() call by default returns the output of unintervened model. This could cause slow down during training and inference. We disable them by default.
Testing Done
notebook is updated.
Checklist:
[x] My PR title strictly follows the format: [Your Priority] Your Title
Description
The
forward()
andgenerate()
call by default returns the output of unintervened model. This could cause slow down during training and inference. We disable them by default.Testing Done
notebook is updated.
Checklist:
[Your Priority] Your Title