stanfordnlp / pyreft

ReFT: Representation Finetuning for Language Models
https://arxiv.org/abs/2404.03592
Apache License 2.0

[P1] Is it possible to merge the base model + ReFT model into a single model? #99

Closed: celsowm closed this 1 month ago

celsowm commented 1 month ago

Hi! Is it possible to merge the base model and the ReFT model into a single model?

Loading both models every time is not ideal.

Thanks in advance!

frankaging commented 1 month ago

@celsowm Thanks for your question! Currently, it is not possible to merge the "effect" of an intervention into the model weights, because we intervene on a parameter-less stream: the residual stream.

However, I do want to point out that this "inability to merge" is actually a feature of ReFT. For instance, if you train a set of interventions on the same base LM, with each intervention adapted to a distinct domain, you can attach different interventions on the fly to steer model behavior differently, with minimal switching cost.
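To illustrate why the effect cannot be folded into the weights, here is a minimal conceptual sketch (not the pyreft API) of a LoReFT-style edit, which rewrites a hidden activation h on the residual stream as h' = h + R^T (W h + b - R h), following the paper. Since the edit is applied to activations rather than to any weight matrix, there is no parameter to merge it into; swapping the hypothetical (R, W, b) triple swaps the adaptation instantly. All names below are illustrative.

```python
def matvec(M, v):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def transpose(M):
    return [list(col) for col in zip(*M)]

def loreft_intervene(h, R, W, b):
    """Apply h' = h + R^T (W h + b - R h).

    h is a length-d activation on the residual stream; R and W are
    r x d low-rank projections; b has length r. The edit lives on the
    activation, so no base-model weight is changed.
    """
    Wh = matvec(W, h)
    Rh = matvec(R, h)
    delta = [wh + bi - rh for wh, bi, rh in zip(Wh, b, Rh)]
    correction = matvec(transpose(R), delta)
    return [hi + ci for hi, ci in zip(h, correction)]

# Swapping "domains" is just choosing a different (R, W, b) triple;
# the base activation h and the frozen model are untouched.
h = [1.0, 2.0, 3.0]
R = [[1.0, 0.0, 0.0]]      # rank-1 subspace (r=1, d=3)
W_a = [[0.5, 0.5, 0.0]]    # hypothetical "domain A" projection
b_a = [0.1]
print(loreft_intervene(h, R, W_a, b_a))
```

Note that when W equals R and b is zero, the correction term vanishes and the activation passes through unchanged, which is why the intervention is a targeted edit in a low-rank subspace rather than a global weight update.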

I am closing this issue, but feel free to follow up if you have more questions!