Closed DavidUdell closed 10 months ago
The shape of the activations tensor cached by `hooks_manager` will change, but this should be possible, because the selected subset of the dataset all faces a single ablation hook at any one time. This is currently a runtime bottleneck.
Actually, the upside here is more limited than I had hoped.