nishantsubramani / steering_vectors

Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings
9 stars 0 forks source link

Question about inference #1

Open seongminp opened 2 years ago

seongminp commented 2 years ago

Hello. Thank you for open-sourcing your interesting work!

I had a question about how to extract z_steer given sentence x at inference time.

In the unsupervised sentiment transfer example, we perform arithmetic between steering vectors for each sentence. After training, would we not have just a single vector for z_steer? How do we recover z_steer for individual sentences?

Thank you!

seongminp commented 2 years ago

I understand now... Training and inference are not separate processes. The algorithm (complete with backdrop) is supposed to run for every new sentence we encounter.