turboLJY / Transfer-Prompts-for-Text-Generation

16 stars 1 forks source link

Questions about the paper? #3

Open skpig opened 2 years ago

skpig commented 2 years ago

Thanks for the remarkable work. I've got questions about the paper. It'll be appreciated if you could help me.

  1. How to achieve/train the three learned vectors: cluster key, prompt key & target task query ?
  2. After retrieving prompt ~p, what parameters are trained in the learning process of target task? Section 5.1 says "only tune the adaptive attention mechanism". But mentioned in the last paragraph of 4.2, the ~p is fixed. Isn't that contradictory? I'm quite confused.
turboLJY commented 2 years ago

Hi,

In the learning process of target task, we only train the cluster key and prompt key (corresponding to a source prompt), and the task query (corresponding to the target task). Through the above three parameters, we can compute the retrieved prompt ~p (Eq. 4). The mention '~p is frozen' in our paper maybe a little weird, just skip it.