Why is the pml's output the sum over different verbalizations rather than the max?

timoschick / pet

This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"

Apache License 2.0

1.62k stars, 282 forks

Thanks for sharing the code! I have a small question. The README says:

You can also define multiple verbalizations for a single label. For example, if you are unsure which words best represent the labels in a binary sentiment classification task, you could define your verbalizer as follows: VERBALIZER = {"+1": ["great", "good", "wonderful", "perfect"], "-1": ["bad", "terrible", "horrible"]}

If the goal is to find the verbalization that best represents each label, why is the pml's output the sum over the different verbalizations (divided by their count) rather than the max over them? This is the statement at line 229 of the code: cls_logits = cls_logits.sum(axis=1) / filler_len
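To make the difference concrete, here is a minimal sketch comparing the two pooling choices on the verbalizer quoted above. The logit values are invented for illustration, and the helper names mean_pool and max_pool are hypothetical; this is not the repository's code, only a plain-Python restatement of "sum divided by the number of verbalizations" versus "max".

```python
# Hypothetical logits of each verbalizer token at the masked position.
# These numbers are made up purely to illustrate the two pooling choices.
token_logits = {
    "great": 2.0, "good": 1.0, "wonderful": 0.5, "perfect": 0.5,
    "bad": 3.0, "terrible": -1.0, "horrible": -2.0,
}

VERBALIZER = {
    "+1": ["great", "good", "wonderful", "perfect"],
    "-1": ["bad", "terrible", "horrible"],
}


def mean_pool(verbalizer, logits):
    """Sum the logits of a label's verbalizations, then divide by their count."""
    return {
        label: sum(logits[word] for word in words) / len(words)
        for label, words in verbalizer.items()
    }


def max_pool(verbalizer, logits):
    """Keep only the highest-scoring verbalization of each label."""
    return {
        label: max(logits[word] for word in words)
        for label, words in verbalizer.items()
    }


mean_scores = mean_pool(VERBALIZER, token_logits)
max_scores = max_pool(VERBALIZER, token_logits)

print("mean:", mean_scores, "->", max(mean_scores, key=mean_scores.get))
print("max: ", max_scores, "->", max(max_scores, key=max_scores.get))
```

With these made-up numbers the averaged score favors "+1", while the max favors "-1" because of the single high-scoring token "bad". So the two choices can lead to different predictions, which is why I am asking about it.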

Thank you very much.

Why is the pml's output the sum over different verbalizations rather than the max? #63