[cls] or pooled embedding?

JayYip / m3tl

BERT for Multitask Learning

https://jayyip.github.io/m3tl/

Apache License 2.0

545 stars 125 forks source link

[cls] or pooled embedding? #107

Closed EdwardChan5000 closed 2 years ago

EdwardChan5000 commented 2 years ago

When i using event_cls to access a classification, the predicted labels come from cls or pooled embedding? If you can point that part of code to me, would be great helpful to me. Thanks.

JayYip commented 2 years ago

In most cases, [CLS] token embedding and pooled embedding are the same.

But some transformer model outputs don't have pooler outputs. In that case, pooler outputs would be token-wise average embedding.

https://github.com/JayYip/m3tl/blob/master/m3tl/modeling.py#L54 https://github.com/JayYip/m3tl/blob/master/m3tl/problem_types/cls.py#L38