Add CLS pooling, which is frequently used for BERT-like models. It extracts the first token's hidden state as the embedding.
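A minimal sketch of CLS pooling, assuming the model's last hidden states come as a `(batch, seq_len, hidden)` tensor (function name `cls_pool` is illustrative, not from any existing API):

```python
import torch


def cls_pool(last_hidden_states: torch.Tensor) -> torch.Tensor:
    # last_hidden_states: (batch, seq_len, hidden)
    # The [CLS] token is at position 0, so select it for every example.
    return last_hidden_states[:, 0]
```

This assumes the tokenizer prepends the special token (as BERT tokenizers do), so no attention mask is needed.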
Add last token pooling, which is frequently used for recent text embedding models with a decoder architecture, such as https://huggingface.co/intfloat/e5-mistral-7b-instruct. It uses the last non-padding token's hidden state as the embedding.
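A hedged sketch of last token pooling, assuming a `(batch, seq_len, hidden)` tensor and an attention mask; the padding-side check mirrors the approach shown on the e5-mistral-7b-instruct model card (function name `last_token_pool` is illustrative):

```python
import torch


def last_token_pool(
    last_hidden_states: torch.Tensor, attention_mask: torch.Tensor
) -> torch.Tensor:
    # last_hidden_states: (batch, seq_len, hidden); attention_mask: (batch, seq_len)
    # With left padding, the last position is real for every example.
    left_padding = attention_mask[:, -1].sum() == attention_mask.shape[0]
    if left_padding:
        return last_hidden_states[:, -1]
    # With right padding, find each sequence's last non-padding position.
    seq_lengths = attention_mask.sum(dim=1) - 1
    batch_idx = torch.arange(last_hidden_states.shape[0])
    return last_hidden_states[batch_idx, seq_lengths]
```

Decoder-only models attend causally, so only the final token has seen the whole input, which is why its hidden state serves as the sequence embedding.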