weaviate / t2v-transformers-models

This is the repo for the container that holds the models for the text2vec-transformers module
BSD 3-Clause "New" or "Revised" License
39 stars 27 forks source link

text embedding average gives more weight to the last batch #11

Closed StefanBogdan closed 2 years ago

StefanBogdan commented 3 years ago

The current implementation gives more weight for the last batch if it has less sentences than all the previous ones. This can be fixed by adding all the sentence embeddings and normalize it by the number of sentences.