oborchers / Fast_Sentence_Embeddings

Compute Sentence Embeddings Fast!
GNU General Public License v3.0

Handling out of vocabulary #50

Closed: Filco306 closed this issue 2 years ago

Filco306 commented 2 years ago

Hello!

I am using this package to compute reasonable vectors, but for some short sequences of words, all of my words are out of vocabulary (OOV). I tried using FastText, but I get:

*** RuntimeError: Model must be child of BaseWordEmbeddingsModel or BaseKeyedVectors. Received FastText(vocab=2519370, size=300, alpha=0.025)

Is it possible to use FastText and handle out-of-vocabulary words?

Thank you!

Filco306 commented 2 years ago

Perhaps there is an example somewhere of how to use an OOV-capable FastText model together with the uSIF model?

Filco306 commented 2 years ago

Managed to solve this.
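
The thread does not record what the fix was. As background on why FastText can produce a vector for an out-of-vocabulary word at all, here is a minimal pure-Python sketch of the subword idea: a word is wrapped in `<` and `>` boundary markers, split into character n-grams, and its vector is built from the vectors of those n-grams. The names `char_ngrams`, `oov_vector` and `TOY_NGRAM_VECTORS` are hypothetical and exist only for this illustration. They are not part of fse or gensim, and the sketch is simplified: real FastText implementations hash n-grams into a fixed number of buckets rather than looking them up by string.

```python
from typing import Dict, List


def char_ngrams(word: str, min_n: int = 3, max_n: int = 6) -> List[str]:
    """Return the character n-grams of `word`, wrapped in '<' and '>' markers."""
    wrapped = f"<{word}>"
    grams = []
    for n in range(min_n, max_n + 1):
        # Slide a window of width n over the wrapped word.
        for i in range(len(wrapped) - n + 1):
            grams.append(wrapped[i:i + n])
    return grams


def oov_vector(word: str, ngram_vectors: Dict[str, List[float]], dim: int) -> List[float]:
    """Average the vectors of the n-grams we know; zero vector if none are known."""
    known = [ngram_vectors[g] for g in char_ngrams(word) if g in ngram_vectors]
    if not known:
        return [0.0] * dim
    # zip(*known) iterates over the vectors column by column.
    return [sum(column) / len(known) for column in zip(*known)]


# Toy n-gram table (made-up values, assumption for illustration only).
TOY_NGRAM_VECTORS = {
    "<ca": [1.0, 0.0],
    "cat": [0.0, 1.0],
    "at>": [1.0, 1.0],
}

# "cats" is not a stored word, but it shares the n-grams "<ca" and "cat".
print(oov_vector("cats", TOY_NGRAM_VECTORS, dim=2))
```

Because "cats" shares n-grams with entries in the table, it still gets a non-zero vector, whereas a word with no known n-grams falls back to zeros. A plain word-level lookup would treat both cases as OOV.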