PrincetonML / SIF

sentence embedding by Smooth Inverse Frequency weighting scheme
MIT License

pre-trained SIF! #32

Open nstfk opened 5 years ago

nstfk commented 5 years ago

Are there any pre-trained SIF models available to use out of the box?

damienlancry commented 5 years ago

I don't think so, although you only need a pre-trained word embedding and a dictionary of unigram probabilities to make your own SIF. Also, the principal component removal (PCR) is supposed to be specific to your corpus, but I guess you could just compute the first singular vector on a generic corpus like Wikipedia or Common Crawl.
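As a rough illustration of the recipe described above, here is a minimal NumPy sketch. It is not this repository's API: the function names, the dict-based inputs `word_vectors` and `word_prob`, and the smoothing value `a=1e-3` are all assumptions made for the example. It weights each word by `a / (a + p(w))`, averages, and then removes the projection onto the first singular vector, which could be computed either on your own corpus or once on a generic one.

```python
import numpy as np

def sif_embeddings(sentences, word_vectors, word_prob, a=1e-3):
    """Weighted average of word vectors with weights a / (a + p(w)).

    sentences    : list of token lists (hypothetical input format)
    word_vectors : dict mapping word -> 1-D vector (any pre-trained embedding)
    word_prob    : dict mapping word -> unigram probability
    a            : smoothing parameter (assumed value)
    """
    dim = len(next(iter(word_vectors.values())))
    emb = np.zeros((len(sentences), dim))
    for i, tokens in enumerate(sentences):
        # Skip tokens that have no pre-trained vector.
        known = [w for w in tokens if w in word_vectors]
        if not known:
            continue
        # Words missing from word_prob get probability 0, i.e. weight 1.
        weights = np.array([a / (a + word_prob.get(w, 0.0)) for w in known])
        vectors = np.array([word_vectors[w] for w in known], dtype=float)
        emb[i] = weights @ vectors / len(known)
    return emb

def first_singular_vector(emb):
    """First right singular vector of the sentence-embedding matrix."""
    _, _, vt = np.linalg.svd(emb, full_matrices=False)
    return vt[0]

def remove_component(emb, u):
    """Subtract from each row its projection onto the unit vector u."""
    return emb - np.outer(emb @ u, u)

# Toy usage with made-up vectors and probabilities.
word_vectors = {"the": [1.0, 1.0], "cat": [1.0, 0.0], "dog": [0.0, 1.0]}
word_prob = {"the": 0.05, "cat": 0.001, "dog": 0.001}
sentences = [["the", "cat"], ["the", "dog"], ["cat", "dog"]]

raw = sif_embeddings(sentences, word_vectors, word_prob)
u = first_singular_vector(raw)   # could instead come from a generic corpus
sif = remove_component(raw, u)
```

Since `u` is just a vector, one computed on a large generic corpus can be saved and reused with `remove_component` on new sentences, which is the shortcut suggested in the comment above.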