NLP-in-the-Social-Sciences / Reddit-Data-Pipeline

Code and data we are using for facilitating an ETL pipeline for Low SES research
GNU General Public License v3.0
0 stars 1 forks source link

`sentence-transformer` model double-load #28

Open MoRevolution opened 1 year ago

MoRevolution commented 1 year ago

The first load of the ''multi-qa-distilbert-cos-v1" model doesn't encode the narratives in our dataset properly. If I do reload the model, however, encoding works fine. I replicated this on three different machines, but not sure if this is a big priority. Anyways, it's worth a check.