nestauk / old_nesta_daps

[archived]
MIT License

[293] Text2Vector with luigi #304

Closed jaklinger closed 4 years ago

jaklinger commented 4 years ago

Closes #293 Closes #303

Converts text to vectors using BERT, in batches run with luigi. Note that I've written the module to be very generic, so that we can convert any text field in the database to vectors, which are then stored in MySQL as JSON.
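As a rough illustration of the batching and JSON storage idea (not the code in this PR), here is a minimal sketch. All of the names (`batched`, `placeholder_encode`, `text_to_json_rows`) are hypothetical, and `placeholder_encode` is a toy stand-in for a BERT encoder so that the example runs without any model or database.

```python
import json
from typing import Callable, Iterator, List, Sequence, Tuple

Vector = List[float]
Row = Tuple[int, str]  # (row id, text field); an assumed shape for illustration


def batched(items: Sequence, batch_size: int) -> Iterator[Sequence]:
    """Yield consecutive slices of `items` with at most `batch_size` elements."""
    if batch_size < 1:
        raise ValueError("batch_size must be positive")
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def placeholder_encode(texts: Sequence[str]) -> List[Vector]:
    """Toy stand-in for a BERT encoder (assumption, not a real model).

    Returns a 2-d vector per text: [character count, word count].
    """
    return [[float(len(t)), float(len(t.split()))] for t in texts]


def text_to_json_rows(
    rows: Sequence[Row],
    encode: Callable[[Sequence[str]], List[Vector]] = placeholder_encode,
    batch_size: int = 2,
) -> List[Tuple[int, str]]:
    """Encode the text of each row batch by batch.

    Returns (row id, JSON string) pairs, where the JSON string is the
    serialised vector, ready to be written to a JSON column.
    """
    out: List[Tuple[int, str]] = []
    for batch in batched(rows, batch_size):
        ids = [row_id for row_id, _ in batch]
        vectors = encode([text for _, text in batch])
        out.extend((row_id, json.dumps(vec)) for row_id, vec in zip(ids, vectors))
    return out


if __name__ == "__main__":
    rows = [(1, "hello world"), (2, "luigi"), (3, "a b c")]
    for row_id, payload in text_to_json_rows(rows):
        print(row_id, payload)
```

In a pipeline of this kind, each batch would presumably be handled by its own luigi task, and the JSON strings written to the database rather than printed. Because the encoder is passed in as a function, the same routine can be pointed at any text field.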

A subsequent PR will address the soft clustering described in #293, and that will also be generic.