Knowledge-Graph-Hub / neat-ml

Network Embedding All the Things
BSD 3-Clause "New" or "Revised" License
18 stars 1 forks source link

Replace NEATs BERT text embedding stuff with Ensmallen's (better) version of this #45

Open justaddcoffee opened 2 years ago

justaddcoffee commented 2 years ago

Right now we are including BERT embeddings for textual elements in the graph in NEAT using a fairly naive average embedding of all text for a given node.

Ensmallen now supports a more sophisticated version fo this, with a better way of incorporating BERT embedding using for example weighting of embeddings using TF-IDF.

We should therefore replace NEATs version of this with Ensmallen's BERT functionality

justaddcoffee commented 2 years ago

Adding - Luca has a Jupyter notebook here that demonstrates how to use the Ensmallen API for this BERT embedding feature (see cell marked "TFIDF BERT embedding")