Right now NEAT includes BERT embeddings for textual elements in the graph by taking a fairly naive average of the embeddings of all text for a given node.
Ensmallen now supports a more sophisticated version of this, with a better way of incorporating BERT embeddings, for example by weighting the embeddings using TF-IDF.
We should therefore replace NEAT's version of this with Ensmallen's BERT functionality.
Adding: Luca has a Jupyter notebook here that demonstrates how to use the Ensmallen API for this BERT embedding feature (see the cell marked "TFIDF BERT embedding").
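To make the difference between the two approaches concrete, here is a minimal sketch comparing a plain average with a TF-IDF-weighted average of token vectors. This is not Ensmallen's API or NEAT's current code: the function names, the tiny 2-dimensional vectors standing in for BERT token embeddings, and the smoothed IDF formula are all assumptions chosen for illustration.

```python
import math
from collections import Counter

# Hypothetical toy vectors standing in for BERT token embeddings.
TOKEN_VECTORS = {
    "the": [1.0, 0.0],
    "gene": [0.0, 1.0],
    "protein": [0.5, 0.5],
    "kinase": [0.2, 0.8],
}

# Hypothetical tokenized text, one list of tokens per node.
NODE_TEXTS = [
    ["the", "gene"],
    ["the", "protein"],
    ["the", "the", "kinase"],
]


def mean_embedding(tokens, vectors):
    """Naive approach: unweighted average of the token vectors."""
    dim = len(next(iter(vectors.values())))
    total = [0.0] * dim
    for tok in tokens:
        for i, x in enumerate(vectors[tok]):
            total[i] += x
    return [x / len(tokens) for x in total]


def idf_table(documents):
    """Smoothed inverse document frequency (one common variant, assumed here)."""
    n_docs = len(documents)
    doc_freq = Counter(tok for doc in documents for tok in set(doc))
    return {
        tok: math.log((1 + n_docs) / (1 + df)) + 1.0
        for tok, df in doc_freq.items()
    }


def tfidf_embedding(tokens, vectors, idf):
    """Average of token vectors weighted by normalized TF-IDF scores."""
    counts = Counter(tokens)
    weights = {tok: (c / len(tokens)) * idf[tok] for tok, c in counts.items()}
    norm = sum(weights.values())
    dim = len(next(iter(vectors.values())))
    out = [0.0] * dim
    for tok, w in weights.items():
        for i, x in enumerate(vectors[tok]):
            out[i] += (w / norm) * x
    return out


idf = idf_table(NODE_TEXTS)
naive = mean_embedding(NODE_TEXTS[0], TOKEN_VECTORS)
weighted = tfidf_embedding(NODE_TEXTS[0], TOKEN_VECTORS, idf)
print("naive:   ", naive)
print("weighted:", weighted)
```

In this toy corpus "the" appears in every node, so its IDF is the lowest and the weighted vector for the first node moves toward "gene", whereas the naive average treats both tokens equally.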