rcap107 / embdi

EmbDI is a table embeddings algorithm that solves data integration problems by converting tabular data into graphs, then applying word2vec to the graph to obtain embeddings.
Apache License 2.0
7 stars 1 forks source link

Write a separate main file for training only #7

Open rcap107 opened 6 months ago

rcap107 commented 6 months ago

There should be a main file that can run the walk generation + embeddings training steps of the pipeline without the clutter involved in testing the embeddings.

This main would take a configuration file that contains the walks and embeddings hyperparameters, and produces the embeddings (and possibly walks) file, and nothing else.

rcap107 commented 6 months ago

First implementation in commit f17d3e2bb4e978597b499c245eb04d9b90d857ab