dice-group / dice-embeddings

Hardware-agnostic Framework for Large-scale Knowledge Graph Embeddings
MIT License

Auto KGE Feature #34

Closed Demirrr closed 1 year ago

Demirrr commented 2 years ago

For a given dataset and knowledge graph embedding model, make a sequence of decisions so that our computation and time budgets are fully utilized. Specifically:

  1. Find a combination of batch size and embedding vector size that fully uses the available hardware.
  2. Through validation-set performance, we may study adaptive dropout rates.
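A minimal sketch of the first point, under illustrative assumptions: memory is estimated from the embedding tables plus a per-batch activation term, and the search simply picks the feasible (embedding dim, batch size) pair that uses the most of a given budget. All sizes, candidate grids, and the activation multiplier are hypothetical, not measured from the framework.

```python
# Hypothetical sketch: pick the largest (embedding_dim, batch_size) pair whose
# estimated memory footprint fits a given budget. The 4x activation factor and
# the candidate grids are illustrative assumptions.
BYTES_PER_FLOAT = 4

def estimate_bytes(num_entities, num_relations, embedding_dim, batch_size):
    """Rough memory estimate: embedding tables plus per-batch activations."""
    params = (num_entities + num_relations) * embedding_dim
    activations = batch_size * embedding_dim * 4  # forward + backward buffers (assumed)
    return (params + activations) * BYTES_PER_FLOAT

def best_config(num_entities, num_relations, budget_bytes,
                dims=(32, 64, 128, 256), batches=(256, 1024, 4096, 16384)):
    """Return the feasible (dim, batch) pair that maximizes budget utilization."""
    feasible = [(d, b) for d in dims for b in batches
                if estimate_bytes(num_entities, num_relations, d, b) <= budget_bytes]
    return max(feasible,
               key=lambda db: estimate_bytes(num_entities, num_relations, *db),
               default=None)
```

In a real run the estimate would be replaced by a measured forward/backward pass, as the later workflow in this thread suggests.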
Demirrr commented 2 years ago
  1. Detect the available GPUs and the size of the KG, and move from data parallelism to model parallelism when needed.
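One way this decision could look, as a hedged sketch: if the embedding tables fit comfortably on one GPU, replicate the model and split the batch (data parallelism); otherwise shard the tables across devices (model parallelism). The headroom factor and function name are illustrative assumptions; in PyTorch, `num_gpus` could come from `torch.cuda.device_count()`.

```python
# Hypothetical decision rule, not the framework's actual logic: compare the
# embedding-table footprint against a single GPU's memory to pick a strategy.
def choose_parallelism(num_gpus, embedding_bytes, gpu_memory_bytes):
    """Return 'single', 'data', or 'model' for the given setup."""
    if num_gpus <= 1:
        return "single"
    # Leave headroom for activations and optimizer state (assumed factor of 3).
    if 3 * embedding_bytes <= gpu_memory_bytes:
        return "data"   # replicate the model, split the batch across GPUs
    return "model"      # split the embedding tables across GPUs
```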
Demirrr commented 1 year ago

Update:

Find a combination of the batch size and size of embedding vectors that fully uses the available hardware.

Possible workflow:

  1. Initialize the model and check the CPU usage
  2. Perform a dummy forward and backward pass to measure memory usage.
  3. Assume that (2) is a "good" approximation of the actual memory usage on CPU/GPU.
  4. Increment the batch size until an out-of-memory error occurs.
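The steps above can be sketched as follows: grow the batch size geometrically until a dummy forward/backward step fails, then binary-search the boundary. Here `try_step` is a hypothetical callable that runs one dummy pass at the given batch size and raises `MemoryError` on failure; with PyTorch one would catch `torch.cuda.OutOfMemoryError` instead.

```python
# Sketch of the workflow above, assuming a user-supplied `try_step(batch_size)`
# that raises MemoryError when the batch does not fit in memory.
def find_max_batch_size(try_step, start=1, limit=2**24):
    """Largest batch size for which try_step succeeds, or 0 if none."""
    lo, hi = 0, start
    # Phase 1: double the batch size until a step fails (or the limit is hit).
    while hi <= limit:
        try:
            try_step(hi)
        except MemoryError:
            break
        lo, hi = hi, hi * 2
    else:
        return lo  # never failed within the limit
    # Phase 2: binary-search the exact boundary in (lo, hi).
    while lo + 1 < hi:
        mid = (lo + hi) // 2
        try:
            try_step(mid)
            lo = mid
        except MemoryError:
            hi = mid
    return lo
```

The binary-search phase is optional; stopping at the last successful doubling is cheaper but can leave up to half the feasible batch size unused.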
Demirrr commented 1 year ago

Auto batch finder is available here.