dice-group / dice-embeddings

Hardware-agnostic Framework for Large-scale Knowledge Graph Embeddings
MIT License
50 stars 14 forks source link

Linear batch size finding for Tensor Parallel Training #275

Closed Demirrr closed 6 days ago