Support training, fix some in-place functions in nn

hpcaitech / FastFold

Optimizing AlphaFold Training and Inference on GPU Clusters

Apache License 2.0


Closed Gy-Lu closed 1 year ago

Gy-Lu commented 1 year ago

This PR should be merged after the other two.

Add training code and its shell script; DDP is now supported.
Add lr_scheduler and move loss to hub.
Fix some bugs introduced by OpenFold: the in-place function in dropout and the unused out_product_mean in Evoformer.
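
To illustrate why an in-place dropout can be a problem during training, here is a minimal pure-Python sketch. It is not the FastFold or OpenFold code: `dropout` and `residual_block` are hypothetical helpers that work on plain lists instead of tensors, and the PR does not describe the exact failure it fixed. The sketch only shows the general hazard, which is that an in-place op overwrites a value that a later step (here a skip connection) still needs.

```python
import random


def dropout(values, p, rng, inplace=False):
    """Inverted dropout on a list of floats (toy stand-in for a tensor op)."""
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    # Out-of-place: work on a copy. In-place: overwrite the caller's list.
    out = values if inplace else list(values)
    scale = 1.0 / (1.0 - p)
    for i in range(len(out)):
        out[i] = 0.0 if rng.random() < p else out[i] * scale
    return out


def residual_block(x, p, rng, inplace):
    """Compute x + dropout(x), the pattern that in-place dropout can break."""
    dropped = dropout(x, p, rng, inplace=inplace)
    # With inplace=True, `x` and `dropped` are the same list object, so the
    # skip connection adds the dropped values to themselves.
    return [a + b for a, b in zip(x, dropped)]


if __name__ == "__main__":
    x = [1.0, 2.0, 3.0, 4.0]
    print(residual_block(list(x), 0.5, random.Random(0), inplace=False))
    print(residual_block(list(x), 0.5, random.Random(0), inplace=True))
```

In the out-of-place call every output keeps the original `x[i]` as one of its terms, while in the in-place call the original input is lost. In an autograd framework the analogous mistake tends to show up as wrong gradients or a runtime error about a tensor modified in place.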

As for the unit test, it would cause OOM on the CI machine, so I am trying to develop a tiny model for it.

Shenggan commented 1 year ago

There still seem to be some bugs: