Open liuxm117 opened 2 years ago
It's difficult to say what might be happening here. OpenFold is designed for proteins, so training it on small molecules like this one requires extensive modifications, I imagine. All I can say is that our own trained version of OpenFold has matched AlphaFold's performance on proteins.
Congratulations, hope to see your trained weights soon, 1、what precision are you using,use fp16, loss is always nan 2、In the later stage of training, the loss has been oscillating and no longer decreasing,can you give me some advice,which part or which parameter should I pay attention to
thank you for your reply,I use pdb_mmcif as train dataset, I almost didn't modify the code, just did some memory optimization to speed up the performance,maybe it's because I don't have enough gpus
Oh are you not doing some kind of small molecule conformation prediction? I assumed that you're not doing the standard protein folding task from that image you sent. If you are just doing normal protein folding, I might be able to help you. How many GPUs are you using?
I use pymol to visualization predicted pdb,but it doesn't show anything ,this image is visualization by another tool, I used 24GPUS, Almost 100w samples were trained
How are the metrics looking? What is your current LDDT-Ca?
Here's our validation LDDT curve. The x axis records batches of size 132, so the plateau at 0.8 LDDT is reached after the model's seen approx. 660k proteins. This behavior has been extremely consistent across our training runs. Perhaps you just need to train longer?
due to the limitation of the number of GPUs, I cannot increase the length of batch_size,Could this be the problem? Where did your validation set come from?(cameo? or just a part of trainset)
Yes CAMEO.
Hello @liuxm117 and all!
Can you explain please, how did you use your *.ckpt
files for for prediction? Which script did you use for test with ckpt files?
In Readme is mentioned only about *.pt
.
@gahdritz @liuxm117 Can you explain please, how did you use your .ckpt files for for prediction as a model? Or how can I convert it to .npz? If I try to convert my ckpt I've got a lot of errors about different structure of my ckpt.
hi, I trained openfold and use the ckpt to test sequence, but it was not correct, lots of atoms overlapped together ![Uploading 微信图片_20220525170716.jpg…]()