Inferring super large protein with reduced accuracy

Hi! I'm trying to run inference on a ~5500 aa protein in monomer mode on a single 40 GB card. Sadly, the process halts with an insufficient video memory error, even with the chunk size set to 4 or smaller. I noticed that in your demo, after switching to bf16 (or fp16?), sequences < 8000 aa can be inferred with 40 GB of memory. So how do I switch from tf32 to bf16 (or possibly fp16) to save video memory?
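For context, here is my rough back-of-the-envelope estimate of why I expect the dtype to matter. It is only a sketch under my own assumptions: I assume the pair representation has shape N x N x 128 (the channel count of 128 is my assumption, taken from the AlphaFold 2 architecture, not from FastFold's code), and I count only a single copy of that tensor, ignoring attention intermediates and all other activations. As far as I understand, tf32 tensors are still stored as 4 bytes per element, while bf16 and fp16 use 2. The function name `pair_bytes` is just something I made up for illustration.

```python
# Rough memory estimate for ONE copy of the pair representation.
# Assumptions (mine, not confirmed by FastFold): shape is N x N x c_z with c_z = 128,
# tf32/fp32 tensors occupy 4 bytes per element, bf16/fp16 occupy 2.

def pair_bytes(n_res: int, c_z: int, bytes_per_elem: int) -> int:
    """Bytes needed for a single N x N x c_z tensor."""
    return n_res * n_res * c_z * bytes_per_elem

N_RES = 5500   # sequence length from my run
C_Z = 128      # assumed pair channel dimension

for name, size in [("fp32/tf32", 4), ("bf16/fp16", 2)]:
    gib = pair_bytes(N_RES, C_Z, size) / 2**30
    print(f"{name}: {gib:.2f} GiB per copy of the pair representation")
```

If these assumptions hold, halving the bytes per element halves the footprint of every such tensor, which would match the demo handling longer sequences in bf16.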


hpcaitech / FastFold

Inferring super large protein with reduced accuracy #185