steineggerlab / foldseek

Foldseek enables fast and sensitive comparisons of large structure sets.
https://foldseek.com
GNU General Public License v3.0
819 stars, 101 forks

Retrain Foldseek's VQ-VAE and compile it with the main program. #272

Closed: YuanLiu-SJTU closed this issue 2 months ago

YuanLiu-SJTU commented 6 months ago

I am very interested in your excellent work on Foldseek, and I want to retrain it on my own dataset using your GitHub repository “foldseek-analysis”. Looking at the code, I see that training saves three files: “encoder.pt”, “decoder.pt”, and “states.txt”. How can I use these files together with Foldseek's source code to compile a complete executable? Could you please provide some guidance? Thank you very much.

mvankem commented 5 months ago

Note that the substitution matrix (https://github.com/steineggerlab/foldseek/blob/master/data/mat3di.out) also needs to be changed.
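
For readers wondering what "changing the substitution matrix" involves: a retrained encoder defines a new set of states, so scores derived from the old states no longer apply. The sketch below shows a generic BLOSUM-style log-odds calculation from counts of aligned state pairs. It is an illustration under assumptions, not the procedure used by Foldseek or foldseek-analysis: the function name `log_odds_matrix`, the `scale` and `pseudocount` parameters, and the toy counts are all hypothetical, and the output is a plain list of lists rather than the file format of mat3di.out.

```python
import math


def log_odds_matrix(pair_counts, scale=2.0, pseudocount=1.0):
    """Build a symmetric, rounded log-odds score matrix from aligned state-pair counts.

    pair_counts[i][j] is how often state i was aligned to state j in a set of
    trusted structural alignments (hypothetical input; you must produce it yourself).
    """
    n = len(pair_counts)
    # Symmetrise: observing (i, j) is also evidence for (j, i).
    sym = [
        [pair_counts[i][j] + pair_counts[j][i] + pseudocount for j in range(n)]
        for i in range(n)
    ]
    total = sum(sum(row) for row in sym)
    # Joint probability of seeing states i and j aligned.
    joint = [[value / total for value in row] for row in sym]
    # Background probability of each state (row marginal of the joint).
    background = [sum(row) for row in joint]
    # Score = scale * log2(observed / expected by chance), rounded to an integer.
    return [
        [
            round(scale * math.log2(joint[i][j] / (background[i] * background[j])))
            for j in range(n)
        ]
        for i in range(n)
    ]


# Toy example with two states that mostly align to themselves.
toy_counts = [[90, 10], [10, 90]]
toy_matrix = log_odds_matrix(toy_counts, pseudocount=0.0)
for row in toy_matrix:
    print(row)
```

In a real retraining run the counts would cover every state of the new alphabet, and the resulting scores would have to be written in whatever layout Foldseek expects before rebuilding.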