y-hwang / gLM

Genomic language model predicts protein co-regulation and function
https://www.biorxiv.org/content/10.1101/2023.04.07.536042v3

How to generate files such as "operon.annot", "norm.pkl", "pca.pkl" and "operon_predictor.pkl" #11

Open Jigyasa3 opened 3 weeks ago

Jigyasa3 commented 3 weeks ago

Hi, I am interested in generating the following files:

  1. "operon.annot": It looks like it's a protein ID taken from a Prokka annotation? I am looking at non-model strains, and most of their genes are hypothetical. Will it still work?

  2. Can the files "norm.pkl", "pca.pkl" and "operon_predictor.pkl" be used for any genome of interest?