GT4SD / gt4sd-core

GT4SD, an open-source library to accelerate hypothesis generation in the scientific discovery process.
https://gt4sd.github.io/gt4sd-core/
MIT License
333 stars 69 forks source link

Enzeptional No Unmasking model loaded. #236

Closed Allostelic closed 5 months ago

Allostelic commented 5 months ago

I encountered an error while running the example_enzeptional.py script, which prevents the unmasking model from being loaded correctly. This issue seems to affect the optimization process, as the algorithm cannot effectively perform due to the missing model component.

The script fails to load the unmasking model and outputs the following error message:

ERROR:gt4sd.frameworks.enzeptional.processing: No Unmasking model loaded. Check your model inputs

It seems there might be a misconfiguration or path issue that prevents the unmasking model from being recognized by the system. Verifying the paths and configurations in the mutation_config might provide a solution.

drugilsberg commented 5 months ago

@Allostelic thanks for reporting this. @yvesnana the main maintainer is currently on holiday, but we will look into it as soon as he is back the coming week.

yvesnana commented 5 months ago

@Allostelic thank you for reporting this. I've looked into this and performed some tests on my end, and everything seems to be working correctly. However, to better understand and resolve the problem you're facing, I'd appreciate if you could provide me with a few more details: What operating system are you using (e.g., Windows, macOS, Linux)? and Can you please share the full traceback of the error message you're encountering? This will help me better understand the context of the issue.

Allostelic commented 5 months ago

Thank you for your reply. I have created and installed a new environment using Linux Anaconda. I was also interested in Kcat, so I followed the README, modified the folder in line 13 of Example_enzeptional.py from feasibility to kcat and ran the script. As a result, several errors appeared.

In case you are wondering, the array optimisation could be executed as it was with feasibility.

The errors are as follows. I tried it with several XGBoost versions, but none of them worked.

Some weights of EsmModel were not initialized from the model checkpoint at facebook/esm2_t33_650M_UR50D and are newly initialized: ['esm.pooler.dense.weight', 'esm.pooler.dense.bias'] You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference. ERROR:gt4sd.frameworks.enzeptional.processing:No Unmasking model loaded. Check you model inputs Some weights of the model checkpoint at facebook/esm2_t33_650M_UR50D were not used when initializing EsmForMaskedLM: ['esm.contact_head.regression.bias', 'esm.contact_head.regression.weight']

yvesnana commented 5 months ago

@Allostelic Thank you for your detailed report on the issues you encountered while running the Example_enzeptional.py script. I have addressed these problems and updated the example file as well as Enzeptional's core.py module.

The file now correctly handles the Kcat functionality and should run without the errors you previously encountered. Please feel free to try the updated example script and let me know if you encounter any further problems.