vborincenturion / feeds

Food wastE biopEptiDe claSsifier: from microbial genome to biopeptides function
GNU General Public License v3.0
1 stars 0 forks source link

ids does not correspond to any enzyme #1

Open Javi-Rop opened 6 months ago

Javi-Rop commented 6 months ago

I hope this message finds you well. First and foremost, I wanted to express my deepest gratitude for the incredible work you've put into developing [nombre del proyecto]. It's been an invaluable tool for my research, and I've truly appreciated the effort and dedication you've poured into it.

I wanted to share my experience using the tool and provide some feedback on the process. I recently downloaded the genome of a bacterium in .fna format and a substrate with the proteome in .faa format. Following the instructions provided in the repository, I executed the script with the following command: python feeds.py -t 4 -k bacteria -f_length 100 -d s

However, during execution, I encountered some errors related to enzyme IDs. Specifically, I received an "Entry Error" message indicating that certain IDs did not correspond to any enzyme. Upon further investigation, I noticed that IDs '48', '49' and '51' were causing problems.

Despite facing these challenges, I remain excited about the potential of this project and eager to continue exploring its capabilities. If there are any additional troubleshooting steps or suggestions you can provide to fix these errors, I would greatly appreciate it.

Once again, thank you for your hard work and dedication to the advancement of scientific research. Your contributions have had a significant impact and I am truly grateful for the opportunity to use such a valuable tool.

vborincenturion commented 6 months ago

Dear Javi-Rop,

Thank you for your message and kind words regarding FEEDS. I'm glad to hear that the tool has been valuable for your research, and I appreciate your feedback.

To better assist you, could you provide more information about the specific error message you received? Additionally, if possible, could you share the FEEDS output, genome and the substrate files? This would help me to replicate the issue and provide more targeted troubleshooting steps.

Javi-Rop commented 6 months ago

¡Thank you for your prompt response and willingness to help! I really appreciate your interest in addressing the issues I've encountered with FEEDS.

The input genome and substrate files I've used are as follows:

Genome: Pseudomonas putida, GCA_000412675.1_ASM41267v1_genomic.fna Substrate: Sus scrofa, protein.faa The error I've been encountering is related to enzyme IDs during script execution: image

I've noticed that the RPG library has 46 available enzymes (https://rapid-peptide-generator.readthedocs.io/en/latest/enzymes.html#enzymes). New enzymes can be added using the "rpg -a" command (I still need to investigate this).

For now, I've managed to solve the problem by removing IDs greater than 46 in the extract_rpg_cs_values function, although I don't believe this is the correct approach.

I greatly appreciate your assistance and guidance.

vborincenturion commented 6 months ago

Hi Javi,

Did you locate and replace the "rpg_user.py" file in your RapidPeptideGenerator tool with the one in the db folder of this GitHub repository? I added other enzymes to the RapidPeptideGenerator tool, so I think this might be why you encountered errors.

Let me know if the error persists.

All the best.

Javi-Rop commented 6 months ago

Hi there,

Thank you for your prompt response and suggestion. I've been checking my directory ./miniforge3/envs/feeds/lib/python3.9/site-packages/rpg, but I couldn't find a file named "rpg_user.py". However, I've noticed the presence of a very similar file called "enzymes_definition.py". It seems like it could be an updated or renamed version of the file you mentioned. I'll go ahead and replace it with the file from the "db" folder of your repository and will let you know about any relevant results or changes.

Additionally, I'd like to bring to your attention another error I've encountered. It seems to be related to the BioPython library, specifically in the molecular_weight function where an invalid unambiguous letter for protein, 'X', is encountered. I'm hopeful that updating the enzyme definitions might address this issue as well.

Thanks again for your help and guidance.

Best regards.