Closed HobnobMancer closed 1 year ago
Hi, when testing the tool I noticed that I had problems at this step and after searching I noticed that the requests via: get_uniprot_accessions() from https://github.com/HobnobMancer/saintBioutils/blob/master/saintBioutils/uniprot/__init__.py were failing. Apparently the UniProt API has recently changed. Maybe this is helpful for replacing the queries: https://github.com/multimeric/Unipressed
Hi,
Thanks for using cazy_webscraper
!
I found the cause of the issue a couple of weeks back. It wasn't with saintBioutils
, the minimum required version of bioservices
needed to be updated - but I forgot to document this here, so my bad!
The Fix
If you install the latest version of bioservices
then cazy_webscraper
will be able to communicate with the new UniProt API.
The required bioservices
version will be updated shortly.
In the next couple of weeks, we will also altering how the cazy_webscraper
links NCBI protein version accessions to their corresponding record in UniProt. A more robust method for identifying records that are related (i.e. linking a NCBI protein record to it's corresponding UniProt record) is planned to be available in 2.2.4.
Thanks for the quick response.
Are you sure that the issue is related to the bioservices version? I had 1.10.4 (the latest?) installed.
I went through the script:
https://github.com/HobnobMancer/cazy_webscraper/blob/master/cazy_webscraper/expand/uniprot/get_uniprot_data.py
and where it is failing for me is at the EMBL to Uniprot accessions mapping step through saintBioutils
. The returned uniprot_gkb_dict
is empty in:
https://github.com/HobnobMancer/cazy_webscraper/blob/master/cazy_webscraper/expand/uniprot/get_uniprot_data.py#L180
When testing the other script it was failing at the request (or L94):
https://github.com/HobnobMancer/saintBioutils/blob/master/saintBioutils/uniprot/__init__.py#L98
This issues should now be fixed in v2.2.3 - PR #103
Describe the bug
When using
cw_get_uniprot_data
to retrieve data from UniProt, no data is retrieved and added to the local CAZyme databaseTo Reproduce
cazy_webscraper <email> -o cazy.db
cw_get_uniprot_data cazy.db --families 20 --pdb
No data is retrieved from UniProt.
Expected behavior
Retrieve data from UniProt and add to the local CAZyme database