Closed lonsbio closed 5 years ago
Yes, looks like the pagination changed a bit. I did a quick fix using regular expressions #5 and it should work fine now. Thanks for opening this issue.
Thanks! I tried my own patch overnight (not as elegant) and it seemed to work too.
Also, I'm not sure if this is a recent issue or incidental. My DB download file seems to have newlines surrounding the organism field:
domain protein_name family tag organism_code ec genbank uniprot subfamily organism pdb
Ahos_0285 GH1 invalid AEE93176.1
Acidianus hospitalis W1
Fixing it does't seem to effect the extract script, but does make the csv (tsv) file readable. Is the wrapping intentional?
Unable to create database on Python 2.7.13. Output (exlcucing BeautifulSoup warning) as follows:
then error
Has the pagination code changed for the expression to fail?