Closed cthoyt closed 7 years ago
The final portion is the 4th level, which refers to actual proteins that have the activities in the first 3 levels from the EXPASY tree. You can go through UniProt to get these annotations and use text parsing to infer the parents.
http://pyuniprot.readthedocs.io/en/latest/query_functions.html#ec-number
import pyuniprot
query = pyuniprot.query()
for ec in query.ec_number():
# get actual EC number, infer the parent, and the annotated children
i am gonna query only humans as it takes forever and has a memory leak
which part? does it not provide a lazy iterator over the database?
because we fixed the memory leak with loading and now it only uses 117mb total :)
it's looking fine now. seems to be working.. have to wait
It's directly possible to iterate over all instances of each enzyme class through PyUniProt.