Is your feature request related to a problem? Please describe.
Currently, users have to parse the annotated file to subselect structures that incorporate domains of interest. This should be integrated directly into integration, where the full annotation file and a subset are included. However, right now, we don't include InterPro ID and so we rely on a text search for filtering. This is not ideal as we may have subsets "like PH" would hit "PH" and "Phe_zip".
Describe the solution you'd like
A more integrated pipeline to produce the structure reference file using InterPro ID as the filtering step.
Tasks
[x] Add InterPro ID's to the text field of domains covered in a structure
[x] Add filtering to produce the subset file based on the IPR ID of interest.
Is your feature request related to a problem? Please describe. Currently, users have to parse the annotated file to subselect structures that incorporate domains of interest. This should be integrated directly into integration, where the full annotation file and a subset are included. However, right now, we don't include InterPro ID and so we rely on a text search for filtering. This is not ideal as we may have subsets "like PH" would hit "PH" and "Phe_zip".
Describe the solution you'd like A more integrated pipeline to produce the structure reference file using InterPro ID as the filtering step.
Tasks