The current code in #30 can be configured to generate classes for only a given set of `tax_id`s. By adding one more pass through `nodes.dmp`, we could find all the ancestors of that set of `tax_id`s, then generate a subset from the combined set. I expect that would be more efficient than extracting from the `.obo` file, as is done here: https://github.com/obophenotype/ncbitaxon/blob/master/subsets/Makefile#L8
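To make the idea concrete, here is a rough sketch of the ancestor lookup. It is not the code in #30: the function names `read_parents` and `with_ancestors` are hypothetical, the sample rows are toy data rather than real taxonomy, and I am assuming the first two `|`-delimited fields of `nodes.dmp` are the `tax_id` and its parent `tax_id`, with the root listed as its own parent.

```python
import io


def read_parents(lines):
    """Map each tax_id to its parent tax_id from nodes.dmp-style lines."""
    parents = {}
    for line in lines:
        # Fields are separated by "|" with tab padding; only the first two are needed.
        fields = [field.strip() for field in line.split("|")]
        if len(fields) < 2 or not fields[0]:
            continue
        parents[fields[0]] = fields[1]
    return parents


def with_ancestors(parents, tax_ids):
    """Return the given tax_ids plus every ancestor reachable through parents."""
    keep = set()
    for tax_id in tax_ids:
        current = tax_id
        # Stop at an unknown ID, or at a node already visited. The second
        # condition also ends the walk at the root, which is its own parent.
        while current in parents and current not in keep:
            keep.add(current)
            current = parents[current]
    return keep


# Toy rows in the nodes.dmp layout (hypothetical IDs, not real taxonomy).
SAMPLE = (
    "1\t|\t1\t|\tno rank\t|\n"
    "10\t|\t1\t|\tsuperkingdom\t|\n"
    "11\t|\t1\t|\tsuperkingdom\t|\n"
    "100\t|\t10\t|\tspecies\t|\n"
    "101\t|\t11\t|\tspecies\t|\n"
)

parents = read_parents(io.StringIO(SAMPLE))
subset = with_ancestors(parents, {"100"})
print(sorted(subset, key=int))
```

The resulting set could then be passed to the existing filter in place of the original `tax_id` list. This version holds the whole parent map in memory, which is a simplification on my part; a streaming variant may be preferable for the full dump.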