Closed cmungall closed 2 days ago
Thanks @cmungall! The current pattern in the Bioregistry
^WB[A-Z][a-z]+\d+$
implicitly covers this part
WB(Gene|Var|RNAi|Transgene)[0-9]{8}
of the GO registry's pattern, so it looks like adding CE[0-9]{5}
is the only thing that would be needed here.
The patterns do have some added value for validation (is the ID valid?) / discovery (what kind of ID is this?) use cases which I think Bioregistry users are often faced with so I would rather extend the pattern than revert it to something completely generic.
bioregistry complains about this https://bioregistry.io/reference/wormbase:/CE00274
but this is a valid wormbase protein ID
The GO registry has info that could be used here but IMO bioregistry could just not bother having a regex for many local IDs, it's hard to keep them up to date