bridgedb / datasources

Repository with the BridgeDb data source.
Creative Commons Zero v1.0 Universal

Unrecognised HGNC URI pattern #6

Closed ianwdunlop closed 3 months ago

ianwdunlop commented 6 years ago

The pattern http://identifiers.org/hgnc/HGNC%253A29350 was found in a linkset but was not recognized by BridgeDb. However, it does redirect to the correct place, which suggests that identifiers.org does recognize it. So either the pattern needs to be added to BridgeDb or the linkset file needs to be changed. The URI looks like it has been escaped incorrectly: `%25` is the percent-encoding of `%`, so `%253A` decodes to `%3A`, which in turn decodes to `:`.
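
For illustration only, here is a minimal sketch of how percent-encoding an identifier twice could produce this pattern. It uses Python's standard `urllib.parse` and assumes the identifier was escaped once and then escaped again; the thread does not confirm where in the pipeline that happened.

```python
from urllib.parse import quote, unquote

identifier = "HGNC:29350"

# First escape: ':' becomes '%3A'.
once = quote(identifier, safe="")

# Second escape of the already-escaped string: '%' becomes '%25',
# so '%3A' turns into '%253A'.
twice = quote(once, safe="")

print(once)
print(twice)

# Decoding twice recovers the original identifier.
print(unquote(unquote(twice)))
```

If this is what happened, decoding the URI from the linkset once yields the single-escaped form, and decoding it twice yields the plain `HGNC:29350` identifier.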

egonw commented 6 years ago

@ianwdunlop it comes from the http://bridgedb.org/data/linksets/release87/MusMusculus/Ensembl_Mm_dataset.void.ttl file, right?

@JonathanMELIUS, can you please check that file for HGNC%253Axxxx identifiers, and see whether your code introduces that double-escaped ':' and how we can fix it?
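
A rough sketch of how such a check could be done, assuming the linkset content is available as plain text. The function name `find_double_escaped` and the sample string are hypothetical, and this is not the code that generates the linksets. Whether BridgeDb expects the single-escaped or the unescaped form is not stated in this thread, so the sketch only undoes one level of escaping.

```python
import re
from urllib.parse import unquote

# Matches identifiers.org HGNC URIs whose ':' has been escaped twice.
DOUBLE_ESCAPED = re.compile(r"http://identifiers\.org/hgnc/HGNC%253A(\d+)")


def find_double_escaped(text):
    """Return (found URI, URI with one escaping level removed) pairs."""
    pairs = []
    for match in DOUBLE_ESCAPED.finditer(text):
        uri = match.group(0)
        # One unquote pass turns '%253A' back into '%3A'.
        pairs.append((uri, unquote(uri)))
    return pairs


# Made-up sample text: the first URI is the one reported in this issue,
# the second is a hypothetical single-escaped URI that should not match.
sample = (
    "<http://identifiers.org/hgnc/HGNC%253A29350> "
    "<http://identifiers.org/hgnc/HGNC%3A5> ."
)

for found, repaired in find_double_escaped(sample):
    print(found, "->", repaired)
```

To scan a real linkset, the file contents would be read into a string and passed to `find_double_escaped` in place of `sample`.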

ianwdunlop commented 6 years ago

I found the pattern in http://bridgedb.org/data/linksets/release87/MusMusculus/Ensembl_Mm_hgnc.projection.LS.ttl. It may be in other files too; I haven't looked.

egonw commented 1 year ago

Bioregistry/Identifiers.org (via Bioregistry):

BridgeDb has both.

egonw commented 3 months ago

Closing this for now. So much has changed, and the IMS currently needs a lot of attention anyway, but still no funding for this.