OntoZoo / ontobee

Ontobee is a linked data server for ontologies. See: http://www.ontobee.org.
27 stars 5 forks source link

Schema changed, breaking our code #200

Closed jwodder closed 2 months ago

jwodder commented 3 months ago

We have a program that makes queries to URLs like http://purl.obolibrary.org/obo/NCBITaxon_10090 and extracts data from the first Class element satisfying some condition. However, the responses from these URLs recently changed so that there are no longer any Class elements; the elements seem to have been renamed to rdf:Description. Why is this happening? Is this change intentional?

CC @yarikoptic @satra

satra commented 3 months ago

@yongqunh - it looks like the xml representation has changed for the response. more generally we are interested in the details of a given URI. is there a better way to programmatically query the details instead of doing a GET on the ontobee service?

yongqunh commented 3 months ago

There appear to be some bugs during our recent server updating. We are now doing the debugging ... Sorry for the inconvience.

yarikoptic commented 2 months ago

Sorry to pester -- any progress @yongqunh ?

yongqunh commented 2 months ago

Sorry for the delayed reply. I would like to know more about your question. You said that the responses from these URLs recently changed so no Class elements any more, and the elements seem to have been renamed to rdf:Description. Actually, I don't know what happen. Can you provide more details about this? Note that we don't change (in any purpose) the Ontobee code; we only did some debugging to make it work, which may accidently change something, though. So I would like to know more about your concern. Can you provide more information? Thanks.

jwodder commented 2 months ago

@yongqunh The problem seems to have been resolved at some point; I can't tell when.

At the moment, a request to, say, http://purl.obolibrary.org/obo/NCBITaxon_10090 returns the following, which has the schema we want:

```xml Mus musculus NCBITaxon:85055 house mouse ncbi_taxonomy mouse GC_ID:1 Mus <subgenus> species ```

As of March 25, requests to the same URL were instead returning the following, which broke our code:

```xml ]> Mus musculus NCBITaxon:85055 GC_ID:1 mouse house mouse ncbi_taxonomy Mus <subgenus> species ```
yongqunh commented 2 months ago

I see. I am glad that this problem has been resolved mysteriously :-). It appears that the previous broken code has a slightly different format. It is possible that we did some debugging earlier for some debugging purpose, and their debugging resolved this issue without our notice. Anyway, it appears to be a good result. Please let me know if you have more concern. Otherwise, we may close this issue. Thanks!

yarikoptic commented 2 months ago

FWIW, I have now added (after fixing for allowing larger number of redirects, see #201) a check for the target content in such a response: https://github.com/dandi/upptime/blob/master/.upptimerc.yml#L48

  - name: PURL redirect example1 (ontobee)
    url: http://purl.obolibrary.org/obo/PATO_0000384
    maxRedirects: 6
    __dangerous__body_down_if_text_missing: '<Class rdf:about="http://purl.obolibrary.org/obo/PATO_0001894">'

anyone ( @yongqunh ) wants to be added as a target for alerts happen such a test (or also ontobee just going down) fails? ;-)

yongqunh commented 2 months ago

Sounds good. Is it monitored automatically? How it works? Thanks.

yarikoptic commented 2 months ago

Ok, I will add you ;-)

Yes, automatically, see https://github.com/upptime/upptime project which pretty much provides a number of ci jobs which periodically hit the prescribed urls to get timing/availability information and then render time trends in the GitHub repo. Also it takes nice care about opening issue when resource disappears, it closing it when it reemerges

ontoden commented 2 months ago

Great. Thanks!

From: Yaroslav Halchenko @.> Sent: Wednesday, April 17, 2024 12:17 AM To: OntoZoo/ontobee @.> Cc: Subscribed @.***> Subject: Re: [OntoZoo/ontobee] Schema changed, breaking our code (Issue #200)

External Email - Use Caution

Ok, I will add you ;-)

Yes, automatically, see https://github.com/upptime/upptime project which pretty much provides a number of ci jobs which periodically hit the prescribed urls to get timing/availability information and then render time trends in the GitHub repo. Also it takes nice care about opening issue when resource disappears, it closing it when it reemerges

- Reply to this email directly, view it on GitHubhttps://github.com/OntoZoo/ontobee/issues/200#issuecomment-2060312780, or unsubscribehttps://github.com/notifications/unsubscribe-auth/ABPQBGKN4X7S3SYHBQ3STJ3Y5XZTHAVCNFSM6AAAAABFHKW44WVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDANRQGMYTENZYGA. You are receiving this because you are subscribed to this thread.Message ID: @.**@.>>


Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues

yarikoptic commented 2 months ago

did assignment of @yongqunh in https://github.com/dandi/upptime/commit/702fe0331f144667fd0a716cb812e76d0b545396

Feel welcome to submit PR if want to have others added. I guess we could consider this issue resolved for now AFAIK ;)

yongqunh commented 2 months ago

Great. I will then close this issue for now. Thanks.