glygener / glygen-issues

Repository for public GlyGen tickets
GNU General Public License v3.0
0 stars 0 forks source link

About Umaka-YummyData #1130

Open ReneRanzinger opened 8 months ago

ReneRanzinger commented 8 months ago

Hi Raja,

We DBCLS provide a SPARQL endpoint monitoring service called Umaka-YummyData ( https://yummydata.org/ , https://doi.org/10.1093/database/bay022 ). Its crawler issues a seriese of SPARQL queries to SPARQL endpoints to obtain their statuses. One day it seemed to be banned for it to access to the GlyGen server ( http://sparql.glygen.org:8880/sparql ) because I assume it gave the server heavy work loads. The purpose of issuing queries is to get endpoint statuses and make the results publicly available. So, we don't have any intention to make your server too busy, and it would be glad to us if you would permit its access. If there is any condition that we have to follow, please let me know. The crawler issues SPARQL queries with the following User-Agent value: User-Agent: Umaka-Crawler/1.0.0 by DBCLS ([umakadata@dbcls.jp](mailto:umakadata@dbcls.jp))

Best regards, Yasunori

ReneRanzinger commented 8 months ago

Dear Yasunori, Thank you for letting us know, and thank you for using GlyGen. We have no intention of banning valid users. As a bioinformatics resource, we want to follow what others are doing. Below is the NCBI recommendation. "NCBI recommends that users post no more than three URL requests per second and limit large jobs to either weekends or between 9:00 PM and 5:00 AM Eastern time during weekdays. Failure to comply with this policy may result in an IP address being blocked from accessing NCBI." We most likely are using something similar at our end. Cc-ing our University IT to confirm if my understanding is correct. Also, cc-ing few others at our end from GlyGen so that they can follow up.

ReneRanzinger commented 7 months ago

Dear Raja,

Thank you for your following up, and it seems that the rejction from our YummyData host remains in effect.

Sincerely, Yasunori