Closed jeanetteclark closed 6 months ago
I think not following the redirect would defeat the purpose of the check. However, have you tried masquerading as a browser user agent and see if Wiley lets the redirect go through instead of returning a 503? You could check this with curl pretty quickly, especially if you use the Firefox "Copy as curl" command that captures all of the cookies and other paraphernalia sent by the browser to the server.
@mbjones I have already tested that and it works, but I thought we had decided a while back that it was not good practice to have metadig-engine impersonate a browser. Let me know if I am misremembering that
after some discussion, I'll implement and test the following:
try with a regular UA field identifying us, if it passes then PASS, if it fails with a 4xx code then FAIL; if it fails with a 5xx code, then retry with a browser user agent, and if it succeeds put it as PASS, if it fails then put it as a FAIL
I'll also add some text to the output message indicating what happened. Finally, we can change this check to informational to keep it from affecting the score
unfortunately, after testing I wasn't able to find any way to impersonate a web browser well enough to trick the wiley page into letting me through. After consulting with @JEDamerow I removed the check from the ESS-DIVE suite
Description
Taken from #437, here is a summary of what happens with this check:
The code that runs the above is actually in metadig-py, here
Solution
@JEDamerow brought this to my attention a while back, and we decided a middle ground was to make the check optional. It sounds like now the preference is to remove it all together.
The only other option that I can see is to alter the check procedure for DOI type URLs to not follow the redirect and assume that the follow will work correctly. Thoughts on that @mbjones?