Closed jburel closed 3 years ago
This issue also affects the Bio-Formats documentation job (which also points to the perkinelmer website) and my attempt to use the latest released Sphinx (4.1.2) did not suffice to fix the issue - see https://merge-ci.openmicroscopy.org/jenkins/job/BIOFORMATS-linkcheck/544/console.
It looks like the PerkinElmer has gone through some changes and while it is available from the browser, programmatic access seems to be blocked server-side.
curl(base) sbesson@ls30630:~ $ curl -IL https://www.perkinelmer.com
HTTP/1.1 302 Found
Cache-Control: private
Content-Length: 158
Content-Type: text/html; charset=utf-8
Location: /corporate/error/500.html?aspxerrorpath=/
Server: Microsoft-IIS/8.5
X-AspNetMvc-Version: 5.2
X-AspNet-Version: 4.0.30319
X-Powered-By: ASP.NET
Access-Control-Allow-Origin: *
Access-Control-Allow-Headers: X-fancyBox,X-Requested-With
Content-Security-Policy: default-src 'self' *.perkinelmer.com scoremodel-perkin-elmer.pantheonsite.io fonts.googleapis.com https://d3d9hv42w7vz9m.cloudfront.net https://ips-invite.iperceptions.com https://www.googletagmanager.com/ https://www.youtube.com/ https://www.youtube.com/iframe_api https://www.google-analytics.com/ https://snap.licdn.com/ https://script.crazyegg.com/ https://connect.facebook.net https://platform.twitter.com https://img.en25.com https://s.yimg.com https://www.googleadservices.com https://bat.bing.com https://tag.demandbase.com https://static.ads-twitter.com/ https://googleads.g.doubleclick.net https://pixel.sitescout.com https://tracking.crazyegg.com https://www.facebook.com https://px.ads.linkedin.com https://id.rlcdn.com https://t.co https://api.company-target.com https://match.prod.bidr.io https://www.google.com https://www.google.de https://analytics.twitter.com https://segments.company-target.com https://tracking.crazyegg.com https://scoremodel-perkin-elmer.pantheonsite.io https://s1674556495.t.eloqua.com/ https://syndication.twitter.com resources.perkinelmer.com gateway.zscalertwo.net https://cdnapisec.kaltura.com;script-src 'self' 'unsafe-inline' 'unsafe-eval' *.visualwebsiteoptimizer.com gateway.zscalertwo.net *.perkinelmer.com *.kaltura.com http://*.kaltura.com hm.baidu.com use.fontawesome.com *.googletagmanager.com www.google-analytics.com script.crazyegg.com connect.facebook.net www.youtube.com img.en25.com www.google.com tag.demandbase.com snap.licdn.com platform.twitter.com s.yimg.com img04.en25.com www.googleadservices.com bat.bing.com sp.analytics.yahoo.com static.ads-twitter.com js-agent.newrelic.com analytics.twitter.com bam.nr-data.net googleads.g.doubleclick.net *.cloudfront.net *.google-analytics.com *.analytics.edgekey.net scoremodel-perkin-elmer.pantheonsite.io *.linkedin.com platform.linkedin.com ips-invite.iperceptions.com syndication.twitter.com cdn.syndication.twimg.com;style-src 'self' 'unsafe-inline' gateway.zscalertwo.net *.perkinelmer.com fast.fonts.net *.fontawesome.com fonts.googleapis.com code.jquery.com cdnjs.cloudflare.com *.cloudfront.net *.twitter.com scoremodel-perkin-elmer.pantheonsite.io;img-src 'self' 'unsafe-inline' *.visualwebsiteoptimizer.com *.perkinelmer.com *.kaltura.com *.baidu.com sp.analytics.yahoo.com cdnjs.cloudflare.com ad.doubleclick.net *.twitter.com cdn.syndication.twimg.com id.rlcdn.com match.prod.bidr.io www.facebook.com pixel.sitescout.com segments.company-target.com *.googletagmanager.com www.google.de www.google.com www.google.co.in s1674556495.t.eloqua.com ssl.google-analytics.com *.linkedin.com *.twimg.com bat.bing.com p.adsymptotic.com *.cloudfront.net t.co stats.g.doubleclick.net data: scoremodel-perkin-elmer.pantheonsite.io p.adsymptotic.com;connect-src gateway.zscalertwo.net *.visualwebsiteoptimizer.com 'self' *.perkinelmer.com *.kaltura.com www.google-analytics.com api.company-target.com script.crazyegg.com bat.bing.com s.yimg.com bam.nr-data.net segments.company-target.com s1674556495.t.eloqua.com *.cloudfront.net scoremodel-perkin-elmer.pantheonsite.io tracking.crazyegg.com stats.g.doubleclick.net linkedin.com *.analytics.edgekey.net;media-src 'self' *.kaltura.com *.cloudfront.net *.perkinelmer.com gateway.zscalertwo.net scoremodel-perkin-elmer.pantheonsite.io blob:;font-src 'self' *.perkinelmer.com fonts.gstatic.com fast.fonts.net *.cloudfront.net *.fontawesome.com cdnapisec.kaltura.com data: scoremodel-perkin-elmer.pantheonsite.io;worker-src 'self' *.perkinelmer.com gateway.zscalertwo.net blob:;frame-src *.youtube.com app.fluorofinder.com big.d.doubleclick.net gateway.zscalertwo.net *.facebook.com platform.twitter.com *.kaltura.com *.perkinelmer.com
Strict-Transport-Security: max-age=60
Date: Tue, 27 Jul 2021 09:37:30 GMT
Set-Cookie: BIGipServer~PE-PROD~perkinelmer-com_https=1059543205.47873.0000; path=/; Httponly; Secure; SameSite=none
HTTP/1.1 500 Internal Server Error
Cache-Control: private
Content-Length: 1763
Content-Type: text/html; charset=utf-8
Server: Microsoft-IIS/8.5
X-AspNet-Version: 4.0.30319
X-Powered-By: ASP.NET
Access-Control-Allow-Origin: *
Access-Control-Allow-Headers: X-fancyBox,X-Requested-With
Content-Security-Policy: default-src 'self' *.perkinelmer.com scoremodel-perkin-elmer.pantheonsite.io fonts.googleapis.com https://d3d9hv42w7vz9m.cloudfront.net https://ips-invite.iperceptions.com https://www.googletagmanager.com/ https://www.youtube.com/ https://www.youtube.com/iframe_api https://www.google-analytics.com/ https://snap.licdn.com/ https://script.crazyegg.com/ https://connect.facebook.net https://platform.twitter.com https://img.en25.com https://s.yimg.com https://www.googleadservices.com https://bat.bing.com https://tag.demandbase.com https://static.ads-twitter.com/ https://googleads.g.doubleclick.net https://pixel.sitescout.com https://tracking.crazyegg.com https://www.facebook.com https://px.ads.linkedin.com https://id.rlcdn.com https://t.co https://api.company-target.com https://match.prod.bidr.io https://www.google.com https://www.google.de https://analytics.twitter.com https://segments.company-target.com https://tracking.crazyegg.com https://scoremodel-perkin-elmer.pantheonsite.io https://s1674556495.t.eloqua.com/ https://syndication.twitter.com resources.perkinelmer.com gateway.zscalertwo.net https://cdnapisec.kaltura.com;script-src 'self' 'unsafe-inline' 'unsafe-eval' *.visualwebsiteoptimizer.com gateway.zscalertwo.net *.perkinelmer.com *.kaltura.com http://*.kaltura.com hm.baidu.com use.fontawesome.com *.googletagmanager.com www.google-analytics.com script.crazyegg.com connect.facebook.net www.youtube.com img.en25.com www.google.com tag.demandbase.com snap.licdn.com platform.twitter.com s.yimg.com img04.en25.com www.googleadservices.com bat.bing.com sp.analytics.yahoo.com static.ads-twitter.com js-agent.newrelic.com analytics.twitter.com bam.nr-data.net googleads.g.doubleclick.net *.cloudfront.net *.google-analytics.com *.analytics.edgekey.net scoremodel-perkin-elmer.pantheonsite.io *.linkedin.com platform.linkedin.com ips-invite.iperceptions.com syndication.twitter.com cdn.syndication.twimg.com;style-src 'self' 'unsafe-inline' gateway.zscalertwo.net *.perkinelmer.com fast.fonts.net *.fontawesome.com fonts.googleapis.com code.jquery.com cdnjs.cloudflare.com *.cloudfront.net *.twitter.com scoremodel-perkin-elmer.pantheonsite.io;img-src 'self' 'unsafe-inline' *.visualwebsiteoptimizer.com *.perkinelmer.com *.kaltura.com *.baidu.com sp.analytics.yahoo.com cdnjs.cloudflare.com ad.doubleclick.net *.twitter.com cdn.syndication.twimg.com id.rlcdn.com match.prod.bidr.io www.facebook.com pixel.sitescout.com segments.company-target.com *.googletagmanager.com www.google.de www.google.com www.google.co.in s1674556495.t.eloqua.com ssl.google-analytics.com *.linkedin.com *.twimg.com bat.bing.com p.adsymptotic.com *.cloudfront.net t.co stats.g.doubleclick.net data: scoremodel-perkin-elmer.pantheonsite.io p.adsymptotic.com;connect-src gateway.zscalertwo.net *.visualwebsiteoptimizer.com 'self' *.perkinelmer.com *.kaltura.com www.google-analytics.com api.company-target.com script.crazyegg.com bat.bing.com s.yimg.com bam.nr-data.net segments.company-target.com s1674556495.t.eloqua.com *.cloudfront.net scoremodel-perkin-elmer.pantheonsite.io tracking.crazyegg.com stats.g.doubleclick.net linkedin.com *.analytics.edgekey.net;media-src 'self' *.kaltura.com *.cloudfront.net *.perkinelmer.com gateway.zscalertwo.net scoremodel-perkin-elmer.pantheonsite.io blob:;font-src 'self' *.perkinelmer.com fonts.gstatic.com fast.fonts.net *.cloudfront.net *.fontawesome.com cdnapisec.kaltura.com data: scoremodel-perkin-elmer.pantheonsite.io;worker-src 'self' *.perkinelmer.com gateway.zscalertwo.net blob:;frame-src *.youtube.com app.fluorofinder.com big.d.doubleclick.net gateway.zscalertwo.net *.facebook.com platform.twitter.com *.kaltura.com *.perkinelmer.com
Strict-Transport-Security: max-age=60
Date: Tue, 27 Jul 2021 09:37:30 GMT
It is possible we need to pass the correct set of headers but from a quick investigation, a valid User-Agent
header is not sufficient and I cannot find what else is required. Excluding this URl from the linkcheck here and in the Bio-Formats documentation might be our best course of action for the moment. Any objections @dgault ?
No objections from my side, lets exclude it until we know more.
The job started failing today I have restarted it but it looks like a problem on PE side excluding for now cc @dgault