arXiv / arxiv-browse

Flask app for article abstract and listing pages
MIT License
117 stars 63 forks source link

https URLs redirect to http #718

Open tomhughes opened 2 months ago

tomhughes commented 2 months ago

Describe the bug An https URL that redirects like https://arxiv.org/pdf/2308.16025.pdf will redirect to http://arxiv.org/pdf/2308.16025 losing the https and meaning that, for example, firefox will not automatically open the document without the user confirming it is safe.

To Reproduce Steps to reproduce the behavior:

  1. Go to https://arxiv.org/pdf/2308.16025.pdf

Expected behavior A redirect to https://arxiv.org/pdf/2308.16025

Additional context I'm not sure if this is the right repo... I can see that https://github.com/arXiv/arxiv-browse/blob/92ac910dd41b9e5a96d0ee5eb9c824b68eac4ef7/browse/routes/dissemination.py#L20 just asks flask to build the redirect URL so I guess it comes down to the flask config and I assume it thinks it is http because it's behind a proxy or load balancer of some sort that is terminating the https

Possibly it's down to https://github.com/arXiv/arxiv-browse/blob/92ac910dd41b9e5a96d0ee5eb9c824b68eac4ef7/browse/factory.py#L67 in which case the issue is presumably in how https://github.com/arXiv/arxiv-base sets the canonical URL?