openhatch / oh-bugimporters

Bug importers for the OpenHatch project oh-mainline
https://oh-bugimporters.readthedocs.org/
GNU Affero General Public License v3.0

oh-bugimporters: Refuse to follow cross-domain redirects (within Bugzilla importer) #75

Open ghost opened 10 years ago

ghost commented 10 years ago

Comment by paulproteus:

When reading the bug importer logs, I noticed a bit of exciting insanity:

2013-05-17 01:24:58+0000 [Spider for importing using oh-bugimporters] DEBUG: Redirecting (302) to <GET https://bugzilla.mozilla.org/buglist.cgi?quicksearch=show_bug.cgi> from <GET https://bugzil.la/show_bug.cgi?...]

Nothing good can come of this redirect (-:

In particular, it indicates that the bug tracker is misconfigured:

  • The base_url should be bugzilla.mozilla.org, not bugzil.la in this case.

Moreover, we should never follow cross-domain redirects: a redirect to a different domain is itself a sign that the bug tracker is misconfigured.
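
A minimal sketch of the check this implies, using only the Python standard library. The function name `is_cross_domain_redirect` is hypothetical and not part of oh-bugimporters or Scrapy; the idea is that the importer would drop (and log) any redirect for which it returns True, rather than following it.

```python
from urllib.parse import urljoin, urlsplit


def is_cross_domain_redirect(request_url, location):
    """Return True if following `location` would leave the request's host.

    Hypothetical helper for illustration only.
    """
    # A Location header may be relative, so resolve it against the
    # URL that was originally requested before comparing hosts.
    target_url = urljoin(request_url, location)
    source_host = (urlsplit(request_url).hostname or "").lower()
    target_host = (urlsplit(target_url).hostname or "").lower()
    return source_host != target_host


if __name__ == "__main__":
    # Hosts taken from the log line above: bugzil.la -> bugzilla.mozilla.org
    print(is_cross_domain_redirect(
        "https://bugzil.la/show_bug.cgi",
        "https://bugzilla.mozilla.org/buglist.cgi?quicksearch=show_bug.cgi",
    ))
```

This compares exact hostnames, so a redirect between subdomains of the same site would also count as cross-domain. That strictness is an assumption here; it matches the point that base_url should name the real host.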


Comment by blues_chick:

Investigating Scrapy settings for expanding the shortened URL bugzil.la to the canonical URL bugzilla.mozilla.org before starting the web crawl.


Status: chatting
Nosy List: blues_chick, paulproteus
Priority: bug
Imported from roundup ID: 857
Last modified: 2013-10-05.18:35:58