trappedinspacetime / wikiteam

Automatically exported from code.google.com/p/wikiteam

Some wikkii wikis die with HTTP Error 302: The HTTP server returned a redirect error that would lead to an infinite loop #78

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
I'm having problems downloading the following wikis:

drwxr-xr-x 2 federico users     3 Nov  7 17:01 adventurecraftwikkiicom_w-20131107-wikidump
drwxr-xr-x 2 federico users     3 Nov  7 17:02 amzwikkiicom_w-20131107-wikidump
drwxr-xr-x 2 federico users     3 Nov  7 17:02 animebathswikkiicom_w-20131107-wikidump
drwxr-xr-x 2 federico users     3 Nov  7 17:14 bbnwikkiicom_w-20131107-wikidump
drwxr-xr-x 2 federico users     3 Nov  7 17:15 biowiki2wikkiicom_w-20131107-wikidump
drwxr-xr-x 2 federico users     3 Nov  7 21:12 corporispublicawikkiicom_w-20131107-wikidump
drwxr-xr-x 2 federico users     3 Nov  8 05:39 dyoswikkiicom_w-20131108-wikidump
drwxr-xr-x 2 federico users     3 Nov  8 07:41 encytivewikkiicom_w-20131108-wikidump
drwxr-xr-x 2 federico users     3 Nov  8 10:30 evcaocwikkiicom_w-20131108-wikidump
drwxr-xr-x 2 federico users     3 Nov  8 22:29 gintamawikkiinet_w-20131108-wikidump
drwxr-xr-x 2 federico users     3 Nov  8 23:00 goonwikkiicom_w-20131108-wikidump
drwxr-xr-x 2 federico users     3 Nov  9 00:05 gsuphilosophywikkiicom_w-20131109-wikidump
drwxr-xr-x 2 federico users     3 Nov  9 01:17 herwekswikkiicom_w-20131109-wikidump
drwxr-xr-x 2 federico users     3 Nov  9 04:59 jalotrowikkiicom_w-20131109-wikidump
drwxr-xr-x 2 federico users     3 Nov  9 14:42 meowpediawikkiicom_w-20131109-wikidump
drwxr-xr-x 2 federico users     3 Nov  9 22:08 multiversalwikkiicom_w-20131109-wikidump
drwxr-xr-x 2 federico users     3 Nov  9 23:44 nathansgenealogywikkiicom_w-20131109-wikidump
drwxr-xr-x 2 federico users     3 Nov 10 02:06 oregonhousedistrict44wikkiicom_w-20131110-wikidump
drwxr-xr-x 2 federico users     3 Nov 10 05:54 postamundiwikkiicom_w-20131110-wikidump
drwxr-xr-x 2 federico users     3 Nov 10 10:50 river01wikkiicom_w-20131110-wikidump
drwxr-xr-x 2 federico users     3 Nov 11 00:11 tarqwikkiicom_w-20131111-wikidump
drwxr-xr-x 2 federico users     3 Nov 11 05:47 tmpediawikkiicom_w-20131111-wikidump
drwxr-xr-x 2 federico users     3 Nov 11 06:02 tramswikkiicom_w-20131111-wikidump
drwxr-xr-x 2 federico users     3 Nov 11 10:14 ufowikkiicom_w-20131111-wikidump
drwxr-xr-x 2 federico users     3 Nov 11 13:09 varsoviowikkiicom_w-20131111-wikidump
drwxr-xr-x 2 federico users     3 Nov 11 22:30 zoowikkiicom_w-20131111-wikidump

Everything seems to go well, then the title download (by screen scraping) suddenly aborts. For instance:

Sleeping... 3 seconds...
    Reading Chocolate_cake.gif-Firecrackers.gif 3979 bytes 14 subpages 0 pages
Sleeping... 3 seconds...
    Reading Firm_Tofu.gif-Ken_cake.gif 3979 bytes 14 subpages 0 pages
Sleeping... 3 seconds...
Traceback (most recent call last):
  File "dumpgenerator.py", line 1202, in <module>
    main()
  File "dumpgenerator.py", line 1193, in main
    resumePreviousDump(config=config, other=other)
  File "dumpgenerator.py", line 1027, in resumePreviousDump
    titles = getPageTitles(config=config)
  File "dumpgenerator.py", line 249, in getPageTitles
    titles = getPageTitlesScraper(config=config)
  File "dumpgenerator.py", line 221, in getPageTitlesScraper
    raw2 = urllib2.urlopen(req).read()
  File "/usr/lib/python2.6/urllib2.py", line 126, in urlopen
    return _opener.open(url, data, timeout)
  File "/usr/lib/python2.6/urllib2.py", line 397, in open
    response = meth(req, response)
  File "/usr/lib/python2.6/urllib2.py", line 510, in http_response
    'http', request, response, code, msg, hdrs)
  File "/usr/lib/python2.6/urllib2.py", line 429, in error
    result = self._call_chain(*args)
  File "/usr/lib/python2.6/urllib2.py", line 369, in _call_chain
    result = func(*args)
  File "/usr/lib/python2.6/urllib2.py", line 595, in http_error_302
    self.inf_msg + msg, headers, fp)
urllib2.HTTPError: HTTP Error 302: The HTTP server returned a redirect error that would lead to an infinite loop.
The last 30x error message was:
Moved Temporarily

Original issue reported on code.google.com by nemow...@gmail.com on 15 Nov 2013 at 12:14
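
For reference, the traceback shows the standard library's redirect handler giving up after the server kept answering 302, and the uncaught exception aborting the whole run. The following is a minimal sketch of how a scraper could tolerate that, not a patch to dumpgenerator.py. It assumes, without evidence from this thread, that some affected servers keep redirecting until the client sends back a cookie. The names make_cookie_opener, is_redirect_loop, and fetch_or_none are hypothetical.

```python
try:  # Python 3
    from http.cookiejar import CookieJar
    from urllib.error import HTTPError
    from urllib.request import HTTPCookieProcessor, build_opener
except ImportError:  # Python 2, which the traceback above is running
    from cookielib import CookieJar
    from urllib2 import HTTPError, HTTPCookieProcessor, build_opener


def make_cookie_opener():
    """Return an opener that stores cookies and sends them on later requests."""
    return build_opener(HTTPCookieProcessor(CookieJar()))


def is_redirect_loop(error):
    """True if `error` is the 'infinite loop' failure raised on repeated 30x."""
    return (isinstance(error, HTTPError)
            and 300 <= error.code < 400
            and "infinite loop" in str(error.msg))


def fetch_or_none(opener, url):
    """Fetch `url`; on a redirect loop, report it and return None.

    Any other HTTP error is re-raised unchanged.
    """
    try:
        return opener.open(url).read()
    except HTTPError as error:
        if is_redirect_loop(error):
            print("Redirect loop at %s, skipping" % url)
            return None
        raise
```

A caller would build one opener per wiki with make_cookie_opener() and pass each listing URL to fetch_or_none(), treating None as "this page could not be read". Whether cookies actually resolve the wikkii case would still have to be checked against a live wiki.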

GoogleCodeExporter commented 8 years ago
Error seems to be the same for all of them.

Original comment by nemow...@gmail.com on 15 Nov 2013 at 12:29

GoogleCodeExporter commented 8 years ago
To clarify, this redirect error also happens with other (misconfigured?) 
wikis/webservers, but I suspect there can be many different underlying causes 
and no way to address them with a general solution. A solution for wikkii 
alone would be good enough.

Original comment by nemow...@gmail.com on 31 Jan 2014 at 3:14
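
Since the underlying cause may differ per server, one way to investigate is to record each redirect hop before the library gives up. The sketch below is an illustration under assumptions, not code from dumpgenerator.py. RecordingRedirectHandler and first_repeated_target are hypothetical names, and the handler only overrides the standard redirect_request hook to log the status code, source URL, and target URL.

```python
try:  # Python 3
    from urllib.request import HTTPRedirectHandler, Request, build_opener
except ImportError:  # Python 2
    from urllib2 import HTTPRedirectHandler, Request, build_opener


class RecordingRedirectHandler(HTTPRedirectHandler):
    """Redirect handler that remembers every (code, source, target) hop."""

    def __init__(self):
        self.hops = []

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        # Log the hop, then defer to the stock behaviour.
        self.hops.append((code, req.get_full_url(), newurl))
        return HTTPRedirectHandler.redirect_request(
            self, req, fp, code, msg, headers, newurl)


def first_repeated_target(hops):
    """Return the first redirect target already visited, or None."""
    seen = set()
    for _code, source, target in hops:
        seen.add(source)
        if target in seen:
            return target
        seen.add(target)
    return None
```

To use it, create the handler, pass it to build_opener(), open the failing URL inside a try/except for HTTPError, and then inspect handler.hops. A target equal to its own source suggests the server wants something the client is not sending (a cookie or a header, for example), while a longer cycle points at conflicting rewrite rules.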

GoogleCodeExporter commented 8 years ago

Original comment by nemow...@gmail.com on 27 Feb 2014 at 12:11

GoogleCodeExporter commented 8 years ago
Same error when dumping http://www.wiki-aventurica.de/index.php

Original comment by afkretzs...@googlemail.com on 29 Mar 2014 at 9:20