99moorem / wikiteam

Automatically exported from code.google.com/p/wikiteam
0 stars 0 forks source link

MediaWiki skins and resources not archived #82

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
When we archive wikis, it would be very nice to also try to archive the skin as 
well (css, images, etc?), in a separate folder

Original issue reported on code.google.com by dan...@danieloaks.net on 22 Jan 2014 at 3:36

GoogleCodeExporter commented 8 years ago
Maybe MatmaRex has suggestions on how to do this.

Original comment by nemow...@gmail.com on 31 Jan 2014 at 3:26

GoogleCodeExporter commented 8 years ago
Following a chat with him, I think we're just going to do something like this 
right after saving index.html: wget --page-requisites -e "robots=off" 
--no-directories --directory-prefix=requisites 
http://wiki.xkcd.com/wgh/index.php?debug=true

The example produces something like this:

Total wall clock time: 20s
Downloaded: 53 files, 375K in 1,6s (242 KB/s)
$ ls requisites/
15px-800px-Flag_of_Sweden.png                     
Checker-16x16.png?2013-11-23T23:33:20Z
15px-Flag_of_Chile.png                            
discussionitem_icon.gif?2013-11-23T23:33:20Z
15px-Flag_of_France.png                           
document.png?2013-11-23T23:33:20Z
15px-Flag_of_Germany.png                          
external-ltr.png?2013-11-23T23:33:20Z
15px-Flag_of_Mexico.png                           
feed-icon.png?2013-11-23T23:33:20Z
15px-Flag_of_Spain.png                            
file_icon.gif?2013-11-23T23:33:20Z
173px-1-26-2014_Humbucker_Hashpoint.jpg           
headbg.jpg?2013-11-23T23:33:20Z
173px-20140118_Cles_006.jpg                       
help-question.gif?2013-11-23T23:33:20Z
173px-2014-01-20_47_8_locked_twice.jpg            
help-question-hover.gif?2013-11-23T23:33:20Z
173px-2014-01-21_-35_149_14.42.10.jpg             Holidaylogo.png
173px-2014-01-21_42_-85_3.jpg                     index.php?debug=true
173px-2014-01-22_43_-116_train.jpg                
load.php?debug=true&lang=en&modules=mediawiki.legacy.commonPrint&only=styles&ski
n=monobook&*
173px-2014-01-23_43_-116_geohasher.jpg            
load.php?debug=true&lang=en&modules=mediawiki.legacy.shared&only=styles&skin=mon
obook&*
173px-2014-01-25_16.35.27.jpg                     
load.php?debug=true&lang=en&modules=site&only=scripts&skin=monobook&*
174px-2014-01-18_52_13_GeorgDerReisende_5370.jpg  
load.php?debug=true&lang=en&modules=site&only=styles&skin=monobook&*
174px-2014-01-19_52_13_GeorgDerReisende_5524.jpg  
load.php?debug=true&lang=en&modules=skins.monobook&only=styles&skin=monobook&*
175px-2014-01-19_43_-121_grins.jpg                
load.php?debug=true&lang=en&modules=startup&only=scripts&skin=monobook&*
175px-2014-01-20_34_-118_17-25-38-320.jpg         
lock_icon.gif?2013-11-23T23:33:20Z
175px-2014-01-20_44_-122_grins.jpg                magnify-clip.png
175px-2014-01-20_45_-122.JPG                      
mail_icon.gif?2013-11-23T23:33:20Z
175px-2014-01-26_50_8_hashgrin.jpg                
news_icon.png?2013-11-23T23:33:20Z
175px-2014-01-28_50_8_hashgrin.jpg                poweredby_mediawiki_88x31.png
180px-2009-04-25_49_-123.grouppose.JPG            
spinner.gif?2013-11-23T23:33:20Z
400px-Coordinates.png                             
tipsy-arrow.gif?2013-11-23T23:33:20Z
ajax-loader.gif?2013-11-23T23:33:20Z              user.gif?2013-11-23T23:33:20Z
audio.png?2013-11-23T23:33:20Z                    video.png?2013-11-23T23:33:20Z
bullet.gif?2013-11-23T23:33:20Z

Original comment by nemow...@gmail.com on 31 Jan 2014 at 8:01

GoogleCodeExporter commented 8 years ago
Note, I'd however delete the index.php* file to only keep index.html in the 
main directory and because otherwise we should redact the IP there too, as we 
do with index.html.

Original comment by nemow...@gmail.com on 31 Jan 2014 at 8:28

GoogleCodeExporter commented 8 years ago
It's not always as easy, i.e. 
https://bugzilla.wikimedia.org/show_bug.cgi?id=61249 suggests
    wget -e robots=off --page-requisites --convert-links --adjust-extension
--span-hosts --domains meta.wikimedia.org,bits.wikimedia.org meta.wikimedia.org

Original comment by nemow...@gmail.com on 14 Feb 2014 at 7:31