Open catmaps opened 1 year ago
hello @catmaps
yes, I am continuing to 'improve' my fork of xowa
This fork has diverged somewhat from xowa, to the point where the datafiles uploaded by @gnosygnu would no longer be usable.
My own internet connection is not fast (down 16mbps, up 2mbps).
As an example en.wikipedia.org is now roughly 206 files, 440Gb in size (including 327Gb images) whereas species.wikimedia.org is roughly 33 files 5.8Gb (including 3.35Gb of images)
Most of the work I have been doing is to try to make the pages for a wiki identical to the original (although edits to the page after a dump need to be taken into consideration) using the http interface not the gui side
One of the most recent changes is the introduction of a new skin (Vector-2022) now introduced to enwiki but not specieswiki (yet)
Another area is the images (the data files just increase, there is no deletion of old content, or updating of images with newer images)
One way to go ahead is to find a mutually agreeable site to upload one (or more) of the smaller wikis to.
Another (if you are a programmer) is to build from scratch
What are your thoughts?
Thank you for your reply - Very interesting! I have some thoughts but I want to first think this over the weekend and then reply.
I am back. I have several different thoughts about images hosting and the ever-changing wikimedia templates etc. But first to better understand where we are at, I have a few questions:
hello @catmaps
to answer your points 1) yes - In the sense that new images are added when identified (by the html build of the wiki); however, this is an area I am trying to renovate. At the moment there is no attempt to update any images, and no attempt to remove unnecessary/unused images. I am experimenting with adding columns to allow timestamping the images.
I can see the benefit of adding images (from an up/down load perspective) but there also comes a time when the whole lot need to be refreshed (and reduced).
2) yes - as mentioned above, to identify the images, I do a batch build of html (mainly the Main namespace, but some of the other namespaces are built as well) - I have also changed how the html is stored, to save a small amount of space (and I believe time)
3) I have not been following the xowa dumps recently - I did not know that a Nov 2022 dump exists
4) I am sure there are other things, database rearrangement, additional tables and columns to name some
I am now using Java 17.2 - not for any of the specific new features, just to be reasonably current.
When it comes to looking at the full image. Xowa is intended essentially as an offline copy of a wiki including images (where these images are thumbnails intended for use in the html pages). I believe that looking at a link like en.wikipedia.org/wiki/File:Snowflake_macro_photography_1.jpg should go directly to the real website ().
For the image discussed Snowflake_macro_photography_1.jpg, although the link uses en.wikipedia.org within the page (on the real site) is the line 'This is a file from the Wikimedia Commons. Information from its description page there is shown below.'
I see from the snapshot, that in the list of wikis commons.wikimedia.org is not listed - I wonder if downloading that wiki would make a difference?
Just a brief reply until I can look into it deeper: Your reply to numbers 1 and 2 is helpful. . . . I am still trying to test some of my ideas for having current images. For number 3, I was referring to the regular wiki dumps at https://dumps.wikimedia.org/enwiki/ Thanks for the idea on 4, I will do some testing including a commons wiki next week.
I am back. I am still building/testing the Commons wiki addition. But here are a few general ideas/considerations on Xowa.
Hello desb42. I want to reach out and say "Hi." A while ago you had helped out me at https://github.com/gnosygnu/xowa/issues/829
I gave Gnosygnu feedback from 2016 to 2020. But he is now very quiet--I don't know if he still plans to resume someday.
But today I noticed that you are continuing to improve your fork of Xowa. 👍
I would be glad to hear how you wish to improve in MyXowa.
If interested, I could offer feedback and testing and suggestions.