gnosygnu / xowa

xowa offline wiki application

dewiki null byte is invalid in byte array #541

Open desb42 opened 5 years ago

desb42 commented 5 years ago

I came across a few of these while building dewiki 2019-06-01.

An example is de.wikipedia.org/wiki/Herzogin_Cecilie. Using xowa-gui or xowa-http, the problem does not directly occur. I think it happens in the build process, as part of a more thorough link check.

There is also a visual difference from mediawiki.

The wikitext at the root of this (in this case) is:

[[StreitkrC3%A4fte Chiles#Marine (Armada de Chile)|chilenische Marine]]

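I suspect a mistyping by the editor: the % before C3 is missing (my guess is that %C3%A4, the UTF-8 percent-encoding of ä, was intended). As written, only %A4 is percent-decoded, which leaves a lone 0xA4 byte that is not valid UTF-8.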
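To illustrate why the mistyped target is a problem, here is a minimal Python sketch (not xowa's actual code; `decode_link_target` is a hypothetical helper name) that percent-decodes a link target to bytes and then checks whether those bytes are valid UTF-8:

```python
from urllib.parse import unquote_to_bytes


def decode_link_target(target):
    """Percent-decode a link target; return None if the result is not valid UTF-8.

    Hypothetical helper for illustration only, not part of xowa or MediaWiki.
    """
    raw = unquote_to_bytes(target)
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return None


# Target as it appears in the article: the '%' before 'C3' is missing,
# so only '%A4' is decoded, leaving a stray continuation byte 0xA4.
broken = "StreitkrC3%A4fte Chiles"

# Target as presumably intended (an assumption, per the guess above).
fixed = "Streitkr%C3%A4fte Chiles"

print(decode_link_target(broken))
print(decode_link_target(fixed))
```

With the broken target the decoded bytes contain a lone 0xA4, so the helper returns `None`; with the presumed intended target it returns the readable title.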

In xowa this displays as null1, and in mediawiki as null2.

It seems that if the link is bad (bad in this case meaning not valid Unicode), the whole link [[...]] is ignored.
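For locating other affected pages, one possible approach is to scan wikitext for `[[...]]` links whose percent-decoded target is not valid UTF-8. The sketch below is an assumption-laden illustration, not xowa's or MediaWiki's parser: the regular expression only handles simple, non-nested links, `find_undecodable_links` is a hypothetical name, and the sample sentence around the link is made up.

```python
import re
from urllib.parse import unquote_to_bytes

# Simplified pattern: captures the target of [[target]] or [[target|caption]].
# Nested links and templates are deliberately not handled.
LINK_RE = re.compile(r"\[\[([^\[\]|]+)(?:\|[^\[\]]*)?\]\]")


def find_undecodable_links(wikitext):
    """Return link targets whose percent-decoded bytes are not valid UTF-8."""
    bad = []
    for match in LINK_RE.finditer(wikitext):
        target = match.group(1)
        try:
            unquote_to_bytes(target).decode("utf-8")
        except UnicodeDecodeError:
            bad.append(target)
    return bad


# The first link is the one from this issue; the surrounding text is invented.
sample = (
    "die [[StreitkrC3%A4fte Chiles#Marine (Armada de Chile)|chilenische Marine]] "
    "und [[Chile]]"
)

print(find_undecodable_links(sample))
```

Run over a page's wikitext, this reports only the mistyped link and leaves well-formed links such as `[[Chile]]` alone.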

desb42 commented 4 years ago

Another example of this shows up in ja.wikipedia.org/wiki/Wikipedia:削除依頼/JET_STREAM