coriane / jwpl

Automatically exported from code.google.com/p/jwpl

Some revisions are missing from the created revision SQL file (for Wiktionary-DE) #105

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Create a database config using ConfigGUI (default values, SQL export 
enabled, all namespaces included).
2. Run the DiffTool with the German Wiktionary revision dump (can be provided).
3. Import the created SQL dump.

What is the expected output? What do you see instead?
http://de.wiktionary.org/w/index.php?title=MediaWiki:Mainpage&action=history 
shows 14 revisions.
The database contains only 12. For ArticleID=7, the revisions with 
RevisionID=2519 and RevisionID=14994 are missing. About 20,000 revisions are 
affected in total.

What version of the product are you using? On what operating system?
v0.9.2; linux

Please provide any additional information below.
It seems that the missing revisions are those that do not change the length of 
the text.

Original issue reported on code.google.com by chmeyer.de on 1 Nov 2012 at 2:23

GoogleCodeExporter commented 9 years ago
According to my experiments, 25,452 revisions are missing. The total number of 
revisions that do not change the text length is 106,030. This includes edits 
that do not change the text at all as well as edits that change the text but 
not its length. Unfortunately, I'm not able to count the former separately, 
but they likely account for the 25,452 missing revisions.
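
One way to separate the two cases is to compare each revision with its 
predecessor and label it accordingly. The following is only a minimal sketch, 
not part of JWPL or the DiffTool: the function name classify_revisions, the 
label names, and the toy history are made up for illustration, and it assumes 
the revisions of one article are already available in chronological order as 
(revision ID, text) pairs and that length means character count.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def classify_revisions(revisions: Iterable[Tuple[int, str]]) -> List[Tuple[int, str]]:
    """Label each revision of ONE article relative to its predecessor.

    Hypothetical helper, not a JWPL API. Labels:
      "first"       - no predecessor
      "identical"   - text is exactly the same as the predecessor
      "same_length" - text differs but has the same character count
      "changed"     - character count differs
    """
    labelled = []
    previous_text = None
    for revision_id, text in revisions:
        if previous_text is None:
            label = "first"
        elif text == previous_text:
            label = "identical"
        elif len(text) == len(previous_text):
            label = "same_length"
        else:
            label = "changed"
        labelled.append((revision_id, label))
        previous_text = text
    return labelled


# Toy history with invented IDs and texts (not taken from the real dump).
history = [
    (1, "Hauptseite"),
    (2, "Hauptseite"),      # text not changed at all
    (3, "Hauptseit3"),      # text changed, length unchanged
    (4, "Hauptseite neu"),  # length changed
]
labels = classify_revisions(history)
counts = Counter(label for _, label in labels)
```

Summing counts["identical"] over all articles would give the number of edits 
that do not change the text at all, which could then be compared with the 
number of missing revisions.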

Original comment by chmeyer.de on 1 Nov 2012 at 2:40