Closed yzqzss closed 11 months ago
Several things:
wikiteam
and older versions of this project. That's not a good idea, since people are depending on that (at least I am in some of my code).This is the sort of thing where I will defer to everyone else.
In general I would say that anything breaking backwards compatibility should be dependent on implementation of build-versioning that would allow users to reliably target older versions.
Right now this repository does not even have version-tagged GitHub builds, let alone versioned PyPI builds, so at this point breaking backwards compatibility is a no-go.
As for drastically refactoring the code… that's fine, and probably for the better, as long as there is build-versioning in place to protect existing users.
Regarding format compatibility: I think introducing a new default format is fine as long as the existing upstream format continues to be supported alongside it for a substantial "bridge" period, with (a) an ability to convert existing dumps to the new format, and (b) there are strong "deprecation" nudges encouraging users to migrate.
Refactoring (and abstracting much of the backend) could of course facilitate this, hence why I'm supportive of refactoring more generally.
Basically what I'm saying is that introducing a new data format should be dependent on first establishing a stable public API for the backend, which currently does not exist.
This is too complex for my level of comprehension - except that this PR for wikiteam3 is not in that repository.
Here's my thought…
@yzqzss why don't you open a new Pull Request from an earlier commit on this branch? (I think you have to create a branch from that commit in order to do so.)
This would be much easier to approach if it wasn't a gigantic total total rewrite all at once, and breaking it into chunks this way would help.
If you're not interested in doing thing, to be fair, we could try and do so ourselves, but you're more familiar with your own code than we are.
Fixes
feats
--disable-image-verify
(https://github.com/mediawiki-client-tools/mediawiki-scraper/issues/170)--user-agent
to custom UA--ia-wbm-booster
Incremental image dump powered by web.archive.org (experimental)refactor
countless ...
drop legacy code
Special:Export
Breaking changes
False
tonull
.desc
xmlfile supportDrop launcher
Shifts compression responsibilities from the launcher to the uploader.
dependencies
refactor uploader
--parallel
to disable it)https://pypi.org/project/wikiteam3/