Integration of further functionality

ml31415 commented 2 years ago

I recently had all sorts of issues converting a py2 db with 50GB of compressed grown data, including some unreadable pickles, compression errors and so on, into something readable for py3. In the end I had do patch quite some parts in zodbupdate and other tools in order to get the job done. The result is in the link below. I don't think it would be worth being published as a separate project, but it might be worth being integrated in your zodbtools. What do you think?

https://gist.github.com/ml31415/b4d00251e76afe35358070564864287a

navytux commented 2 years ago

Good day, Michael.

Thanks for sharing, this should be a useful functionality when one needs to migrate / verify such a database.

I had a quick look at the code and, as I understand, it provides the following functionality:

1) FTFileStorage that amends std FileStorage to: a) ignore format-level problems, such as non-zero version length, and b) ignore unloadable objects (treating them as not having references) when doing GC.

2) an FT storage iterator that skips objects that fail to load (do not exist, zlib error, IO error, whatever ...)

3) command to verify object records for being loadable/unpicklable ok.

4) command to fix unloadable/non-unpicklable object records by replacing them with PersistentBroken.

5) command to rebuild an index for FileStorage.

6) command to pack a database.

7) FTUpdater that amends zodbupdate.Updater to create transactions with more informative notes, and that can replace objects, that fail to load, with PeristentBroken + associated command to convert a database via FTUpdater.

This is a mixture of functionality, that in my view, would be better integrated into appropriate places in parts. If that is ok with you, here is what I would suggest:

Add a kind of "fix" storage wrapper, that would wrap arbitrary storage and provide similar functionality that FTFileStorage provides over FileStorage. For example the following zurl
```
fix:(/path/to/data.fs)(unloadable-as-broken)
```
would create such FixStorage over FileStorage(/path/to/data.fs) and configure the FixStorage to represent object records, that fail to load, with corresponding Broken objects.

Then, if we have such fix: storage, we can run whatever zodb commands, or zodbupdate conversion, over it.

For me the place for FixStorage is ok to be in zodbtools.

And for the reference, zurl syntax for repair follows zurl syntax for demo: storage - see e.g.

https://docs.pylonsproject.org/projects/zodburi/en/latest/index.html#demo-uri-scheme and https://lab.nexedi.com/kirr/neo/commit/4fb6bd0a
Add zodb pack command to zodbtools. For the reference we already have draft for pack here: https://lab.nexedi.com/kirr/zodbtools/blob/4b3b9791/zodbtools/zodbpack.py
Add zodb fs1 reindex command to zodbtools. This would match your reload_index functionality. For the reference zodbtools/go already have both verify-index and rebuild-index subcommands here:

https://lab.nexedi.com/kirr/neo/blob/789c9ed9/go/zodb/storage/fs1/fs1tools/index.go
For repairing an object, add something like zodb fixobj --as-broken command to zodbtools. For the reference we already have zodb catobj subcommand in zodbtools/go (https://lab.nexedi.com/kirr/neo/blob/789c9ed9/go/zodb/zodbtools/catobj.go) and corresponding draft for zodbtools/py (https://lab.nexedi.com/kirr/zodbtools/blob/b9070f8f/zodbtools/zodbcatobj.py).
For verifying the database, add something like zodb verify, that would verify the database both at storage level (what fs1 verify does today), and also verify objects payload for being unpickled ok, non-dangling references, etc.
For zodbupdate changes, actually create patches to that project for commit notes verbosity, and make sure that zodbupdate fix:... works as expected (e.g. it refreshes object represented by FixStorage as Broken and actually commits them into the database).

What do you think? Is that ok with you? If yes - then we can proceed with creating corresponding merge-requests step-by-step.

Kirill

P.S. regarding ignoring non-zero version length I'm not so sure it is ok to ignore FileStorage format problems. Non-zero version length might represent either corruption of data.fs, or actually an old record with version information present. If it is corruption of data.fs, then we should be checking and repairing other format errors as well, e.g. broken transaction record headers, broken data record headers, inconsistencies in between those, etc. For the reference FileStorage/go does that automatically and also there is fs1 verify command to explicitly run this check - see e.g. https://lab.nexedi.com/kirr/neo/commit/5a26fb31 .

But if it is actually an old record with version information present - then just ignoring non-zero version length would lead to loading object data payload from wrong position (and thus getting "bad" data for the object). Why it is like this should be clear from the following link:

https://github.com/zopefoundation/ZODB/commit/eef59eb35610#diff-25a69183c0b022dbd72bc0ff163682d82339b53c4ff0435b1cabddbf2145c413

So, I believe, if you actually hit those non-zero version lengths, it might be that many objects - that could be preserved - were turned to be Broken unnecessary.

I, of course, might be missing something, but that what comes to mind after checking quickly.

/cc @perrinjerome

ml31415 commented 2 years ago

Hi @navytux I agree, that this functionality ideally should be split up and be integrated in the according tools. I just put all together in one file / program as there is quite some overlap for the tools in order to make the object iteration fault tolerant. I'm not familiar enough with ZODB, if the used iteration approach can be used for any kind of ZODB storage, so that a general fix: ... wrapper can be implemented. So far I used it for zlib and ordinary file storages.

About fixing objects: I was mainly dealing with broken zlib objects. So it was unrecoverable data garbage for me, when unpacking failed, and I wanted just anything marking an error. Providing a hook for what to do with broken objects would probably be a good idea. In the ZODB the versioning feature had never been used and was just corrupted. So you're right, it should be covered in a better way if that's a concern.

What do you think? Is that ok with you?

All good for me :+1:

navytux commented 2 years ago

Hi @ml31415, thanks for feedback.

I understand that such kind of tools are usually one-shot, and that, after database is migrated, the interest to enhance the tool is usually no longer there. Still if maybe you want to proceed with upstreaming a bit I could give you some (imho) useful direction:

first I would try to go with easier parts: things like zodb pack, zodb fs1 reindex, zodbupdate verbosity, and, maybe zodb verify,
then, if "1" works ok, I would suggest to study ZODB.IStorage and related interfaces to see how/if FixStorage could be possible and done.

If that is ok, we could try to proceed further step-by-step in pull-requests. But, once again, I will understand that if the migration is already done and the tool is no longer used, there is not so much interest to continue working on it, and that you already tried to share your work via posting the gist and this issue.

Kirill

ml31415 commented 2 years ago

Hi @navytux I'm not sure how any changes would make sense without having the fault-tolerant FixStorage wrapper first. IMHO that's the main improvement, that changes to the other tools could make use of. Otherwise you'd just duplicate code for every tool separately I guess.

I wasn't planning on writing PRs myself right now, as you assumed correctly, I'm done with my week of grief, to get that stuff working at all, and I still got enough to do with the project I wrote the converter for. Might come back to that later, though. But as I said, welcome to take any code snippets you like. :)

navytux commented 2 years ago

Michael, I see; thanks for feedback.

perrinjerome commented 2 years ago

@ml31415 thanks for sharing this script, we will also soon be working on converting some ZODBs to python 3 and if we encounter problems I'll remember that there is this code.

@navytux I agree with everything you said and I also believe that these would be good additions to zodbtools that we can add if we find the need and/or the time.

PS: it's totally of topic but since we are discussing about wrapper storages and storage migration to python3, did you try to write a wrapper storage that would include zodbupdate logic to do the python3 conversion on the fly ? The idea would be to migrate to python3 without much downtime.

Nexedi / zodbtools

Integration of further functionality #1