Open polyrabbit opened 3 weeks ago
Which metadata system is used? tikv,redis or sql?
Tikv, I suppose GC safe point
appears a lot in tikv engine.
@polyrabbit Can you try #5080?
Unfortunately #5080 still fails with:
2024/11/08 11:01:39.893074 juicefs[50149] <FATAL>: GC life time is shorter than transaction duration, transaction starts at 2024-11-08 10:50:43.874 +0800 CST, GC safe point is 2024-11-08 10:51:34.174 +0800 CST [main.go:31]
But this time it runs longer (13min+) than before, I suppose there is another txn opened too long?
Update: a second test works now, the progress shows it needs 10+ hours to finish, I'll wait to see if it succeeds tomorrow.
The difference between those two tests is that I rebased #5080 this morning, and the second test is I cherry-picked #5080 - I suppose there are some conflicts between those commits.
Also, I noticed backup spends lots of time on sorting large dirs: https://github.com/juicedata/juicefs/blob/c90a175d323a4f4593b3eaff7c750d3611417b28/pkg/meta/tkv.go#L2897 Is it necessary?
It took 7h+ to backup 318039153 files.
We are working on a faster dump into binary format, will let you know when it's ready
Why not consider merging #5080? Does it have any critical drawbacks? I suppose stream scan also benefits other cases.
We have a volume with 500M+ inodes, the metadata backup always fails with the following error: