Closed trofi closed 1 year ago
Hello,
This function does not scale well Your code took ~4H on my PC
Using a large batchsize is not a good idea: with the defaults, it runs in 28min
https://github.com/markfasheh/duperemove/pull/322 allowed me to speed --batchsize=1000000
down to 1 minute.
Let's declare it done: #322 made it good enough for this test.
I think I have a reproducer script of a hanging
duperemove
. I initially wanted to use it to measure scalability bottlenect ofduperemove
, but looks like I got it to get stuck:Note: there is no progress over two hours. I think it should succeed in minutes (or tens of minutes worst). I ran it on compressed
btrfs
.