Open mrzv opened 9 years ago
Comment by Dmitriy Morozov Monday Mar 02, 2015 at 21:38 GMT
I've never run it in this regime, so it's not a known problem. But it's not difficult to guess what may be going wrong. There is a division by size()
in flush()
. That will definitely fail (since size is 0). There may be other problems.
I should note that this is not a regime I've thought about before. Other things might be failing as well.
Comment by Hadrien Croubois Monday Mar 02, 2015 at 21:46 GMT
I understand that's not something you might expect, still you might run into it in specific cases :
I should note that this is not a regime I've thought about before. Other things might be failing as well.
That's what I'm looking at
Comment by Dmitriy Morozov Monday Mar 02, 2015 at 21:54 GMT
Oh, I don't question the usefulness of the setting. Just acknowledging that I haven't thought about it before.
I'll fix this problem when I get a chance. Meanwhile, you can see if something simple, like setting out_queues_limit = 0
in flush()
, if size() == 0
solves it for you.
Comment by Dmitriy Morozov Tuesday Mar 03, 2015 at 01:09 GMT
e63cbea might fix the immediate problem (in the way I described), but there are deeper problems with the design in this case. (DIY collectives break, as does the pattern of loading the data via collective IO during decomposition.) I need to think through this to figure out the best solution.
Issue by Hadrien Croubois Monday Mar 02, 2015 at 21:02 GMT
When the total number of block is smaller then the number of procecess running in parallele, calling diy::Master::exchange() gives an error
Is that a known problem ? I'll try and have a lot a where it comes from