dCache / dcache

dCache - a system for storing and retrieving huge amounts of data, distributed among a large number of heterogenous server nodes, under a single virtual filesystem tree with a variety of standard access methods
https://dcache.org

hanging pools, heap dump fails #2595

Open calestyo opened 8 years ago

calestyo commented 8 years ago

Hey.

dCache 2.15.8.

FYI, we've had some pools hang (see attached logs); perhaps that helps you. A thread dump should be included.

Also

# dcache dump heap pool_lcg-lrz-dc07_0 pool_lcg-lrz-dc07_0.heap
Failed to resolve .

fails.

Cheers, Chris.

pool_lcg-lrz-dc07_0.log.xz.txt (This is not a txt but an .xz... github is stupid with its file endings)

gbehrmann commented 8 years ago

That's a different issue. You are likely in a directory in which the dCache user doesn't have write permissions.

gbehrmann commented 8 years ago

...and by a different issue I was referring to some other thread in which somebody had problems with dumping pools :-) Should have scrolled to the top to check if this was a new issue or not.

calestyo commented 8 years ago

That's a different issue. You are likely in a directory in which the dCache user doesn't have write permissions.

Ah, yes, I wasn't. But it's not clear (neither from the tool nor from the error message) that this is required.

gbehrmann commented 8 years ago

Yeah, the original version of the patch would have written "pool_lcg-lrz-dc07_0.heap is not writable to user dcache", but there were concerns over that error message (see https://rb.dcache.org/r/9303/).

calestyo commented 8 years ago

Concerns? Now it says "Failed to resolve ." which is completely misleading… It could simply say "No permissions to write as user ." and everyone would probably get it.
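
For illustration, a pre-flight check along these lines would surface the actual cause before the dump is attempted. This is only a minimal sketch, not code from dCache: the function name `check_dump_target` and the message wording are hypothetical. It tests writability for whichever user runs it, so to reproduce the failure discussed here it would have to be run as the dCache user.

```shell
#!/bin/sh
# Hypothetical helper (not part of dCache): check that the directory which
# would receive the heap dump exists and is writable by the current user.
check_dump_target() {
    target=$1
    # For a bare file name such as "pool.heap", dirname yields ".".
    dir=$(dirname -- "$target")
    if [ ! -d "$dir" ]; then
        echo "Directory $dir does not exist" >&2
        return 1
    fi
    if [ ! -w "$dir" ]; then
        echo "Directory $dir is not writable by user $(id -un)" >&2
        return 1
    fi
    return 0
}
```

Calling `check_dump_target pool_lcg-lrz-dc07_0.heap` before `dcache dump heap` would then report the unwritable directory and the user name, rather than leaving the reader to guess.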