Open DmitryLitvintsev opened 1 year ago
But isn't PinManager designed to handle restarts
PinManager will not resume processing pinning requests that were created but not finished before the PinManager shut down, if that is what you are asking. These kinds of requests should be deleted by PinManager after it restarts once their short lifetime expires (temporarily set during pin creation).
In any case, these should be transient errors due to a high number of parallel requests to the pins database table at a certain point. Maybe PinManager was busy with lots of reissued requests that were stored in SrmManager (which does persist requests over restarts and will retry them), many new requests or maintenance tasks -- hard to say without more information.
PinManager, or in this case either SrmManager or the client issuing the srm requests should retry them if they are new pin requests. If they are indeed old ones that were attempted to be flagged for removal (setting them to READY_TO_UNPIN
), then PinManager will try again.
This then goes back to my questioning - why do we even need PinManager if it does not recover pin processing and we relies that client will eventually retry. But that is a larger discussion. You think the stack trace should be fixed though?
Yes, the stack trace should be fixed. I'm looking into it.
When dCache was been restarted after upgrade to 8.2.20 the following stack trace seen (it is not an ongoing issue - I surmise these are pins held over since before restart. But isn't PinManager designed to handle restarts)