Open ghost opened 10 years ago
From rdeltour@gmail.com on September 03, 2012 14:06:24
Status: Accepted
From rdeltour@gmail.com on September 07, 2012 05:09:52
I did face the same issue recently, although I can't reproduce it easily. I'm still interested in a systematic repro if anyone can come up with one.
From rdeltour@gmail.com on September 07, 2012 05:33:11
Yay, I did find a repro after all:
ps
(felix shell command) update 45
The issue is probably that the bundle forgets to unregister itself from the EventBus when it stops. When the script is run later, the bus tries to dispatch the message to load the class to a disposed bundle wiring.
We need to double-check all activate/deactivate service methods to make sure that earlier registrations are properly unregistered.
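To illustrate the kind of symmetry to check for, here is a minimal sketch of a component that registers with a bus in `activate()` and unregisters in `deactivate()`. `SimpleBus`, `ScriptListener` and their methods are hypothetical stand-ins written for this example; they are not the Pipeline's actual classes or the real EventBus API.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.function.Consumer;

// Hypothetical stand-in for an event bus: keeps a list of subscribers
// and delivers each posted event to all of them.
class SimpleBus {
    private final List<Consumer<String>> subscribers = new CopyOnWriteArrayList<>();

    void register(Consumer<String> subscriber) { subscribers.add(subscriber); }

    void unregister(Consumer<String> subscriber) { subscribers.remove(subscriber); }

    void post(String event) {
        for (Consumer<String> subscriber : subscribers) {
            subscriber.accept(event);
        }
    }

    int subscriberCount() { return subscribers.size(); }
}

// Hypothetical component with OSGi-style lifecycle methods.
class ScriptListener {
    private final SimpleBus bus;
    final List<String> received = new ArrayList<>();
    // Keep one handler instance so the same object is registered and unregistered.
    private final Consumer<String> handler = received::add;

    ScriptListener(SimpleBus bus) { this.bus = bus; }

    // Called when the component starts: subscribe to the bus.
    void activate() { bus.register(handler); }

    // Called when the component stops: undo exactly what activate() did,
    // so the bus never dispatches to a stopped component.
    void deactivate() { bus.unregister(handler); }
}

class EventBusLifecycleDemo {
    public static void main(String[] args) {
        SimpleBus bus = new SimpleBus();
        ScriptListener listener = new ScriptListener(bus);

        listener.activate();
        bus.post("job started");   // delivered to the listener

        listener.deactivate();
        bus.post("job restarted"); // not delivered: listener was unregistered

        System.out.println("events received: " + listener.received);
        System.out.println("subscribers left: " + bus.subscriberCount());
    }
}
```

If `deactivate()` skipped the `unregister` call, the bus would keep a reference to the stopped component and still try to deliver the second event to it, which is the situation suspected above.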
From rdeltour@gmail.com on September 07, 2012 05:37:02
Owner: rdeltour@gmail.com
Benetech have found a workaround so this issue isn't super important. However it seems Romain has found a repro so it doesn't take much effort to check if it is still an issue. If possible it would be good to add a Pax-Exam test.
From john.bru...@gmail.com on July 19, 2012 17:39:02
What steps will reproduce the problem?
What is the expected output? What do you see instead?
I expect a book that was converted successfully to be successful again. Instead, if the web service had gotten hung up by another book, then after the restart, the conversion fails. Somewhere along the conversion process there will be an error like this (more complete stack trace in the attached log file):
The only reference to this kind of error I found in Googling was an existing issue with Felix, https://issues.apache.org/jira/browse/FELIX-3477.
What version of the product are you using? On what operating system?
Pipeline2 v1.3-beta, running on Ubuntu 11
Please provide any additional information below.
The attached log file is the console log over the course of many restarts. The error can be seen first happening after the service is restarted at timestamp 19:06:09, following a point where the service was hung for half an hour in the middle of a job.
Getting the web service to hang is easy to reproduce in the sense that it happens fairly often, after just a few books have been sent through, but hard to reproduce in that it may not happen in the same place or with the same book. One situation that appears to trigger this problem is sending through a large book that exhausts the heap.
What we have found as a workaround is to delete all of the files in data/felix-cache before restarting after a hung system. When we do this, we do not see the Felix errors.
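For anyone who wants to script this workaround, here is a minimal sketch that empties a directory while keeping the directory itself. The class and method names are made up for illustration. The default path `data/felix-cache` is taken from the description above and is assumed to be relative to the Pipeline install directory. Run it only while the service is stopped.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper, not part of the Pipeline distribution.
class FelixCacheCleaner {

    // Deletes everything below dir (files and subdirectories) but keeps dir itself.
    // Returns the number of entries deleted; returns 0 if dir is not a directory.
    static int clearDirectory(Path dir) throws IOException {
        if (!Files.isDirectory(dir)) {
            return 0;
        }
        List<Path> entries;
        try (Stream<Path> walk = Files.walk(dir)) {
            entries = walk
                    .filter(p -> !p.equals(dir))
                    // Reverse order puts children before their parent directories.
                    .sorted(Comparator.reverseOrder())
                    .collect(Collectors.toList());
        }
        for (Path entry : entries) {
            Files.delete(entry);
        }
        return entries.size();
    }

    public static void main(String[] args) throws IOException {
        // Assumption: the cache lives in data/felix-cache unless a path is given.
        Path cache = Paths.get(args.length > 0 ? args[0] : "data/felix-cache");
        int deleted = clearDirectory(cache);
        System.out.println("Deleted " + deleted + " entries from " + cache);
    }
}
```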
Attachment: daisy-pipeline-felix-error.log
Original issue: http://code.google.com/p/daisy-pipeline/issues/detail?id=195