Closed jorgemoralespou closed 5 days ago
What did you see when you run:
% kubectl get workshopsessions
NAME URL USERNAME PASSWORD STATUS MESSAGE
educates-cli-w03-s001 Failed Unable to create workshop session objects, failed creating object localhost-educates-cli-w03-s001 of type xService in namespace educates-cli-w03 for workshop session educates-cli-w03-s001.
after error from session.objects
?
Or when listing workshopenvironments
for environment.objects
or workshopallocations
for request.objects
.
FWIW, in default config for kopf it would create events for a rediculous amount of log messages which created noise and causes events to be garbage collected more quickly making events overall pretty useless. This was therefore dialed back.
Although messages is captured in status message of these custom resources, some like workshopsession
and workallocation
are themselves potentially shorted live and get cleaned up after the workshop session startup timeout, session is orphaned, explicitly terminated or workshop duration is expired. So do accept will be lost perhaps quicker than events, which typically might last an hour at least if not garbage collected sooner due to volume.
The kopf framework does provide an API for generating explicit events from an operator, so can add that for these in addition to adding them to status of the respective custom resources.
These errors are also reported as part of the analytics event stream, which would be the only viable path for permanent capture using an external system.
Actually, analytics events are not reporting for these specific errors at the moment.
Checking, there should already have been an event for this error without needing to change anything. Something like:
0s Error Logging workshopsession/educates-cli-w02-s002 Handler 'workshop_session_create' failed permanently: Unable to create workshop session objects, failed creating object registry-educates-cli-w02-s002 of type xService in namespace educates-cli-w02 for workshop session educates-cli-w02-s002.
My bad. Closing this issue as there's already events that somehow I missed to see
Is your feature request related to a problem? Please describe.
When created a cluster without kapp-controller and then deployed a workshop which had a dependency on kapp-controller, workshops were not spinning up but one could not immediately know what was the cause of the error. When looking at the logs of session manager, one could see this:
Describe the solution you'd like
Last line being the more obvious one:
It'll be ideal if this could be surfaced to the user in a more prominent way, maybe via an event.
Describe alternatives you've considered
No response
Additional information
No response