Closed (mtitov closed this 3 months ago)
Ah, sorry, I have now checked the log itself, and it has an error message:
1720645075.369 : agent_staging_output.0000 : 62621 : 140737144416000 : DEBUG : put bulk TMGR_STAGING_OUTPUT_PENDING: 1: agent_collecting_queue
1720657683.690 : agent_staging_output.0000 : 62621 : 140737144416000 : DEBUG : advance bulk: 1 [False, True, AGENT_STAGING_OUTPUT]
1720657683.747 : agent_staging_output.0000 : 62621 : 140737144416000 : ERROR : staging prep error
Traceback (most recent call last):
  File "/ccs/proj/chm155/IMPECCABLE/miniconda/envs/rct/lib/python3.9/site-packages/radical/pilot/agent/staging_output/default.py", line 82, in work
    self._handle_task_stdio(task)
  File "/ccs/proj/chm155/IMPECCABLE/miniconda/envs/rct/lib/python3.9/site-packages/radical/pilot/agent/staging_output/default.py", line 181, in _handle_task_stdio
    for line in stderr_f.readlines():
  File "/ccs/proj/chm155/IMPECCABLE/miniconda/envs/rct/lib/python3.9/codecs.py", line 322, in decode
    (result, consumed) = self._buffer_decode(data, self.errors, final)
UnicodeDecodeError: 'utf-8' codec can't decode bytes in position 4080-4081: invalid continuation byte
There are two issues here: (a) the task should be failed on that error, and (b) the error should not happen. Can you please attach the task's stderr file?
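Both points can be illustrated with a small sketch. This is not the RADICAL-Pilot implementation: `read_text_lossy` and `stage_stdio` are hypothetical helpers, the task is modeled as a plain dict with assumed keys, and the `DONE`/`FAILED` state names are placeholders. For (b), the file is opened with `errors='replace'`, so invalid bytes become U+FFFD instead of raising `UnicodeDecodeError`; for (a), any remaining I/O error marks the task as failed rather than leaving it unreported.

```python
import os
import tempfile


def read_text_lossy(path):
    """Read a file as UTF-8, replacing undecodable bytes with U+FFFD."""
    with open(path, 'r', encoding='utf-8', errors='replace') as f:
        return f.read()


def stage_stdio(task):
    """Hypothetical handler: collect stderr, fail the task on any I/O error."""
    try:
        task['stderr'] = read_text_lossy(task['stderr_path'])
        task['state'] = 'DONE'      # assumed state name
    except OSError as e:
        # (a) do not swallow the error: record it and fail the task
        task['stderr'] = ''
        task['error'] = repr(e)
        task['state'] = 'FAILED'    # assumed state name
    return task


# usage: a stderr file that contains an invalid UTF-8 continuation byte
fd, demo_path = tempfile.mkstemp()
with os.fdopen(fd, 'wb') as f:
    f.write(b'warning: \xe2\x28 oops\n')

ok = stage_stdio({'stderr_path': demo_path})
missing = stage_stdio({'stderr_path': demo_path + '.missing'})
os.remove(demo_path)
```

Replacing bad bytes loses a little information, but for a stderr excerpt that is usually preferable to stalling the task in the staging component.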
At the time the logs were checked, the task had been finished for 1 hour, and the agent_staging_output module had this tracing event as the last one for this task (it didn't progress further):
The task's stderr file is relatively large (but it seems that the stdout file was processed just fine), and the task wasn't reported back to the TMGR.
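Since the stderr file is large, one option (an assumption on my side, not necessarily what the agent does) is to read only a bounded tail of the file in binary mode and decode it leniently. `read_tail` and its `max_bytes` parameter are hypothetical names used for illustration only.

```python
import os
import tempfile


def read_tail(path, max_bytes=1024):
    """Return at most the last `max_bytes` bytes of a file, decoded leniently."""
    with open(path, 'rb') as f:
        size = f.seek(0, os.SEEK_END)       # seek() returns the new position
        f.seek(max(0, size - max_bytes))
        data = f.read()
    # the cut may land inside a multi-byte character, so decode with 'replace'
    return data.decode('utf-8', errors='replace')


# usage: a large file whose last line contains a non-ASCII character
fd, tail_path = tempfile.mkstemp()
with os.fdopen(fd, 'wb') as f:
    f.write(b'x' * 5000 + 'caf\u00e9\n'.encode('utf-8'))

tail = read_tail(tail_path, max_bytes=16)
os.remove(tail_path)
```

Working on bytes keeps the read size bounded regardless of the file's encoding, and the lenient decode means a cut in the middle of a character cannot raise.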