tony-framework / TonY

TonY is a framework to natively run deep learning frameworks on Apache Hadoop.
https://tony-project.ai

Get task executor's python subprocess exit detailed diagnostics message #651

Closed zuston closed 2 years ago

zuston commented 2 years ago

Why

Currently, TonY only gets the executor's exit code and attaches it to the YARN diagnostics panel. If we could also capture the detailed diagnostics message from the Python subprocess when it exits, statistical analysis and debugging would be easier.

Maybe the first step is to write the Python process's standard error and standard output to separate files.

Then we could read the error message from the standard error file whenever the exit code is non-zero.
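
A minimal sketch of the idea, written in Python purely for illustration; it is not TonY's executor code. The function name `run_and_diagnose`, the file names `stdout.log` and `stderr.log`, and the `tail_lines` limit are all hypothetical choices. The child process gets separate files for its two streams, and only on a non-zero exit is the tail of the error file read back as the diagnostics message.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def run_and_diagnose(cmd, log_dir, tail_lines=20):
    """Run cmd with stdout and stderr in separate files.

    Returns (exit_code, diagnostics). diagnostics is empty on success,
    otherwise the last `tail_lines` lines of the stderr file.
    All names here are hypothetical, chosen for illustration only.
    """
    log_dir = Path(log_dir)
    log_dir.mkdir(parents=True, exist_ok=True)
    stdout_path = log_dir / "stdout.log"  # assumed file name
    stderr_path = log_dir / "stderr.log"  # assumed file name

    # Step 1: send the two streams to different files.
    with open(stdout_path, "wb") as out, open(stderr_path, "wb") as err:
        exit_code = subprocess.run(cmd, stdout=out, stderr=err).returncode

    # Step 2: only read the error output when the process failed.
    if exit_code == 0:
        return exit_code, ""

    lines = stderr_path.read_text(errors="replace").splitlines()
    return exit_code, "\n".join(lines[-tail_lines:])


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        failing = [sys.executable, "-c", "raise ValueError('bad input')"]
        code, message = run_and_diagnose(failing, tmp)
        print(code)
        print(message)
```

Keeping only the tail of the error file bounds the size of the message, which matters if it is later attached to a diagnostics field with limited space; the right limit is an open choice.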

TODO