Open rossjones opened 1 year ago
still needed?
Sort of. This was really some thinking around Logs v2 where we move it out of the current download and into a place where it is published independently - that way the logs could be pushed to a org's chosen log platform, or just S3 or whatever. Happy to close this in favour of a logs v2 doc.
Still needed so far. Options we should consider:
Overall, I believe option 1 will solve most of the issues for now, and then we can explore option 2 in the future
When a job is complete, the output from that job (stdout, stderr and actual output) are available from the place where they are published, but no longer from the compute node. As a user, I want to see the logs for a job that completed earlier today and currently that is problematic, giving me only a summary unless I want to download the potentially large ouput from the job.
Alternatively we could download all of the output from the job, and from there access the stdout and stderr files containing the output from the job. We could improve this by enabling the downloading of single files from the output. This still feels like the wrong solution, as it enforces us having the logs live with the output data and this seems inflexible.
Instead, we should separate out long-term storage of the logs from the storage of the output data. Whereas currently they live in the output folder and are stored in the same location, we should enable configuration of a single storage provider for log files. Initially this could default to using IPFS and have the CID retrievable in the same way we retrieve the output CID. It is also worth considering whether we need the separation of logs (stdout/stderr) or whether we can store them in a single file.
TODO
logs
command to fetch from storage if execution completed.