m3dev / gokart

Gokart solves reproducibility, task dependencies, constraints of good code, and ease of use for Machine Learning Pipeline.
https://gokart.readthedocs.io/en/latest/
MIT License
305 stars 57 forks source link

Proposing new feature: dump dependent Tasks information as a table #231

Closed mski-iksm closed 3 years ago

mski-iksm commented 3 years ago

I thought dumping a dependent tasks information table after running tasks is convenient for post analysis.

Task cache information table is a pandas dataframe which has information such as task name, unique id, cache file path, parameter and processing time of completed tasks. Implementation was considered in https://github.com/m3dev/gokart/pull/176, however the PR was closed due to difficulty in behavior setting.

mski-iksm commented 3 years ago

~~I'll first implement this feature in gokart.build(). Implementation to gokart.run() will be done in different issue/PR, since gokart.run() has difficulty in parameter settings.~~