kubeflow / katib

Automated Machine Learning on Kubernetes
https://www.kubeflow.org/docs/components/katib
Apache License 2.0
1.51k stars 443 forks source link

[GSoC] Provide a PyTorch MNIST Example for Push-based Metrics Collection #2437

Open Electronic-Waste opened 1 month ago

Electronic-Waste commented 1 month ago

What this PR does / why we need it:

Provide a Pytorch MNIST example for push-based metrics collection.

Which issue(s) this PR fixes _(optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close the issue(s) when PR gets merged)_: Fixes #

Checklist:

google-oss-prow[bot] commented 1 month ago

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: Once this PR has been reviewed and has the lgtm label, please assign andreyvelich for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files: - **[OWNERS](https://github.com/kubeflow/katib/blob/master/OWNERS)** Approvers can indicate their approval by writing `/approve` in a comment Approvers can cancel approval by writing `/approve cancel` in a comment
review-notebook-app[bot] commented 1 month ago

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

tenzen-y commented 1 month ago

@Electronic-Waste Could we implement CI powered by papermill first in a separate PR?

Electronic-Waste commented 1 month ago

It seems that #2417 mentioned it and has already been taken by someone. I'll ask the assignee about the progess. And if he doesn't have time, I would be glad to take it :)

Please let me know your thoughts. cc @tenzen-y @andreyvelich

tenzen-y commented 1 month ago

It seems that #2417 mentioned it and has already been taken by someone. I'll ask the assignee about the progess. And if he doesn't have time, I would be glad to take it :)

Please let me know your thoughts. cc @tenzen-y @andreyvelich

As Andrey mentioned in the issue, we are working on it at the traininig-operator, first. So, let us come back here after the papermill implementation is completed in the training operator.

Electronic-Waste commented 1 month ago

Yeah, I see! I'm looking forward to enhance my understanding of Training-Operator through this issue :)