Closed andreyvelich closed 1 week ago
@andreyvelich: GitHub didn't allow me to assign the following users: helenxie-bit, quloos.
Note that only kubeflow members with read permissions, repo collaborators and people who have commented on this issue/PR can be assigned. Additionally, issues/PRs can only have 10 assignees at the same time. For more information please see the contributor guide.
@kubeflow/wg-training-leads It looks like numpy 2.0 was released yesterday: https://github.com/numpy/numpy/issues/24300.
Since torchvision just installs the latest numpy version, I am pinning numpy==1.26.0 in our Trial images.
Otherwise, I see the following error from PyTorch:
A module that was compiled using NumPy 1.x cannot be run in
NumPy 2.0.0 as it may crash. To support both 1.x and 2.x
versions of NumPy, modules must be compiled with NumPy 2.0.
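For anyone who wants to confirm which NumPy an image ended up with, here is a minimal sketch of a version check. The helper names (`parse_major`, `numpy_is_1x`, `installed_numpy_version`) are hypothetical and are not part of Katib or the Trial images; the check only reads package metadata, so it does not import numpy or torch.

```python
from importlib import metadata
from typing import Optional


def parse_major(version: str) -> int:
    """Return the major component of a version string such as '1.26.0'."""
    return int(version.split(".")[0])


def numpy_is_1x(version: str) -> bool:
    """True when the given version string belongs to the NumPy 1.x series."""
    return parse_major(version) < 2


def installed_numpy_version() -> Optional[str]:
    """Return the installed numpy version, or None if numpy is not installed."""
    try:
        return metadata.version("numpy")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_numpy_version()
    if found is None:
        print("numpy is not installed")
    elif numpy_is_1x(found):
        print(f"numpy {found} is a 1.x release")
    else:
        # Assumption: the image ships modules compiled against NumPy 1.x.
        print(f"numpy {found} is 2.x or newer; consider pinning numpy==1.26.0")
```

Running this inside a built image should show whether the numpy==1.26.0 pin took effect or whether another dependency pulled in a newer release.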
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: tenzen-y
The full list of commands accepted by this bot can be found here.
The pull request process is described here
It works now! Thank you so much! @andreyvelich
After this PR: https://github.com/kubeflow/katib/pull/2304, the tune API doesn't work correctly. @helenxie-bit and @quloos identified a bug when using the tune API. If the user doesn't set the env_per_trial parameter, the Experiment creation fails with an error.
We should prioritise the unit test PR for the Katib SDK to help us detect an invalid SDK: https://github.com/kubeflow/katib/pull/2325 cc @tariq-hasan
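To illustrate the class of bug described above, here is a rough sketch of handling an optional parameter that defaults to None, plus the kind of unit test that would catch a regression. This is not the Katib SDK code or the fix in this PR: `build_env` and its return shape are hypothetical, and only the parameter name env_per_trial comes from the thread.

```python
from typing import Dict, List, Optional


def build_env(env_per_trial: Optional[Dict[str, str]] = None) -> List[Dict[str, str]]:
    """Hypothetical helper: turn an optional mapping into a list of env entries."""
    # Guard against the unset case instead of iterating over None.
    if env_per_trial is None:
        return []
    return [{"name": key, "value": value} for key, value in env_per_trial.items()]


def test_build_env_without_env_per_trial() -> None:
    # The case from the bug report: the caller never sets env_per_trial.
    assert build_env() == []


def test_build_env_with_values() -> None:
    assert build_env({"EPOCHS": "3"}) == [{"name": "EPOCHS", "value": "3"}]


if __name__ == "__main__":
    test_build_env_without_env_per_trial()
    test_build_env_with_values()
    print("ok")
```

A test for the unset case is the sort of check the unit test PR could add so that a missing optional argument is caught before release.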
/assign @johnugeorge @tenzen-y @helenxie-bit @quloos