Closed haoliplus closed 8 months ago
I'm attempting to reproduce the issue. It did not fail with the same command on an M1 Max. How much RAM does your M2 pro have?
It did fail on an M1 mini with 8 GB...presumably it's an OOM issue but the message is not helpful
I'm attempting to reproduce the issue. It did not fail with the same command on an M1 Max. How much RAM does your M2 pro have?
The memory of my M2 pro is 16GB. This error does not occur in mlx0.3, suggesting that memory may not be the main cause, I think. And I have two devices(16GB M2 air and 16GB M2 Pro mini). The error occurs on both devices.
Hmm actually I was able to repro the bug in 0.3 and 0.4. I believe the fix is in https://github.com/ml-explore/mlx/pull/752
commannd(mlx-example: 47dd6bd17f3cc7ef95672ea16e443e58ce5eb1bf)
error(all):
My environment:
I attempted to locate the error by adding a log, and I found it in
vae.py:Decoder:__call__
. However, when I tried to investigate further by adding another log(addprint(x)
at this line), the error disappeared. It seems that some variable is released too early?