Open PhilipVinc opened 3 months ago
This is the error I get. I can also share a reproducer if wanted.
Traceback (most recent call last):
File "/mnt/beegfs/project/ndqm/test_luca/time_evolution.py", line 570, in <module>
obs_dict = solve_variational_evolution(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/beegfs/project/ndqm/test_luca/time_evolution.py", line 299, in solve_variational_evolution
step_function = integration_algorithm(dt, H, exp_x)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/beegfs/project/ndqm/test_luca/time_evolution.py", line 363, in step_explicit_O2
exp_z = nkj.operations.get_apply_exp_diagH(Hd)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/beegfs/workdir/filippo.vicentini/mambaforge/envs/ENV_NAME/lib/python3.11/site-packages/netket_pro/jumps/operations/exact_ops_on_FrozenExtendedNet.py", line 41, in get_apply_exp_diagH
i, j = ij.T
^^^^
File "/mnt/beegfs/workdir/filippo.vicentini/mambaforge/envs/ENV_NAME/lib/python3.11/site-packages/jax/_src/numpy/lax_numpy.py", line 630, in transpose
return lax.transpose(a, axes_)
^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/beegfs/workdir/filippo.vicentini/mambaforge/envs/ENV_NAME/lib/python3.11/site-packages/jax/_src/lax/lax.py", line 986, in transpose
return transpose_p.bind(operand, permutation=permutation)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/beegfs/workdir/filippo.vicentini/mambaforge/envs/ENV_NAME/lib/python3.11/site-packages/jax/_src/core.py", line 387, in bind
return self.bind_with_trace(find_top_trace(args), args, params)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/beegfs/workdir/filippo.vicentini/mambaforge/envs/ENV_NAME/lib/python3.11/site-packages/jax/_src/core.py", line 391, in bind_with_trace
out = trace.process_primitive(self, map(trace.full_raise, args), params)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/beegfs/workdir/filippo.vicentini/mambaforge/envs/ENV_NAME/lib/python3.11/site-packages/jax/_src/core.py", line 879, in process_primitive
return primitive.impl(*tracers, **params)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/beegfs/workdir/filippo.vicentini/mambaforge/envs/ENV_NAME/lib/python3.11/site-packages/jax/_src/dispatch.py", line 86, in apply_primitive
outs = fun(*args)
^^^^^^^^^^
jaxlib.xla_extension.XlaRuntimeError: INTERNAL: ptxas exited with non-zero error code 11, output: : If the error message indicates that a file could not be written, please verify that sufficient filesystem space is provided.
--------------------
For simplicity, JAX has removed its internal frames from the traceback of the following exception. Set JAX_TRACEBACK_FILTERING=off to include these.
/mnt/beegfs/project/ndqm/test_luca/time_evolution.py:542: UserWarning: Data has no positive values, and therefore cannot be log-scaled.
ax.set_yscale("log")
/mnt/beegfs/project/ndqm/test_luca/time_evolution.py:559: UserWarning: Data has no positive values, and therefore cannot be log-scaled.
ax[0,i].set_yscale("log")
Thanks for raising this. Yes, can you share a reproducer?
The error talks about filesystem issues ("If the error message indicates that a file could not be written, please verify that sufficient filesystem space is provided"). Could there be a permissions issue?
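One quick way to rule out the filesystem side is a small check like the sketch below. It assumes the compiler writes its scratch files to the default temporary directory that Python reports (`tempfile.gettempdir()`, which honours `TMPDIR`); that is an assumption, not something the error message confirms. `check_temp_dir` is a hypothetical helper name, not part of JAX.

```python
import shutil
import tempfile


def check_temp_dir(path=None):
    """Report whether `path` is writable and how much free space it has.

    Assumption: the directory of interest is the default temp directory
    unless another path is passed explicitly.
    """
    path = path or tempfile.gettempdir()
    report = {
        "path": path,
        "free_bytes": shutil.disk_usage(path).free,
        "writable": False,
    }
    try:
        # Create and write a real file: a pure permission check can be
        # misleading on network filesystems.
        with tempfile.NamedTemporaryFile(dir=path) as handle:
            handle.write(b"x")
            handle.flush()
        report["writable"] = True
    except OSError as exc:
        report["error"] = str(exc)
    return report


if __name__ == "__main__":
    print(check_temp_dir())
```

Running this on the compute node where the job fails (rather than on the login node) is the more informative test, since the two may mount different filesystems.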
Description
I am consistently getting an error from a complicated piece of code
after installing jax/jaxlib in a clean environment.
I also made sure that nothing is set in my
LD_LIBRARY_PATH.
Is there some way to debug this?
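As a starting point for debugging, the environment that the Python process actually sees can be dumped with a sketch like the following. It only uses the standard library; `cuda_env_report` is a hypothetical helper name, and matching PATH entries on the substring "cuda" is a rough heuristic rather than a reliable detection method.

```python
import os
import shutil


def cuda_env_report(environ=None):
    """Collect the environment details most relevant to a ptxas failure."""
    environ = os.environ if environ is None else environ
    search_path = environ.get("PATH", "")
    entries = [p for p in search_path.split(os.pathsep) if p]
    return {
        # Should be empty if nothing is set, as described above.
        "LD_LIBRARY_PATH": environ.get("LD_LIBRARY_PATH", ""),
        # Heuristic: any PATH entry that mentions "cuda".
        "cuda_path_entries": [p for p in entries if "cuda" in p.lower()],
        # Which executables would be picked up from PATH, if any.
        "ptxas": shutil.which("ptxas", path=search_path),
        "nvidia_smi": shutil.which("nvidia-smi", path=search_path),
    }


if __name__ == "__main__":
    for key, value in cuda_env_report().items():
        print(f"{key}: {value!r}")
```

If `ptxas` resolves to a system-wide binary, that would suggest the cluster installation is visible to the process even though CUDA was not added to the path deliberately.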
System info (python version, jaxlib version, accelerator, etc.)
(The nvidia-smi that is being picked up is from the cluster installation, but CUDA is not in my path.)