I built jaxlib with debug symbols and grabbed a stack trace:
#0 raise (sig=<optimized out>) at ../sysdeps/unix/sysv/linux/raise.c:51
#1 <signal handler called>
#2 0x00007fd372341bd8 in __GI___pthread_timedjoin_ex (threadid=140545868896000, thread_return=0x0,
abstime=0x0, block=true) at pthread_join_common.c:40
#3 0x00007fd36c7339bf in blas_thread_shutdown_ ()
from /home/phawkins/.pyenv/versions/py3.8.6/lib/python3.8/site-packages/numpy/core/../../numpy.libs/libopenblasp-r0-5bebc122.3.13.dev.so
#4 0x00007fd37188789a in __libc_fork () at ../sysdeps/nptl/fork.c:96
#5 0x00007fd345661070 in tensorflow::SubProcess::Start (this=0x7fd1fdffd4a0)
at external/org_tensorflow/tensorflow/core/platform/default/subprocess.cc:210
#6 0x00007fd345501393 in stream_executor::CompileGpuAsm (cc_major=7, cc_minor=0,
ptx_contents=0x7fd1c809c670 "//\n// Generated by LLVM NVPTX Back-End\n//\n\n.version 6.0\n.target sm_70\n.address_size 64\n\n\t// .globl\tadd_16\n.extern .global .align 64 .b8 buffer_for_constant_219[4];\n\n.visible .entry add_16(\n\t.param .u6"..., options=...)
at external/org_tensorflow/tensorflow/stream_executor/gpu/asm_compiler.cc:249
#7 0x00007fd345500163 in stream_executor::CompileGpuAsm (device_ordinal=0,
ptx_contents=0x7fd1c809c670 "//\n// Generated by LLVM NVPTX Back-End\n//\n\n.version 6.0\n.target sm_70\n.address_size 64\n\n\t// .globl\tadd_16\n.extern .global .align 64 .b8 buffer_for_constant_219[4];\n\n.visible .entry add_16(\n\t.param .u6"..., options=...)
at external/org_tensorflow/tensorflow/stream_executor/gpu/asm_compiler.cc:150
#8 0x00007fd33fb124e2 in xla::gpu::NVPTXCompiler::CompileGpuAsmOrGetCachedResult (
this=0x563d97c2cba0, stream_exec=0x7fd218006a60,
ptx="//\n// Generated by LLVM NVPTX Back-End\n//\n\n.version 6.0\n.target sm_70\n.address_size 64\n\n\t// .globl\tadd_16\n.extern .global .align 64 .b8 buffer_for_constant_219[4];\n\n.visible .entry add_16(\n\t.param .u6"..., cc_major=7, cc_minor=0, hlo_module_config=..., relocatable=true)
at external/org_tensorflow/tensorflow/compiler/xla/service/gpu/nvptx_compiler.cc:377
#9 0x00007fd33fb11dc4 in xla::gpu::NVPTXCompiler::CompileTargetBinary (this=0x563d97c2cba0,
module_config=..., llvm_module=0x7fd1c8020050, gpu_version=..., stream_exec=0x7fd218006a60,
relocatable=true, debug_module=0x563dc0e82fa0)
This is probably an instance of OpenBLAS (used by NumPy) misbehaving in a process that also fork()s.
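As an illustration of the hazard (a minimal sketch of the suspected mechanism, not a reproducer I have verified against this issue):

```python
import os
import numpy as np  # assumes a NumPy wheel that bundles OpenBLAS

# Warm up OpenBLAS's worker thread pool with a parallel matmul.
a = np.ones((2048, 2048))
_ = a @ a

# fork() runs OpenBLAS's pthread_atfork handler (blas_thread_shutdown_ in
# frame #3 above), which joins the worker threads in the parent; if a worker
# is in a bad state, that pthread join can hang or crash, as in frame #2.
pid = os.fork()
if pid == 0:
    os._exit(0)  # the child does nothing; the problem is in the parent's atfork path
os.waitpid(pid, 0)
```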
https://github.com/xianyi/OpenBLAS/pull/3111 should fix the underlying OpenBLAS problem, I think. I can't confirm that 100%, because I was unable to reproduce the original issue with a self-built OpenBLAS, only with the one bundled with NumPy.
However, since it will take some time for any OpenBLAS fix to make it into a NumPy release and for that fix to reach users, I'll also look into avoiding calling pthread_atfork handlers when spawning a subprocess.
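One way to do that (a sketch of the general technique, not necessarily the exact TensorFlow change) is to spawn with posix_spawn, which on glibc does not run pthread_atfork handlers:

```python
import os
import sys

# posix_spawn creates the child without running the parent's pthread_atfork
# handlers (on glibc), so OpenBLAS's fragile handler is never invoked.
pid = os.posix_spawn(
    sys.executable,                               # path to the executable
    [sys.executable, "-c", "print('child ok')"],  # argv
    dict(os.environ),                             # environment
)
os.waitpid(pid, 0)
```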
With an upcoming fix to TensorFlow to avoid calling pthread_atfork() handlers, I am down to only two failures:
=========================== short test summary info ============================
FAILED tests/lax_numpy_test.py::NumpySignaturesTest::testWrappedSignaturesMatch
FAILED tests/pmap_test.py::PmapTest::test_replicate_backend - ValueError: com...
========== 2 failed, 10854 passed, 1198 skipped in 955.43s (0:15:55) ===========
The former is related to NumPy 1.20 on my machine and unrelated to GPU specifically.
I'm unsure about the latter: it doesn't appear when I run that one file in isolation, so I'm guessing it must have something to do with pytest running a particular combination of tests on one worker.
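One way to check that hypothesis (a sketch; it assumes the suite is parallelized with pytest-xdist's -n flag, which is my assumption about this setup):

```python
import subprocess
import sys

# The file on its own: test_replicate_backend passes here.
subprocess.run([sys.executable, "-m", "pytest", "tests/pmap_test.py"])

# The full suite on parallel workers: the failure shows up only here,
# presumably from some combination of tests scheduled onto one worker.
subprocess.run([sys.executable, "-m", "pytest", "-n", "auto", "tests/"])
```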
The pmap_test.py failure looks like this:
self = <pmap_test.PmapTest testMethod=test_replicate_backend>

    @jtu.skip_on_devices("cpu")
    def test_replicate_backend(self):
      # https://github.com/google/jax/issues/4223
      def fn(indices):
        return jnp.equal(indices, jnp.arange(3)).astype(jnp.float32)
      mapped_fn = jax.pmap(fn, axis_name='i', backend='cpu')
      mapped_fn = jax.pmap(mapped_fn, axis_name='j', backend='cpu')
      indices = np.array([[[2], [1]], [[0], [0]]])
>     mapped_fn(indices)  # doesn't crash
E     jax._src.traceback_util.FilteredStackTrace: ValueError: compiling computation that requires 4 logical devices, but only 1 XLA devices are available (num_replicas=4, num_partitions=1)
E
E     The stack trace above excludes JAX-internal frames.
E     The following is the original exception that occurred, unmodified.
E
E     --------------------

tests/pmap_test.py:1641: FilteredStackTrace
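For context on the error message: the nested pmap needs 2 × 2 = 4 replicas, but JAX exposes only a single CPU device unless more are forced. A sketch of how to provide them (the XLA flag is real; that the test harness relies on it is my assumption):

```python
import os

# Must be set before JAX initializes its backends.
os.environ["XLA_FLAGS"] = "--xla_force_host_platform_device_count=8"

import jax
import jax.numpy as jnp
import numpy as np

print(jax.device_count("cpu"))  # 8 with the flag above; 1 without it

def fn(indices):
    return jnp.equal(indices, jnp.arange(3)).astype(jnp.float32)

# Outer axis of size 2 times inner axis of size 2 = the 4 logical devices
# demanded by the ValueError above.
mapped_fn = jax.pmap(jax.pmap(fn, axis_name='i', backend='cpu'),
                     axis_name='j', backend='cpu')
indices = np.array([[[2], [1]], [[0], [0]]])
print(mapped_fn(indices).shape)  # (2, 2, 3)
```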
I think all the issues identified here are already fixed at head.
I'm unable to run all unit tests with jaxlib==0.1.60+cuda111. I suspect this is an issue for all GPU builds. It looks like there are other test failures too, but they didn't print due to the segfault. cc @hawkinsp