elixir-nx / xla

Pre-compiled XLA extension
Apache License 2.0

Run with ROCM not working #42

Closed (enzoqtvf closed 1 year ago)

enzoqtvf commented 1 year ago

Hey team, sorry for the title, I couldn't find a better one.

My issue is closely related to: https://github.com/elixir-nx/xla/issues/29

I attempted to build and run XLA with the ROCm target. I followed the instructions in the issue above and managed to build XLA with the ROCm target, which is great! Here are the steps I followed:

After that, I get this error and the Elixir application exits:

terminate called after throwing an instance of 'std::bad_variant_access'
  what():  Unexpected index
Aborted (core dumped)

I'm not sure this is the right place to ask, but if anyone knows how to solve this, or if there is a better way to do what I'm doing (a specific repo/hash combination that works, or a different Docker image), that would help me a lot :)

Thank you very much, and let me know if more information is needed :)

seanmor5 commented 1 year ago

Can you try bumping to EXLA 0.5.3?

enzoqtvf commented 1 year ago

@seanmor5 That was it! Thank you very much for your help!
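
For anyone landing here with the same `std::bad_variant_access` abort, the fix confirmed in this thread was bumping EXLA to 0.5.3. A minimal sketch of what that bump might look like, assuming a standard Mix project (the module name `MyApp.MixProject`, the app name `:my_app`, and the Elixir version requirement are hypothetical placeholders, not taken from the original report):

```elixir
# mix.exs (hypothetical project; names below are placeholders)
defmodule MyApp.MixProject do
  use Mix.Project

  def project do
    [
      app: :my_app,
      version: "0.1.0",
      elixir: "~> 1.14",
      deps: deps()
    ]
  end

  defp deps do
    [
      # "~> 0.5.3" allows 0.5.3 and later 0.5.x releases, but not 0.6.0.
      {:exla, "~> 0.5.3"}
    ]
  end
end
```

After editing the requirement, running `mix deps.update exla` should fetch the newer release. Whether the locally built ROCm XLA archive also needs rebuilding depends on the setup, which this thread does not cover.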