Closed micklynch closed 1 year ago
I think I've narrowed the issue down to UMAP.
Also, in case it matters, I'm on an Intel Mac. Thanks!
I believe I found the solution by searching through UMAP's issue log. For those interested:
Setting the environment variable NUMBA_DISABLE_JIT to 1 prevents the segfault.
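For anyone who would rather set this from inside the script than in the shell, here is a minimal sketch. It assumes the variable is set at the very top of the entry script (for example prep-data.py), before anything imports numba or UMAP, since numba reads its configuration when it is first imported. The helper name `disable_numba_jit` is made up for illustration and is not part of this project.

```python
import os


def disable_numba_jit() -> None:
    """Hypothetical helper: ask numba to skip JIT compilation."""
    # Numba reads this variable when it is first imported, so it has to be
    # set before any `import numba` or `import umap` runs in this process.
    os.environ["NUMBA_DISABLE_JIT"] = "1"


disable_numba_jit()

# Only import UMAP (or anything else that pulls in numba) after this point:
# import umap
```

Note that with the JIT disabled, numba-decorated functions run as ordinary Python, so UMAP will likely be noticeably slower. That is probably fine for a ~200 sentence dataset but may hurt on larger ones.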
Ah yeah, sorry for the late reply, UMAP + numba issues really do pop up now and again.
Awesome project, I've been meaning to check it out for a while.
I ran into this error when running python prep-data.py and wondered if anyone else has encountered it. I reduced the dataset to ~200 sentences in case it was a memory issue.
I'm using Poetry for package management.
Any tips or advice greatly appreciated.