graphistry / pygraphistry

PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer.
BSD 3-Clause "New" or "Revised" License

[BUG] umap dirty_cat on colab #607

Open maksim-mihtech opened 3 weeks ago

maksim-mihtech commented 3 weeks ago

With the latest version of PyGraphistry, I get the following error when running g.umap():

[Screenshot: graphistry_bug]

lmeyerov commented 3 weeks ago

Thanks @maksim-mihtech

Any chance you can share the file produced by df.sample(100).to_parquet('nodes.parquet')?
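In case it helps with producing that file, here is a minimal sketch of the sampling step. The helper name `sample_for_repro` and the fixed seed are assumptions added for illustration, not part of PyGraphistry. It caps the sample size at the frame length, since `df.sample(100)` raises a `ValueError` on a frame with fewer than 100 rows.

```python
import pandas as pd


def sample_for_repro(df: pd.DataFrame, n: int = 100, seed: int = 0) -> pd.DataFrame:
    """Return up to n randomly chosen rows (hypothetical helper for a bug repro)."""
    # Cap n so small frames do not raise "Cannot take a larger sample than population".
    n = min(n, len(df))
    # A fixed random_state makes the sample reproducible across runs.
    return df.sample(n=n, random_state=seed)


# Toy frame standing in for the real nodes table.
toy = pd.DataFrame({"id": range(250), "label": ["a", "b"] * 125})
sample = sample_for_repro(toy)
```

The sample can then be written with sample.to_parquet('nodes.parquet'), which needs a parquet engine such as pyarrow or fastparquet installed.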

cc @silkspace

lmeyerov commented 3 weeks ago

Also, what versions of Python, dirty_cat, scikit-learn, and pandas are you on?
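One way to gather those versions in a single notebook cell is sketched below, using only the standard library. The function name `collect_versions` is hypothetical, and the package names in the list are assumed to match the installed distribution names (for example `scikit-learn` rather than `sklearn`). Anything not found is reported as "not installed".

```python
import platform
from importlib import metadata


def collect_versions(packages):
    """Map each distribution name to its installed version (hypothetical helper)."""
    versions = {"python": platform.python_version()}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The distribution is not installed under this name.
            versions[name] = "not installed"
    return versions


if __name__ == "__main__":
    # Distribution names below are assumptions; adjust to your environment.
    wanted = ["graphistry", "dirty_cat", "scikit-learn", "pandas", "umap-learn"]
    for name, version in collect_versions(wanted).items():
        print(f"{name}: {version}")
```

Pasting the printed lines into the issue gives everything asked for above in one go.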

lmeyerov commented 3 weeks ago

@maksim-mihtech It sounds like there is a repro in Colab. Can you share how you installed it for the repro, e.g. pip install graphistry[umap_learn]?

lmeyerov commented 3 weeks ago

More info:

redteam-umap-gtc-gpu.ipynb, and I installed graphistry(CPU)/graphistryai