ipums / hlink

Hierarchical record linkage at scale
Mozilla Public License 2.0
12 stars 2 forks source link

Fix tests for Python 3.12 #151

Closed riley-harper closed 1 month ago

riley-harper commented 1 month ago

Fixes #150.

We need to install setuptools as a dev dependency so that pyspark can import distutils on Python 3.12. Python 3.12 removes distutils from the standard library, but pyspark still imports distutils. This touches on the fact that we "support" Python 3.12, but pyspark does not officially do so. So I added a bit to the README that clarifies that we can support Python 3.12 only so much as pyspark does. I added a couple of ideas to try if the users encounter issues with pyspark on Python 3.12. One of them is just to downgrade to Python 3.10 or 3.11, though.