Closed InfluenceFunctional closed 7 months ago
It's quite extremely slow on the full dataset - surely there's something we can do here.
consider substituting pandas for polars
https://blog.jetbrains.com/dataspell/2023/08/polars-vs-pandas-what-s-the-difference/#:~:text=Pandas%2C%20by%20default%2C%20uses%20eager,way%20of%20executing%20the%20code.
may be some issues around less permissive data formats, e.g., pyarrow.lib.ArrowInvalid: Can only convert 1-dimensional array values
pyarrow.lib.ArrowInvalid: Can only convert 1-dimensional array values
duplicate with issue #81
It's quite extremely slow on the full dataset - surely there's something we can do here.