alan-turing-institute / CleverCSV

CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
https://clevercsv.readthedocs.io
MIT License
1.24k stars 70 forks source link

Allow avoidance of Pandas dependency #101

Closed sanmai-NL closed 9 months ago

sanmai-NL commented 1 year ago

Pandas is a large dependency. That‘s incompatible with using CleverCSV as pre-commit hook under CI. I have scanned the source code and this does not take away my intuition that Pandas isn't strictly needed for CleverCSV. Alternatively or additionally, could you transition to Polars instead of Pandas? That has subpackages.

GjjvdBurg commented 9 months ago

Thanks for calling this out, it's fixed in v0.8.2. The pre-commit hook has been updated to depend only on the clevercsv[precommit] installation target. This target only has the required dependencies needed for the precommit hook (details), which does not include pandas.

You can update your .pre-commit-config.yaml file with rev: v0.8.2 to use the update.