However this leads to issues on some cluster nodes, where Java dependencies are not properly referenced. Java is required for tabula-py to properly work.
To avoid this, proposal to include the pre-extracted csvs to the data folder.
Scripts and rules have been updated accordingly to drop the tabula-py dependency.
Checklist
[X] I tested my contribution locally and it seems to work fine.
[X] Code and workflow changes are sufficiently documented.
[X] Changed dependencies are added to envs/environment.yaml.
Changes proposed in this Pull Request
Checklist
envs/environment.yaml
.