hawkfish / textform

A data transformation pipeline library based on Potter's Wheel.
MIT License
7 stars 0 forks source link

Add pdf support #9

Open hawkfish opened 3 years ago

hawkfish commented 3 years ago

This should wrap tabula-py or camelot-py.

PDF tables can be very messy, so being able to clean them up with filling and other shaping tools is very useful.

hawkfish commented 3 years ago

See How to Extract PDF Tables in Python