dfo-mar-odis / saraDataScraping

Repo to hold code and project management for the SARA data scraping project
MIT License
0 stars 0 forks source link

bare bones version of python class, has issues... #12

Closed stoyelq closed 2 years ago

stoyelq commented 2 years ago

Basic python code class to start working on #6

Currently has two methods: init takes in and save a file path scrape_pdf reads the pdf passed in init, sets a list of the tables. Major current issue is lots of garbage rows in the parsed data, will need significant cleaning somehow.