aphp / edspdf

EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to classify text blocks as body or metadata, using rule-based or machine-learning-based approaches.
https://aphp.github.io/edspdf/
BSD 3-Clause "New" or "Revised" License

fix: allow recursive pdf file searching #26

Closed: percevalw closed this 8 months ago

percevalw commented 8 months ago

Description

Allow PDFs in subfolders to be found when calling edspdf.data.read_files(path).
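
To illustrate the behavior described above, here is a minimal sketch of a recursive PDF search using only the standard library. It is not the code changed in this PR: `find_pdfs` and its `recursive` parameter are hypothetical names chosen for the example, and the actual implementation behind `edspdf.data.read_files` may differ.

```python
from pathlib import Path
from typing import Iterator, Union


def find_pdfs(root: Union[str, Path], recursive: bool = True) -> Iterator[Path]:
    """Yield PDF files under `root` (hypothetical helper, not part of edspdf)."""
    root = Path(root)
    # rglob descends into every subfolder; glob only inspects `root` itself.
    matcher = root.rglob if recursive else root.glob
    # Sort for a deterministic order, and skip directories named like "x.pdf".
    for path in sorted(matcher("*.pdf")):
        if path.is_file():
            yield path
```

With `recursive=False` the helper only sees files placed directly in `root`, which is presumably the limitation this fix addresses.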

codecov[bot] commented 8 months ago

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.60%. Comparing base (22f20f2) to head (bc36a84).

Additional details and impacted files

```diff
@@           Coverage Diff           @@
##             main      #26   +/-   ##
=======================================
  Coverage   98.60%   98.60%
=======================================
  Files          46       46
  Lines        3012     3015    +3
=======================================
+ Hits         2970     2973    +3
  Misses         42       42
```
