jrmuizel / pdf-extract

A rust library for extracting content from pdfs
368 stars 75 forks source link

Performance: use nom_parser in lopdf instead of pom_parser #33

Open badicsalex opened 2 years ago

badicsalex commented 2 years ago

Apparently lopdf is also changing to nom_parser as a default, but since this is forced in pdf-extract's Cargo.toml, it should be modified there too. The performance change is dramatic, for one of my sample files I get a 60%+ reduction in runtime.

See https://github.com/J-F-Liu/lopdf/issues/157