nlmatics / nlm-ingestor

This repo provides the server-side code that the llmsherpa API connects to. It includes parsers for various file formats.
https://www.nlmatics.com
Apache License 2.0
1.11k stars, 160 forks

[PDF Ingestor] make sure key idx within the range of sorted freq keys #36

Closed baobo5625 closed 8 months ago

baobo5625 commented 8 months ago

Description of the change

Make sure the key index stays within the range of the sorted frequency keys, to avoid ingestion failures for certain PDF files.
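For reviewers, here is a minimal sketch of the kind of guard this change describes. It is not the code from this PR: the function name `pick_freq_key`, the `Counter` input, and the example style labels are all hypothetical, and the real ingestor may store and sort its frequency keys differently. The idea is only that an index into the sorted keys is clamped before use, so a value past the end can no longer raise `IndexError`.

```python
from collections import Counter
from typing import Hashable, Optional


def pick_freq_key(freq: Counter, key_idx: int) -> Optional[Hashable]:
    """Return the key at rank `key_idx` (most frequent first).

    Hypothetical helper: the index is clamped into the valid range
    instead of being used directly, so an out-of-range value falls
    back to the nearest valid key.
    """
    # Keys ordered from most to least frequent.
    sorted_keys = [key for key, _count in freq.most_common()]
    if not sorted_keys:
        # Nothing to choose from, e.g. a document with no usable text.
        return None
    # Clamp into [0, len(sorted_keys) - 1].
    safe_idx = max(0, min(key_idx, len(sorted_keys) - 1))
    return sorted_keys[safe_idx]


# Illustrative data only: two made-up style labels with their counts.
style_freq = Counter({"10pt": 5, "12pt": 2})
second_most_common = pick_freq_key(style_freq, 1)
out_of_range = pick_freq_key(style_freq, 5)
```

Clamping silently falls back to the last available key. Whether that or an explicit error is the better behavior depends on how the caller uses the result.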

Related issues

Fix [#1]()