DS4SD / docling-parse

Simple package to extract text with coordinates from programmatic PDFs
MIT License
29 stars 8 forks source link

Expose sanitize cells via python #56

Open PeterStaar-IBM opened 1 week ago

PeterStaar-IBM commented 1 week ago

To use the original pdf-content, we need to expose the method to sanitize pdf-cells via the high-level python API.