huggingface / datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
https://huggingface.co/docs/datasets
Apache License 2.0
19.31k stars 2.7k forks source link

New feature type: Document #7058

Open severo opened 4 months ago

severo commented 4 months ago

It would be useful for PDF.

https://github.com/huggingface/dataset-viewer/issues/2991#issuecomment-2242656069