instructlab / sdg

Python library for Synthetic Data Generation
https://pypi.org/project/instructlab-sdg/
Apache License 2.0
24 stars 37 forks source link

[Epic] Fully Utilize Docling V2 Capabilities #374

Open ktam3 opened 1 week ago

ktam3 commented 1 week ago

Goals:

  1. Hierarchical Chunking
  2. All v2 document types are supported

Tasks: