IBM / data-prep-kit

Open source project for data preparation for LLM application builders
https://ibm.github.io/data-prep-kit/
Apache License 2.0
307 stars 134 forks

Update README docs for language transforms #800

Open dolfim-ibm opened 1 week ago

dolfim-ibm commented 1 week ago

Why are these changes needed?

Updates the README docs for the pdf2parquet, doc_chunk and text_encoder transforms.

Related issue number (if any).

https://github.com/IBM/data-prep-kit/issues/753