IBM / data-prep-kit

Open source project for data preparation of LLM application builders
https://ibm.github.io/data-prep-kit/
Apache License 2.0
314 stars 135 forks source link

Html2parquet example #804

Closed touma-I closed 1 day ago

touma-I commented 1 week ago

Why are these changes needed?

This PR includes a example showing how html2parquet can be used in a notebook

Related issue number (if any).

https://github.com/IBM/data-prep-kit/issues/788

touma-I commented 1 week ago

@shahrokhDaijavad @sungeunan-ibm Before I forget and drop this thread, please review/edit/approve as you see fit. this is the notebook showing how to invoke the html2parquet from within a notebook that I created last week. It can be further evolved based on direct reviews. Thanks

touma-I commented 1 day ago

Superseded by another PR