deepset-ai / haystack

:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
https://haystack.deepset.ai
Apache License 2.0
14.51k stars 1.7k forks source link

JSON Convertor : Pipeline Component #7784

Open srini047 opened 1 month ago

srini047 commented 1 month ago

Is your feature request related to a problem? Please describe. Currently we have a .txt to Document convertor besides others and unstructured. But I see most of the data we deal with are in the form of JSON.

Describe the solution you'd like So a .json to Document convertor will be a bread winner while consuming API data in pipelines.

Describe alternatives you've considered Unstructured file convertor is present but JSON schema as a individual convertor adds more sense and value.

julian-risch commented 3 weeks ago

@srini047 Thank you for your suggestion. If you like, feel free to open a pull request. Our contributing guidelines are here.

arminnajafi commented 1 week ago

@julian-risch I like to start contributing Haystack. Do you think this can be a good starter? In that case, please feel free to assign it to me.

Thanks,

CarlosFerLo commented 3 days ago

@arminnajafi I have been contributing for a month or so, and they do not normally assign external people to issues, or at least that I have seen. If you want to do this one, feel free to open a PR, they will review it when you are ready.