Open stichbury opened 1 year ago
I've been playing with ChatGPT (https://chat.openai.com/chat) a bit recently and there's definitely some scope.
Recently, we've seen the ChatGPT project put out a beta of what I was mumbling about above. Thanks OpenAI 👯
While it's not yet ready, there are some ways we can prepare for a future where Kedro users turn to ChatGPT to answer their queries. A rough list:
Looks like this will be possible; Dagster built something like this: https://dagster.io/blog/chatgpt-langchain
This ticket should also probably link to #1649
@astrojuanlu Also this one up for discussion
Somebody pointed me in this direction: https://langchain.readthedocs.io/en/latest/use_cases/question_answering.html
Relevant tool https://www.mendable.ai/?s=03
More: https://docsbot.ai/
It's crazy how quickly this space evolves. It's quite feasible to build one with langchain; you can also limit the context it reads to docs.kedro.org and generate answers with relevant links only from the docs (so it's not making random stuff up).
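To make the "only answer from the docs, with links" idea concrete, here is a rough sketch of the retrieval half. It is not langchain and not a real implementation: it uses a plain bag-of-words cosine score from the standard library instead of embeddings, and `DOCS`, the `docs.example` URLs, `retrieve` and `build_prompt` are all hypothetical names made up for illustration. The point is only that the model is handed a few retrieved passages plus their links, and nothing at all when no passage matches.

```python
import math
import re
from collections import Counter

# Hypothetical documentation snippets; the URLs and text are placeholders,
# not real pages.
DOCS = [
    {"url": "https://docs.example/data-catalog",
     "text": "The data catalog registers datasets so pipelines can load and save data by name."},
    {"url": "https://docs.example/pipelines",
     "text": "A pipeline is a collection of nodes wired together by their inputs and outputs."},
    {"url": "https://docs.example/faq",
     "text": "Frequently asked questions about installation and project setup."},
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(question, docs, k=2):
    """Return up to k docs with non-zero similarity to the question, best first."""
    q = Counter(tokenize(question))
    scored = [(cosine(q, Counter(tokenize(d["text"]))), d) for d in docs]
    scored = [(s, d) for s, d in scored if s > 0]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for _, d in scored[:k]]


def build_prompt(question, docs, k=2):
    """Build a prompt restricted to retrieved passages, or None if nothing matches."""
    hits = retrieve(question, docs, k)
    if not hits:
        return None  # nothing relevant: better to say so than to guess
    context = "\n".join(f"[{d['url']}] {d['text']}" for d in hits)
    return (
        "Answer using only the passages below and cite their links. "
        "If they do not contain the answer, say so.\n\n"
        f"{context}\n\nQuestion: {question}"
    )
```

Calling `build_prompt("How do I load data from the catalog?", DOCS)` gives a prompt containing only the matching passage and its link; a question with no overlap returns `None`, which the bot could turn into "I couldn't find this in the docs". A real version would presumably swap the scoring for embeddings and send the prompt to a model.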
"Don't replace your user community with an LLM-based chatbot" https://thisisimportant.net/posts/user-community-llm-chatbots/
Beware of pushback from the tech community https://github.com/mdn/yari/issues/9208
This is interesting https://docs.danswer.dev/introduction https://github.com/danswer-ai/danswer
"Don't replace your user community with an LLM-based chatbot" https://thisisimportant.net/posts/user-community-llm-chatbots/
I don't think it was ever a binary choice of either/or, was it? If the community want a knowledgebase, let's give it to them alongside current options...we're not proposing to remove anything.
@noklam and I worked on this as part of the Quantazio Hack. It would be good to continue the work as part of the ongoing docs effort.
If we ever get to this, probably we'd use some form of Retrieval Augmented Generation (RAG), see https://github.com/imartinez/privategpt
Related: publishing a custom GPT on Kedro, MLOps? https://help.openai.com/en/articles/8798878-building-and-publishing-a-gpt
We need a good way for Kedro users to get answers to their questions. Right now, they could search the Discord Linen archive or Slack for previous discussions, but the UX isn't great. Or they can look at our written FAQ (which they do -- it's a popular page), but it doesn't deliver what they expect (it doesn't go into specific answers to specific questions). They probably end up on Google looking for answers, and maybe Stack Overflow.
It would be great to run an NLP chatbot that has been trained on our archives and documentation and can have a stab at an answer or link users to the right location to start their research. This is the holy grail of all documentation though, and does rely on a decent knowledgebase to train it, which we probably don't have (at least, we have content, but it's not clear whether it is suitable).
I think we need to first investigate the state of this kind of solution and then look at whether we can apply it to a Slackbot for Kedro.
This is early days, but here are a few links:
This issue is to seed some discussion and potentially earmark some time for research at a hackathon or similar spike.