Samagra-Development / Text2SQL


Implement DSP #36

Open ChakshuGautam opened 1 year ago

ChakshuGautam commented 1 year ago

The next level of QA over your dataset cannot be reached by embedding all of the data and semantics in a single prompt. It has to be iterative, fetching only the relevant information from a retriever that supports natural-language queries for additional context (data, metadata about the SQL schema, design concepts). This should lead to far more accurate responses from the system.

It would also avoid embedding 2k lines of SQL schema in the prompt, which reduces the need for a larger context window, cuts cost, and increases speed.
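To make the idea concrete, here is a rough sketch of retrieving only the schema pieces relevant to a question before building the prompt. It is not the DSP implementation this issue asks for: a toy keyword-overlap scorer stands in for a real retriever, and the table definitions, `SCHEMA_DOCS`, `retrieve`, and `build_prompt` are all hypothetical names made up for illustration.

```python
import re

# Hypothetical schema snippets (assumption); a real setup would load the
# project's own DDL and likely use an embedding-based retriever instead.
SCHEMA_DOCS = {
    "students": "CREATE TABLE students (id INT, name TEXT, school_id INT, grade INT)",
    "schools": "CREATE TABLE schools (id INT, name TEXT, district TEXT)",
    "attendance": "CREATE TABLE attendance (student_id INT, date DATE, present BOOLEAN)",
}


def tokenize(text):
    """Lowercase the text and split it into a set of alphabetic tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))


def retrieve(question, docs, k=2):
    """Return up to k table names ranked by token overlap with the question."""
    q_tokens = tokenize(question)
    scored = []
    for name, ddl in docs.items():
        score = len(q_tokens & tokenize(name + " " + ddl))
        if score > 0:  # drop tables that share no tokens with the question
            scored.append((score, name))
    # Highest score first; ties broken alphabetically for stable output.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [name for _, name in scored[:k]]


def build_prompt(question, docs, k=2):
    """Build a prompt containing only the retrieved schema snippets."""
    context = "\n".join(docs[name] for name in retrieve(question, docs, k))
    return f"Schema:\n{context}\n\nQuestion: {question}\nSQL:"


if __name__ == "__main__":
    print(build_prompt("How many students were present on each date?", SCHEMA_DOCS))
```

With this example question, only the `attendance` and `students` definitions end up in the prompt, and `schools` is left out. An iterative version would run further retrieval rounds whenever the model asks for more context.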

The current task includes building a Colab notebook for