How to Determine Whether an Input SQL Query Needs Optimization? What Happens to SQL Queries That Do Not Require Optimization?

Thanks for the question! In practice we rely on both the LLM itself and our demonstration selection to deal with optimal input queries.

Firstly, in our demonstration pool, we have demonstrations that are optimal queries where no rewrite rules are applied to these demonstration queries. (in practice we align their outputs to be 'empty rule' for LLM to better understand) So when a potentially optimal query is given, the demonstration selected will highly likely to be an optimal demonstration without rewrite. In our experiments, many queries, especially from the easiest JOB workload, are already optimal and we manage not to rewrite them.

Secondly, we find that even in the zero-shot stage, LLM is able to identify some of the optimal queries and choose not to give additional rewrite rules. So this will be an insurance for the method to not rewrite optimal queries.

Thanks again for your interest and we hope this reply clarifies your doubts!

DAMO-NLP-SG / LLM-R2

How to Determine Whether an Input SQL Query Needs Optimization? What Happens to SQL Queries That Do Not Require Optimization? #1