Regarding schema filtering

RUCKBReasoning / codes

The source code of CodeS (SIGMOD 2024).

Apache License 2.0

Hi. I am trying to understand how to use the schema filtering component. Given the entire schema of a database, how can I get the tables and columns relevant to a given natural-language (NL) query, and how should I prepare my data to get there? I have seen that RoBERTa is used in schema_filter.py, but I cannot work out how the data needs to be prepared. I have labeled my data, but the labels are highly imbalanced; how should I approach this problem? Could you please explain how schema filtering is handled in CodeS?

Regarding schema filtering #19