Open Dianaia opened 1 month ago
Hi @Dianaia,
Thanks for your feedback and great question.
Actually, there's no contradiction.
I recommend following our instructions in practical use and employing the "vertical and slash" pattern entirely. Our tests have shown that this approach performs well across different models, sizes, and tasks.
Got it, I understand now. Thank you again for your explanation and outstanding work.
Confusion about Optimal Search Pattern Configuration
First of all, thank you for your outstanding research. I noticed that in Appendix E of the paper, it is mentioned that "according to the ablation study, using only the Vertical-Slash pattern significantly impacts performance in highly dynamic tasks like KV retrieval." However, the model configuration provided in the repository still uses the Vertical-Slash pattern exclusively. You mentioned in other comments that "the search_pattern function reroutes to vertical_and_slash because our tests have shown that this setting offers better generalization and efficiency across different context windows and tasks." This seems to contradict the conclusion given in the paper, which leaves me somewhat confused. Could you please clarify how we should set the optimal search pattern in practice?