Thanks for releasing such a high-quality and large-scale dataset. For the Two-Agent Task Completion (TATC) benchmark, the paper proposes a rule-based approach for both agents as the baseline. Do you have the plan to release the implementations of such rule-based agents? It would be very helpful to work on this benchmark with such baseline implementations.
The TATC code needs further internal review before we are able to release it, and this could take some time. I will keep this issue updated with the status on that.
Hi,
Thanks for releasing such a high-quality and large-scale dataset. For the Two-Agent Task Completion (TATC) benchmark, the paper proposes a rule-based approach for both agents as the baseline. Do you have the plan to release the implementations of such rule-based agents? It would be very helpful to work on this benchmark with such baseline implementations.
Thanks!