Open Jianghao-Li opened 1 year ago
Hi @liccoco, I mainly changed the code in two places:
Besides, I chose a smaller learning rate (--lrb) for the BatchFormer module, because the network is quite small for cgqa.
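For readers unfamiliar with per-module learning rates, here is a minimal PyTorch sketch of the idea using optimizer parameter groups. It is an illustration only: the module names, the layer sizes, and both learning-rate values are hypothetical, and the way --lrb is actually wired up in the repository may differ.

```python
import torch
import torch.nn as nn

# Hypothetical stand-ins: a small base network and a BatchFormer-style encoder.
backbone = nn.Linear(16, 16)
bf_encoder = nn.TransformerEncoderLayer(
    d_model=16, nhead=4, dim_feedforward=16, dropout=0.5
)

lr = 5e-5   # hypothetical base learning rate
lrb = 5e-6  # hypothetical smaller rate for the BatchFormer module (cf. --lrb)

# Each dict is one parameter group with its own learning rate.
optimizer = torch.optim.Adam([
    {"params": backbone.parameters(), "lr": lr},
    {"params": bf_encoder.parameters(), "lr": lrb},
])
```

Parameter groups keep a single optimizer while letting the added module train more conservatively than the rest of the network.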
Feel free to ask if you have any questions.
Regards, Zhi Hou
Thanks for your reply. I found that the code on GitHub differs from the code in your paper, for example batchv1. I couldn't find the module from the paper's code in the actual repository. Could you also please explain the meaning of y in your paper? Thanks.
Hi @licoco, Thanks for your comment. The implementation is actually the same as the description in the paper. It only looks different because the CZSL code uses a semantic graph to infer the zero-shot classes. y is the label, which is the same as the pairs in CZSL. I did not implement a separate function for BatchFormer in CZSL, but the current implementation matches the description; please refer to the two links that I provided in my last comment. It is not necessary to change the CZSL code base to implement BatchFormer.
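To make the role of y concrete, here is a minimal sketch of a generic BatchFormer-style function in PyTorch. It is not the CZSL code from the repository: the function name, the feature size, and the toy inputs are assumptions for illustration. The idea shown is that a transformer encoder layer attends across the samples of a mini-batch, the original and transformed features are concatenated, and the labels y are duplicated so that both halves share the same targets.

```python
import torch
import torch.nn as nn

def batchformer(x, y, encoder, is_training):
    """x: features of shape [N, C]; y: labels of shape [N]."""
    if not is_training:
        # At test time the module is skipped entirely.
        return x, y
    pre_x = x
    # [N, C] -> [N, 1, C]: with batch_first=False the encoder treats
    # dim 0 as the sequence, so attention runs across the N samples.
    x = encoder(x.unsqueeze(1)).squeeze(1)
    # Keep both the original and the batch-transformed features,
    # and repeat the labels to match.
    x = torch.cat([pre_x, x], dim=0)
    y = torch.cat([y, y], dim=0)
    return x, y

# Toy usage with hypothetical sizes.
encoder = nn.TransformerEncoderLayer(
    d_model=16, nhead=4, dim_feedforward=16, dropout=0.5
)
x = torch.randn(8, 16)
y = torch.randint(0, 5, (8,))
out_x, out_y = batchformer(x, y, encoder, is_training=True)
```

In a CZSL setting, y would correspond to the attribute-object pair labels, so each pair label simply appears twice in the doubled batch.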
Regards,
Sorry to bother you. I don't understand how you insert BatchFormer into CZSL tasks. Could you please give me more details? Thanks.