Hi,
I found that the settings of learning rate, weight decay and lr scheduler of domain adaptation method are inconsistent. Is it necessary to set the same training parameters for different methods? And does the benchmark of different task algorithms provided in the document follow the same training parameters?
Hi, I found that the settings of learning rate, weight decay and lr scheduler of domain adaptation method are inconsistent. Is it necessary to set the same training parameters for different methods? And does the benchmark of different task algorithms provided in the document follow the same training parameters?