Closed llm-96 closed 7 months ago
Sorry for the later reply. I think the main reason is that for cross-dataset test, we use our test-time optimisation strategy as described in the paper. Please check the config files that use test-time optimisation for testing, e.g. DT4D-H interclass.
Hi, thanks for the great work. I am running inference given your provided checkpoints. In Table 3, the result for (train-test) Faust-Faust, Scape-Scape matches with what you have reported. However, the cross dataset F-S, S-F is quite bad. F-S gains 6.66 (2.2 reported), and S-F gains 4.58 (1.6 reported). I leave my config file for F-S below. It would be great if you can take a look to see if there is a problem. Thank you.