Query about Data Augmentation in Meta World Environments

I'm interested in understanding the rationale behind not incorporating data augmentation techniques in the meta world environments. Specifically, I noticed that in the function self.reward_model.train_reward(), there isn't a parameter for data augmentation like data_aug_ratio, which is used in self.reward_model.train_reward_iter(num_iters).

Does this imply that methods without data augmentation perform better in meta world environments, or is there another reason for this design choice?

Thank you for your insights!

huxiao09 / QPA

Query about Data Augmentation in Meta World Environments #1