Closed ashwin-balakrishna96 closed 6 months ago
By default I don't think we want to normalize interleaved datasets by their combined statistics -- I think this is a special case for DROID since the mixture is quite homogeneous.
Instead, I would suggest adding an argument to make_interleaved_dataset
that allows to overwrite the dataset_statistics
and just gets passed through to make_rlds_dataset
-- that seems more general.
Yep makes sense, I can fix this later today.
@kpertsch this is fixed now, let me know if there is anything else needed to merge. The default behavior should leave all normalization unchanged from what it was originally.
Moved most of the changes here to https://github.com/octo-models/octo/pull/62 (couldn't figure out how to modify this PR) -- closing this PR.
There are two main changes here:
1) Support to specify what trajectory keys to normalize, defaults to the original action and proprio keys. 2) When creating an interleaved dataset which mixes together multiple individual datasets, dataset statistics used for normalization are combined across all datasets.