NVIDIA / NeMo-Aligner

Scalable toolkit for efficient model alignment
Apache License 2.0
522 stars 58 forks source link

Ea/multiple validation sets #184

Open eloialonso opened 4 months ago

eloialonso commented 4 months ago

What does this PR do ?

Enable multiple validation datasets for reward model training. Metrics are computed individually on each dataset, and logged in separate wandb tabs.

Usage

You can add validation sets as new keys in data_prefix. These keys should start with "validation" to be taken into account as an optional validation set.

python examples/nlp/gpt/train_reward_model.py model.data.data_prefix={train: ["path_to_train_set"], validation: ["path_to_validation_set"], validation_optional: ["path_to_other_validation_set"], test: ["path_to_test_set"]}