Open p-ferreira opened 1 year ago
The tokenizer has to match the reward model and so this cannot be changed in isolation. It's basically the same as just changing the entire reward model.
We are using specific hyperparams in the tokenizer for things like padding and max sequence length. These could be investigated further.
```python
# mirror_neuron/sources/reward.py
encodings_dict = self.tokenizer(
    sub_samples,
    truncation=False,
    max_length=550,
    padding="max_length",
    return_tensors="pt",
)
```
We should make these configurable from the config file rather than requiring source code changes.
We want the reward model tokenizer to be driven by the config file, with the path to the tokenizer set in the `.yml` config alongside the other reward model settings.
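A minimal sketch of what this could look like: a small config object holding the tokenizer path and hyperparams, populated from the parsed `.yml` reward section, with a helper that builds the kwargs currently hardcoded in `reward.py`. All names here (`RewardTokenizerConfig`, the field names, the example tokenizer path) are illustrative assumptions, not the project's actual schema.

```python
from dataclasses import dataclass

@dataclass
class RewardTokenizerConfig:
    """Hypothetical config section for the reward model tokenizer.

    Field names are illustrative; the real schema would live alongside
    the other reward model configs in the .yml file.
    """
    tokenizer_path: str = "EleutherAI/gpt-j-6b"  # placeholder default
    truncation: bool = False
    max_length: int = 550
    padding: str = "max_length"

    @classmethod
    def from_dict(cls, d: dict) -> "RewardTokenizerConfig":
        # d is the dict produced by parsing the .yml reward section
        return cls(**d)

def tokenizer_kwargs(cfg: RewardTokenizerConfig) -> dict:
    """Build the kwargs currently hardcoded in reward.py from the config."""
    return dict(
        truncation=cfg.truncation,
        max_length=cfg.max_length,
        padding=cfg.padding,
        return_tensors="pt",
    )

# Example: values as they might appear after parsing the .yml config.
cfg = RewardTokenizerConfig.from_dict({
    "tokenizer_path": "my-org/reward-tokenizer",  # hypothetical path
    "truncation": False,
    "max_length": 550,
    "padding": "max_length",
})
kwargs = tokenizer_kwargs(cfg)
```

The reward source would then load the tokenizer from `cfg.tokenizer_path` (e.g. via `AutoTokenizer.from_pretrained`) and unpack `tokenizer_kwargs(cfg)` into the call, so swapping reward models only requires editing the `.yml` file.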