if it is normal
...
--train_dataset.json_dataset.batch_size=4
--optimizer.bf16_accumulate_gradient=True \
--optimizer.accumulate_gradient_steps=1 \
...
so is it right?
...
--train_dataset.json_dataset.batch_size=8
--optimizer.bf16_accumulate_gradient=True \
--optimizer.accumulate_gradient_steps=2 \
...
if it is normal ... --train_dataset.json_dataset.batch_size=4 --optimizer.bf16_accumulate_gradient=True \ --optimizer.accumulate_gradient_steps=1 \ ...
so is it right? ... --train_dataset.json_dataset.batch_size=8 --optimizer.bf16_accumulate_gradient=True \ --optimizer.accumulate_gradient_steps=2 \ ...