After running this command I get the similar issue to the other issue (Issue #7).The current version of script contains the IterableDataset instead of Dataset, which makes the training with the multiple epochs impossible.
I should also note that the Issue #7 was resolved due to edited dataset creating.
Could you please specify whether are you training the BC with IterableDataset or not, and if you train with the multiple epochs, how many are you using in your experiments?
Hi @PioneerAlexander when we run this we only use one epoch. We use IterableDataset as it speeds up the initialization of the dataset. However, if you want to use multiple epochs you should use Dataset.
Hello,
For the training bc with chess full games I use the command line
Where the dataset is train_bc.jsonl
After running this command I get the similar issue to the other issue (Issue #7).The current version of script contains the IterableDataset instead of Dataset, which makes the training with the multiple epochs impossible. I should also note that the Issue #7 was resolved due to edited dataset creating. Could you please specify whether are you training the BC with IterableDataset or not, and if you train with the multiple epochs, how many are you using in your experiments?
I look forward to your response.