Open thusinh1969 opened 1 month ago
Counting untrained tokens: 50%|█████████████████████████████████████▊ | 66000/132738 [05:19<05:28, 203.40 examples/s]
It says 5:28 but in fact it took 25 minutes more !
It always took sometime up to 1 hour or so do do this. On large few millions rows, it takes also forever and repeating everytime we restart or resume.
What is it doing on counting what tokens and why ? Can we turn it off ?
Thanks, Steve
@thusinh1969 Maybe this is related: https://github.com/unslothai/unsloth/issues/658#issuecomment-2175416360
Counting untrained tokens: 50%|█████████████████████████████████████▊ | 66000/132738 [05:19<05:28, 203.40 examples/s]
It says 5:28 but in fact it took 25 minutes more !
It always took sometime up to 1 hour or so do do this. On large few millions rows, it takes also forever and repeating everytime we restart or resume.
What is it doing on counting what tokens and why ? Can we turn it off ?
Thanks, Steve