Open Nisekoi-1 opened 2 days ago
Due to this video, it's said that dataset longer than 30 minutes will begin to show boundary effects, but based on my own practice, I personally recommend the longer the better, even though it may be a bit difficult to distinguish the difference between the two.
Is the improvement in training quality worth the extra time required?