Open iamdoron opened 1 year ago
hi
thanks for your videos, just finished to watch the first part
when I tried to intersect between the test & train datasets I noticed some names repeat in the dataset
len(words) - len(list(set(words))) # 2539
it might create a bias in the test results and an additional small bias during training
hi
thanks for your videos, just finished to watch the first part
when I tried to intersect between the test & train datasets I noticed some names repeat in the dataset
it might create a bias in the test results and an additional small bias during training