Persona overlap between train, test and valid

Hello

The train, test and valid personas (or "tasks") are obtained with:

train = p.get_personas('train')
test = p.get_personas('test')
valid = p.get_personas('valid')

The length of test is 100, which means it contains 100 distinct personas. However, 99 of them are also present in train, and likewise all 99 personas in valid are present in train.

In addition, valid and test differ by only one persona (62), so they are almost the same task.
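For anyone who wants to reproduce these counts, here is a minimal sketch of the check using set operations. It assumes each split is an iterable of hashable persona identifiers, which may not match what `p.get_personas` actually returns. `overlap_report` is a hypothetical helper, not part of PAML, and the inputs are toy values rather than the real personas.

```python
def overlap_report(train, test, valid):
    """Count the personas shared between splits (hypothetical helper)."""
    # Assumption: personas are hashable, so they can be placed in sets.
    train, test, valid = set(train), set(test), set(valid)
    return {
        "test_in_train": len(test & train),        # test personas also in train
        "valid_in_train": len(valid & train),      # valid personas also in train
        "test_not_in_valid": sorted(test - valid),  # personas unique to test
        "valid_not_in_test": sorted(valid - test),  # personas unique to valid
    }

# Toy identifiers for illustration only, not the real PAML personas.
report = overlap_report(train=[1, 2, 3, 4], test=[2, 3, 5], valid=[2, 3])
print(report)
```

With the real splits, one would pass `train`, `test` and `valid` from the calls above, converting each persona to a hashable form first if needed (for example a tuple of its sentences).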

Q1. Why are nearly all personas in test and valid also present in train? I thought data in train should not appear in either test or valid.

Q2. Why do valid and test have almost the same personas?

Thanks

HLTCHKUST / PAML, issue #10