hwlim / BisKit-RNA

BisKit for CCHMC HPC users
0 stars 0 forks source link

handling empty values in sample.tsv #2

Closed hwlim closed 4 months ago

hwlim commented 5 months ago

Let's add one validation step checking empty values in sample.tsv

e.g. sometimes sample.tsv is incorrectly formatted or missing values.

Add a validation routine that check empty value before following lines.

69 for col in samples: 70 tmp = samples[col].str.count(r'(^[a-zA-Z0-9][a-zA-Z0-9-_.]+$)') 71 index_invalid = (tmp == 0) 72 if index_invalid.any(): 73 invalid_elem = invalid_elem + samples[col][index_invalid].tolist()

cahn20 commented 4 months ago

This is fixed by pull request #6 (already merged).