rkimoakbioinformatics / oakvar

Genomic variant analysis platform
Other
32 stars 8 forks source link

read_crv first row deletion fix #54

Closed sinmojito closed 1 year ago

sinmojito commented 1 year ago

The old script was skipping row 1 and inserting it as a header, and then replacing the header with new_columns.

When has_header=False is specified, it fixes the row deletion issue but then reads the final comma of the .crv file as an additional column. Thus, df.select is needed to ignore that column.

I also replaced the while loop with the inbuilt comment_char = # from the Polars documentation, which is handy.

rkimoakbioinformatics commented 1 year ago

Thanks. I appreciate it.