OstfriesenBI / PredmiRNA

A set of scripts and tools to train a classifier for pre-miRNA Recognition

Script/Program: Split csv file #6

Closed Finesim97 closed 5 years ago

Finesim97 commented 5 years ago

To enable parallel processing, the input CSV files have to be split. The script (Python, R, Java, Bash or awk) takes one CSV file as input, splits it into a given number of CSV output files and stores them in a given folder:

csvsplit input.csv 200 outputdir/

This would generate outputdir/0.csv, outputdir/1.csv, ..., outputdir/199.csv. Every output file has to start with the same header line as the input file.
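A minimal Python sketch of the requested behaviour, not the script that ended up in the repository. The function name split_csv is hypothetical, and the round-robin distribution of rows is an assumption, since the issue does not say how rows should be divided among the output files. It uses the standard csv module so that quoted fields containing line breaks stay intact.

```python
import csv
from contextlib import ExitStack
from pathlib import Path


def split_csv(input_path, n_parts, output_dir):
    """Split one CSV file into n_parts files named 0.csv .. (n_parts-1).csv.

    Hypothetical helper: rows are dealt out round-robin (an assumption),
    and every output file starts with the header line of the input file.
    """
    if n_parts < 1:
        raise ValueError("n_parts must be at least 1")

    out_dir = Path(output_dir)
    # Create the output directory if it does not exist yet.
    out_dir.mkdir(parents=True, exist_ok=True)

    with ExitStack() as stack:
        src = stack.enter_context(open(input_path, newline=""))
        reader = csv.reader(src)
        header = next(reader, None)
        if header is None:
            raise ValueError("input file is empty")

        # Open all output files up front and write the header to each one.
        writers = []
        for i in range(n_parts):
            handle = stack.enter_context(
                open(out_dir / f"{i}.csv", "w", newline="")
            )
            # lineterminator="\n" is a choice made here; csv defaults to "\r\n".
            writer = csv.writer(handle, lineterminator="\n")
            writer.writerow(header)
            writers.append(writer)

        # Deal the data rows out in turn: row 0 -> 0.csv, row 1 -> 1.csv, ...
        for row_index, row in enumerate(reader):
            writers[row_index % n_parts].writerow(row)
```

Calling split_csv("input.csv", 200, "outputdir/") would correspond to the csvsplit example above. All output files are held open at once, so a very large number of parts could hit the operating system's open-file limit; writing contiguous chunks one file at a time would avoid that at the cost of knowing the row count in advance.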

mariusrueve commented 5 years ago

Currently I am having trouble finding a way to get the header line into every single .csv file and to specify an output directory. d1cf283

mariusrueve commented 5 years ago

Final result and comments in 5f0f246.