nservant / HiC-Pro

HiC-Pro: An optimized and flexible pipeline for Hi-C data processing

Fixing bug due to split function #434

Closed by joreynajr 3 years ago

joreynajr commented 3 years ago

Once a file has been split into at least 90 chunks, the split command seems to change its naming scheme and starts using 4-digit suffixes, so the old version of split_reads.py names the output files incorrectly. This pull request mostly generalizes the previous split_reads.py script. I also included some test data (test.in.txt), kept the old split_reads.py as split_reads_old.py, and added a helper script, test.sh, so you can check it out.
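
To illustrate the naming issue, here is a minimal sketch of one way to make chunk numbering independent of the suffix width. It is not the code from this pull request. It assumes the chunks come from split with numeric suffixes and no fixed suffix length, so that names go from `00` to `89` and then continue with wider suffixes such as `9000`. The function name `renumber_chunks`, the `reads_` prefix, and the 5-digit output width are hypothetical choices made for the example.

```python
def renumber_chunks(chunk_names, prefix, width=5):
    """Map split-style chunk names to fixed-width sequential names.

    chunk_names: file names such as "reads_00" or "reads_9000"
    prefix:      the common prefix given to split (hypothetical here)
    width:       zero-padded width of the new index (an assumption)
    """
    suffixes = []
    for name in chunk_names:
        if not name.startswith(prefix):
            raise ValueError(f"{name!r} does not start with {prefix!r}")
        suffix = name[len(prefix):]
        if not suffix.isdigit():
            raise ValueError(f"{name!r} has a non-numeric suffix")
        suffixes.append((suffix, name))

    # Sort on the suffix string: widened suffixes (9000, 9001, ...) are
    # assumed to sort after the two-digit ones (00 .. 89), so this
    # recovers the order in which the chunks were written.
    suffixes.sort()

    # Assign a new index by position instead of parsing the old suffix,
    # so the result does not depend on how many digits split used.
    return {
        name: f"{prefix}{index:0{width}d}"
        for index, (_, name) in enumerate(suffixes)
    }


if __name__ == "__main__":
    example = ["reads_9000", "reads_00", "reads_89", "reads_01"]
    for old, new in renumber_chunks(example, "reads_").items():
        print(old, "->", new)
```

Numbering by position rather than by the parsed suffix value avoids gaps in the output names when the suffix jumps from 89 to 9000.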

I guess most people won't be splitting their data into more than 90 chunks, but I ran into this issue and wanted to contribute. Let me know if you have any questions.

Joaquin

nservant commented 3 years ago

Already in dev! Thanks.