After splitting a file at least 90 times the split command seems to
change the naming scheme somehow so it starts using 4 digit numbers
and so the old version of split_reads.py starts naming files in an odd way.
This pull request is mostly to help generalize the previous split_reads.py script.
I also included some test data (test.in.txt), kept the old split_reads.py as
split_reads_old.py, and included a helper script test.sh so you can check out.
I guess most people won't be splitting their data into >90 splits but I
ran into this issue and wanted to contribute. Let me know if you have
any questions.
After splitting a file at least 90 times the split command seems to change the naming scheme somehow so it starts using 4 digit numbers and so the old version of split_reads.py starts naming files in an odd way. This pull request is mostly to help generalize the previous split_reads.py script. I also included some test data (test.in.txt), kept the old split_reads.py as split_reads_old.py, and included a helper script test.sh so you can check out.
I guess most people won't be splitting their data into >90 splits but I ran into this issue and wanted to contribute. Let me know if you have any questions.
Joaquin