Closed slsevilla closed 1 year ago
% cut -f1 samples.txt|sort|uniq -c|sort -k1,1nr
2
2 NHP21_TP1_CD14negCD16pos
1 NHP10_TP1_CD14negCD16pos
1 NHP10_TP2_CD14negCD16pos
1 NHP10_TP2_CD14posDRneg
1 NHP10_TP2_CD14posDRpos
1 NHP10_TP3_CD14negCD16pos
1 NHP10_TP3_CD14posDRneg
1 NHP10_TP3_CD14posDRpos
1 NHP10_TP4_CD14negCD16pos
1 NHP10_TP4_CD14posDRneg
1 NHP10_TP4_CD14posDRpos
1 NHP17_TP1_CD14negCD16pos
1 NHP17_TP1_CD14posDRneg
1 NHP17_TP1_CD14posDRpos
1 NHP17_TP2_CD14negCD16pos
1 NHP17_TP2_CD14posDRneg
1 NHP17_TP2_CD14posDRpos
1 NHP17_TP3_CD14negCD16pos
1 NHP17_TP3_CD14posDRneg
1 NHP17_TP3_CD14posDRpos
1 NHP17_TP4_CD14negCD16pos
1 NHP17_TP4_CD14posDRneg
1 NHP17_TP4_CD14posDRpos
1 NHP21_TP1_CD14posDRneg
1 NHP21_TP1_CD14posDRpos
1 NHP22_TP1_CD14posDRneg
1 NHP22_TP1_CD14posDRpos
1 NHP27_TP1_CD14negCD16pos
1 NHP27_TP1_CD14posDRneg
1 NHP27_TP1_CD14posDRpos
1 NHP27_TP2_CD14negCD16pos
1 NHP27_TP2_CD14posDRneg
1 NHP27_TP2_CD14posDRpos
1 NHP27_TP3_CD14negCD16pos
1 NHP27_TP3_CD14posDRneg
1 NHP27_TP3_CD14posDRpos
1 NHP27_TP4_CD14negCD16pos
1 NHP27_TP4_CD14posDRneg
1 NHP27_TP4_CD14posDRpos
1 replicateName
Observations:
% grep NHP21_TP1_CD14negCD16pos samples.txt .. 791 and 794.
NHP21_TP1_CD14negCD16pos TP1_CD14negCD16pos /data/NCI_VB/rawdata/CCRVB-13/190321_0539/fq/s794_r1.fq.gz /data/NCI_VB/rawdata/CCRVB-13/190321_0539/fq/s794_r2.fq.gz
NHP21_TP1_CD14negCD16pos TP1_CD14negCD16pos /data/NCI_VB/rawdata/CCRVB-13/190321_0539/fq/s791_r1.fq.gz /data/NCI_VB/rawdata/CCRVB-13/190321_0539/fq/s791_r2.fq.gz
This leads me to believe that the reps are incorrectly labeled. Can you double check this and run with unique replicateName's.
BTW, NHP21_TP1_CD14negCD16pos
is also the 39th replicate in your example. Its nothing to do with it being 39th , but the fact that replicateName is repeated with the 39th entry.
Ah! I looked at uniq too and didn't catch this.
I'll follow up with Kate and have her fix the manifest. Will update issue if this resolves it.
On Tue, Dec 13, 2022, 21:34 Vishal Koparde @.***> wrote:
BTW, NHP21_TP1_CD14negCD16pos is also the 39th replicate in your example. Its nothing to do with it being 39th , but the fact that replicateName is repeated with the 39th entry.
— Reply to this email directly, view it on GitHub https://github.com/CCBR/ASAP/issues/8#issuecomment-1350280518, or unsubscribe https://github.com/notifications/unsubscribe-auth/AE6EEILM3A5ENXAULB5PC6LWNEW3ZANCNFSM6AAAAAAS5WQFWM . You are receiving this because you were assigned.Message ID: @.***>
Issue resolved with fixing of replicate name. Adding an issue to check for replicate names to ensure this doesnt happen moving forward
I kicked off a run with yesterday without a problem (N=17). Tried to do a second run with more samples (N=40) and keep getting an error during init.smk.
I've been trying t troubleshoot it and the only thing that I can figure out is if I run it with 39 samples it runs fine, but as soon as I had the remaining sample, it errors. Doesn't matter which sample is at the end, always errors.
Specifically, the error is happening during init.smk, when its going through the reps to make sure that R1/R2 exist. I edited it to print out the rep, r1, r2 values
this works as normal for sample N=38:
but then, after when it gets to sample N=39 it does this:
It seems to be repeating the entire df again. If I run with N=39 it prints as expected.
I've also printed out the replicates, and those print as expected for both N=39
N=40
Finally, I have tried removing extra lines and carriages and that hasn't fixed it either. I don't get where the issue is coming from.
Attaching samples file as example. samples.txt
dryrun error
Using ASAP found here
Need an extra pair of eyes, @kopardev!