Closed willrosenow closed 4 years ago
Hi Will,
Would you be able to show the contents of the log file referenced in your error /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/CELLECT-LDSC-EXAMPLE/logs/log.compute_LD_scores.tabula_muris-test.1.txt
? It's difficult to be certain without seeing that.
However, if I had to guess I would think that it's because you don't have the LD score regression submodule, it should be here /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/ldsc/
. This issue arises when you clone the repo without using --recurse-submodules
.
If you delete your CELLECT installation and reinstall with git clone --recurse-submodules https://github.com/perslab/CELLECT.git
that should fix the error. If not, post the contents of the log file and I'll try my best to give a better diagnosis.
Best, Tobi
Hi Tobi,
Thanks for the quick response. Here is the log file.
Thanks, Will
On Thu, Feb 27, 2020 at 6:19 PM Tobi notifications@github.com wrote:
Hi Will,
Would you be able to show the contents of the log file referenced in your error /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/CELLECT-LDSC-EXAMPLE/logs/log.compute_LD_scores.tabula_muris-test.1.txt
Best, Tobi
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/perslab/CELLECT/issues/43?email_source=notifications&email_token=ALLW6FRIUYLX3746HMY5DLDRFBDB5A5CNFSM4K47XSB2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOENGKZMY#issuecomment-592227507, or unsubscribe https://github.com/notifications/unsubscribe-auth/ALLW6FWP6Z2WBSXOGMPJWWLRFBDB5ANCNFSM4K47XSBQ .
Traceback (most recent call last):
File "/sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/ldsc/ldsc.py", line 23, in
Hi Will,
That log message is super helpful! It seems to be a bug in an old version of pytest which your system python has installed.
As a quick-fix I'd suggest updating your system python 2.7 /home/wr8yp/.local/lib/python2.7/
to ensure it has pytest version >=4.2.1. Let me know how that pans out.
Snakemake's use of conda should ensure that all packages are tightly controlled within the conda environments but for some reason this isn't the case here. This is something we will need to look into and fix so thank you for bringing it to our attention!
Best, Tobi
Hi Tobi,
Thanks! That fixed the problem and now the tutorial data runs correctly.
However, I am still getting an error with my data using the same environment. Here is the following error with the log file:
***** ERROR: Requested column 4, but database file /scratch/wr8yp/CELLECT_test/precomputation/SCRNA/bed/tmp/pybedtools.65f2cdgd.tmp only has fields 1 - 0.
Traceback (most recent call last):
File "/sfs/lustre/bahamut/scratch/wr8yp/CELLECT_test/.snakemake/scripts/
tmpp9r9q9eb.format_and_map_snake.py", line 111, in
bedtools merge -o distinct,max -i /scratch/wr8yp/CELLECT_test/precomputation/SCRNA/bed/tmp/pybedtools.so_mzpxn.tmp -c 4,5
Error message was:
***** ERROR: Requested column 4, but database file /scratch/wr8yp/CELLECT_test/precomputation/SCRNA/bed/tmp/pybedtools.so_mzpxn.tmp only has fields 1 - 0.
[Mon Mar 2 11:06:32 2020] Error in rule format_and_map_genes: jobid: 122 output: /scratch/wr8yp/CELLECT_test/precomputation/SCRNA/bed/SCRNA.cluster_6.bed log: /scratch/wr8yp/CELLECT_test/logs/log.format_and_map_snake.SCRNA.cluster_6.txt (check log file(s) for error message) conda-env: /sfs/lustre/bahamut/scratch/wr8yp/CELLECT_test/.snakemake/conda/d964a65b
On Sat, Feb 29, 2020 at 5:16 PM Tobi notifications@github.com wrote:
Hi Will,
That log message is super helpful! It seems to be a bug in an old version https://github.com/pytest-dev/pytest/issues/4770 of pytest which your system python has installed.
As a quick-fix I'd suggest updating your system python 2.7 /home/wr8yp/.local/lib/python2.7/ to ensure it has pytest version
=4.2.1. Let me know how that pans out.
Snakemake's use of conda should ensure that all packages are tightly controlled within the conda environments but for some reason this isn't the case here. This is something we will need to look into and fix so thank you for bringing it to our attention!
Best, Tobi
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/perslab/CELLECT/issues/43?email_source=notifications&email_token=ALLW6FRT5BKFYVA6X5CIJITRFGEMBA5CNFSM4K47XSB2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOENMHNMQ#issuecomment-593000114, or unsubscribe https://github.com/notifications/unsubscribe-auth/ALLW6FSVXSSP3RR757XFUQDRFGEMBANCNFSM4K47XSBQ .
Making gene set bed files Failed setting pybedtools tempdir to /scratch/wr8yp/CELLECT_test/precomputation/SCRNA/bed/tmp. Will use standard tempdir /tmp Merging input multi gene set with gene coordinates for annotation = cluster_0
Hi Will,
From the logfile it seems like you're providing CELLECT with mouse genes. CELLECT only takes Ensembl human genes as input.
If you used CELLEX to compute expression specificity for each gene, make sure to also map Ensembl mouse genes to Ensembl human orthologues.
Hopefully with that, everything should run smoothly. If not, let me know.
All the best, Tobi
That worked, Thank you!
On Mon, Mar 2, 2020 at 11:42 AM Tobi notifications@github.com wrote:
Hi Will,
From the logfile it seems like you're providing CELLECT with mouse genes. CELLECT only takes Ensembl human genes as input.
If you used CELLEX to compute expression specificity for each gene, make sure to also map Ensembl mouse genes to Ensembl human orthologues https://github.com/perslab/CELLEX#pro-tip-using-cellex-with-cellect.
Hopefully with that, everything should run smoothly. If not, let me know.
All the best, Tobi
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/perslab/CELLECT/issues/43?email_source=notifications&email_token=ALLW6FRV2IF7KAFD5ML53U3RFPOXHA5CNFSM4K47XSB2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOENQAVOI#issuecomment-593496761, or unsubscribe https://github.com/notifications/unsubscribe-auth/ALLW6FX3YL3MRPKNLTHEFJDRFPOXHANCNFSM4K47XSBQ .
Hello,
I am trying to run the example from CELLECT using the following command:
snakemake --use-conda -j -s cellect-ldsc.snakefile --configfile example/config-ldsc_example.yml
The job runs for a while, but then produces the following error:
Error in rule compute_LD_scores: jobid: 184 output: /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/CELLECT-LDSC-EXAMPLE/precomputation/tabula_muris-test/tabula_muris-test.COMBINED_ANNOT.1.l2.ldscore.gz, /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/CELLECT-LDSC-EXAMPLE/precomputation/tabula_muris-test/tabula_muris-test.COMBINED_ANNOT.1.l2.M, /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/CELLECT-LDSC-EXAMPLE/precomputation/tabula_muris-test/tabula_muris-test.COMBINED_ANNOT.1.l2.M_5_50, /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/CELLECT-LDSC-EXAMPLE/precomputation/tabula_muris-test/tabula_muris-test.COMBINED_ANNOT.1.log log: /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/CELLECT-LDSC-EXAMPLE/logs/log.compute_LD_scores.tabula_muris-test.1.txt (check log file(s) for error message) conda-env: /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/.snakemake/conda/8cf508d1 shell: /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/ldsc/ldsc.py --l2 --bfile /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/data/ldsc/1000G_EUR_Phase3_plink/1000G.EUR.QC.1 --ld-wind-cm 1 --annot /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/CELLECT-LDSC-EXAMPLE/precomputation/tabula_muris-test/tabula_muris-test.COMBINED_ANNOT.1.annot.gz --thin-annot --out /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/CELLECT-LDSC-EXAMPLE/precomputation/tabula_muris-test/tabula_muris-test.COMBINED_ANNOT.1 --print-snps /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/data/ldsc/print_snps.txt &> /sfs/lustre/bahamut/scratch/wr8yp/CELLECT-3/CELLECT/CELLECT-LDSC-EXAMPLE/logs/log.compute_LD_scores.tabula_muris-test.1.txt
Any idea what might be causing this?
Thanks, Will