Closed tija94 closed 1 year ago
Hi @tija94,
For cram files the command is nextflow run main.nf -profile standard --build hg38 --gene cyp2d6 --format compressed
Kindly note that we haven't completed optimisation of StellarPGx for WES yet. Therefore, the results you get might be highly unreliable at the moment, especially for CYP2D6.
Thank you! Supplying --format compressed
did solve it.
Hi, I am having the same issue. While I did add the --format compressed flag. What could be wrong?
@Mahmoudbassuoni,
Could you share the line in your config file where you're supplying the --in_bam? Perhaps it's not pointing to the files correctly.
(base) mbassyouni@svmrchor-gns02:/Data/dataflash/StellarPGx$ ls data/*{cram,crai}
data/S1.recal.cram data/S1.recal.cram.crai data/S2.recal.cram data/S2.recal.cram.crai data/S3.recal.cram data/S3.recal.cram.crai
while the config file is like this
params {
// User-defined parameters
// reference genome
ref_file = "$PWD/Homo_sapiens_assembly38.fasta" // .fai index should be in the same folder
// BAM/CRAM file(s) and respective indexes
// example1 (single sample): /path/to/data/GeT-RM/NA12878*{bam,bai}
// example2 (multiple samples): /path/to/data/GeT-RM/*{bam,bai}
// example3 (CRAM files): /path/to/data/GeT-RM/HG*{cram,crai}
in_bam = "$PWD/data/*{cram,crai}"
// Output directoy (Default is $PWD/results). User-defined path should be an absolute path
out_dir = "$PWD/results"
Are the cram files in data
symbolic links?
yes, I have tried to point to the main directory that they in before the symbolic links but it didn't work either.
On some servers, it's possible that you can only access the files only if you're logged into particular node(s). It's also possible that your permissions to access the files have changed.
When you do ls data/
are the symbolic links coloured sky-blue-ish or are they coloured red-on-dark-background?
red on dark background yes
Exactly what I thought.
Red on dark background means you don't have access to those samples for one reason or the other. It's not an issue with StellarPGx.
It could be that you need to be logged into a worker node (or particular nodes on your system) to access the samples (can try launching StellarPGx when logged into a worker node) or it could be an issue with your file permissions. In some cases, the data may have been moved (you can try ls -l
to confirm that the symbolic links point to the correct paths). Anyway, all this means you need to contact your Sys Admin
I have tried it again while pointing to the main path without the symbolic link and it worked fine. I think the first time I ran it without the --format flag. Anyway, thank you for your response.
No worries
Hi,
I wanted to test StellarPGx with cram files. However, I do not get a result and I do not see any error. The only output I see is this
There is no additional logging (at least none that I am aware of) because the
work
directory is empty. I am running it like thisnextflow run main.nf -profile standard --build hg38 --gene cyp2d6
with the followingnetflow.config
:In my data directory I have cram, cram.crai, bam, and bam.bai files. When I am using the bam file of the same sample and changing
in_bam = "$PWD/data/*{cram,crai}"
toin_bam = "$PWD/data/*{bam,bai}"
it works.Am I doing something wrong? Are there other setting changes necessary? I am not familiar with nextflow, so if there is a possibility to see some logging elsewhere, please let me know. Thanks!
I do not know if this makes any difference but I am using WES data (not WGS) with a custom cyp2d6 test3.bed file: