Closed him1532 closed 9 months ago
We have seen this ourselves, this was down to an undocumented implicit dependency in an upstream package that can mess up linking of curl. We fixed this for newer versions of the respective bioconda packages, but the changes pulling this into this workflow are still unmerged (and thus unreleased). They are in pull request #77. As a dirty fix to get past this, you could change the tag=
entry in the github()
function in the module import statement in your Snakefile
to instead point to a branch=
, using the branch behind that pull request:
github(..., branch="fix-canonical-transcript-mapped-read-extraction")`
However, this will stop working once we merge the pull request. So you will then have to switch back to the latest release. I'll try to remember to ping you here, once this is merged and released.
ok. got it. will try and report back.
I made changes to the Snakefiles as you suggested.
This time I have following error.
Error in group sleuth-init: jobs: rule compose_sample_sheet: jobid: 29 output: results/sleuth/model_X.samples.tsv log: logs/model_X.compose-sample-sheet.log (check log file(s) for error details) rule sleuth_init: jobid: 2 output: results/sleuth/model_X.rds, results/sleuth/model_X.designmatrix.rds log: logs/sleuth/model_X.init.log (check log file(s) for error details)
Error executing group job sleuth-init on cluster (jobid: 9159f0df-40ec-5076-ac29-7ac8600dd138, external: Your job 1509776 ("snakejob.sleuth-init.9159f0df-40ec-5076-ac29-7ac8600dd138.sh") has been submitted, jobscript: /home/him/RNA/.snakemake/tmp.isl2w3r_/snakejob.sleuth-init.9159f0df-40ec-5076-ac29-7ac8600dd138.sh). For error details see the cluster log and the log files of the involved rule(s). Removing output files of failed job compose_sample_sheet since they might be corrupted: results/sleuth/model_X.samples.tsv
This logs/model_X.compose-sample-sheet.log
has nothing in it.
While logs/model_X.compose-sample-sheet.log
out put is below.
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ✔ dplyr 1.1.4 ✔ readr 2.1.4 ✔ forcats 1.0.0 ✔ stringr 1.5.1 ✔ ggplot2 3.4.4 ✔ tibble 3.2.1 ✔ lubridate 1.9.3 ✔ tidyr 1.3.0 ✔ purrr 1.0.2 ── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ── ✖ dplyr::filter() masks stats::filter() ✖ dplyr::lag() masks stats::lag() ℹ Use the conflicted package (http://conflicted.r-lib.org/) to force all conflicts to become errors Rows: 12 Columns: 3 ── Column specification ──────────────────────────────────────────────────────── Delimiter: "\t" chr (3): sample, condition, path
ℹ Use
spec()
to retrieve the full column specification for this data. ℹ Specify the column types or setshow_col_types = FALSE
to quiet this message. Error inall_of()
: ! Can't subset columns that don't exist. ✖ Columnbatch_effect
doesn't exist. Backtrace: ▆
- ├─samples %>% drop_na(c(sample, path, all_of(variables)))
- ├─tidyr::drop_na(., c(sample, path, all_of(variables)))
- ├─tidyr:::drop_na.data.frame(., c(sample, path, all_of(variables)))
- │ └─tidyselect::eval_select(expr(c(!!!dots)), data, allow_rename = FALSE)
- │ └─tidyselect:::eval_select_impl(...)
- │ ├─tidyselect:::with_subscript_errors(...)
- │ │ └─rlang::try_fetch(...)
- │ │ └─base::withCallingHandlers(...)
- │ └─tidyselect:::vars_select_eval(...)
- │ └─tidyselect:::walk_data_tree(expr, data_mask, context_mask)
- │ └─tidyselect:::eval_c(expr, data_mask, context_mask)
- │ └─tidyselect:::reduce_sels(node, data_mask, context_mask, init = init)
- │ └─tidyselect:::walk_data_tree(new, data_mask, context_mask)
- │ └─tidyselect:::eval_c(expr, data_mask, context_mask)
- │ └─tidyselect:::reduce_sels(node, data_mask, context_mask, init = init)
- │ └─tidyselect:::walk_data_tree(new, data_mask, context_mask)
- │ └─tidyselect:::eval_context(expr, context_mask, call = error_call)
- │ ├─tidyselect:::with_chained_errors(...)
- │ │ └─rlang::try_fetch(...)
- │ │ ├─base::tryCatch(...)
- │ │ │ └─base (local) tryCatchList(expr, classes, parentenv, handlers)
- │ │ │ └─base (local) tryCatchOne(expr, names, parentenv, handlers[[1L]])
- │ │ │ └─base (local) doTryCatch(return(expr), name, parentenv, handler)
- │ │ └─base::withCallingHandlers(...)
- │ └─rlang::eval_tidy(as_quosure(expr, env), context_mask)
- ├─tidyselect::all_of(variables)
- │ └─tidyselect:::as_indices_impl(x, vars = vars, strict = TRUE)
- │ └─tidyselect:::chr_as_locations(x, vars, call = call, arg = arg)
- │ └─vctrs::vec_as_location(...)
- └─vctrs (local)
<fn>
()- └─vctrs:::stop_subscript_oob(...)
- └─vctrs:::stop_subscript(...)
- └─rlang::abort(...) Execution halted
Since there is a message with batch_effect
doesn't exist
following is the output of sample.tsv
sample condition batch_effect EOL-1 cond1 batch1 EOL-2 cond2 batch1 HL601 cond1 batch1 HL602 cond2 batch1 KG-1a1 cond1 batch1 KG-1a2 cond2 batch1 MOLT41 cond1 batch1 MOLT42 cond2 batch1 OCI-AML31 cond1 batch1 OCI-AML32 cond2 batch1 THP-11 cond1 batch1 THP-12 cond2 batch1
Is there something wrong?
There's nothing obviously sticking out for me.
But are you sure that your sample sheet is called sample.tsv
and not samples.tsv
(with an s). So maybe it is not properly loading your sample sheet.
Also, a batch_effect
with just one factor level (batch1
) for all samples doesn't make any sense, as the model cannot do any correction with that. If you are sure you don't have any batches, so basically if you know that all samples were handled in one big batch, all at the same time, then simply leave out the specification of batch effects. But this is rarely the case...
Also, the column names condition
and batch_effect
are arbitrary names that you can change, you then just need to update your model specification in the config/config.yaml
file accordingly.
And finally, just look at input and output files of the rule that is failing and those that come before it. Maybe you can spot what and where it goes wrong.
I hope this is resolved for you, so I am closing it for now. If you run into further problems, please feel free to open new issues for those.
I am seeing an error while running kallisto sleuth pipline.
When I looked at the log files.
Do I need to install some other library? Thank you.
Following is the output of snakemake/log/2023-12-18T142901.516512.snakemake.log