Closed ncarrut closed 4 years ago
Hey Nick,
ah yes, this could be slightly confusing: The two ways of specifying an experimental design are mutually exclusive.
Either:
--input your_sdrf.tsv
plus (1) optionally a --root_folder
in case the spectrum files are lying in a different location than in the SDRF. Plus (2) optionally --local_input_type
(in case you converted already, and the files are in mzML, as opposed to raw in the SDRF).
You should not specify an additional experimental design here since both the filenames and the design are already included in the SDRF. (I will open an issue that the pipeline warns and fails from the beginning in the future). Instead, the "main" --input
parameter needs to be the sdrf.
Or:
--input run*_foo*.mzML/raw
plus a then mandatory experimental design (--expdesign
) file in OpenMS format.
I think on the website the parameter documentation is more clearly structured than on the command line.
TLDR:
Use --input /wsu/home/aj/aj76/aj7682/2047/sdrf.tsv --root_folder /wsu/home/aj/aj76/aj7682/2047/raw/
instead. You might not even need the --root_folder
if the filenames in the SDRF are local and absolute. It was introduced for SDRFs with e.g. FTP links.
Thanks that worked.
Hi, I've run the pipeline successfully with test and with the pxd001819 dataset but can't get it to run with my own data. The data are being supplied locally and I'm not sure I've specified the parameters right for that. My command is:
../nextflow/nextflow run nf-core/proteomicslfq -r dev -profile singularity --input /wsu/home/aj/aj76/aj7682/2047/raw/*.raw --expdesign /wsu/home/aj/aj76/aj7682/2047/sdrf.tsv --database /wsu/home/aj/aj76/aj7682/2047/UP000005640.fasta --add_decoys --precursor_mass_tolerance 20 --fragment_mass_tolerance 0.5
My sdrf file is here sdrf.txt. It's valid according to the sdrf_parser. The pipeline launches successfully and makes the decoy database. But I notice that the spectra processing steps indicate 1 of 1 completed where there are 34 raw files so it should be 1 of 34 etc. The error message is below. When I look at proteomicslfq.log I see;
Error: Unable to read file (Error: Missing column header: Fraction in: sdrf.tsv)
. Can anyone help me troubleshoot this please? Thanks, Nick