vdemichev / DiaNN

DIA-NN - a universal automated software suite for DIA proteomics data analysis.

Running command line with multiple libraries #870

Open aliibarry opened 11 months ago

aliibarry commented 11 months ago

Working to parallelize my pipeline, based on the following:

"Another option is to reduce the precursor mass range, that is search mass ranges 400-500, 500-600, 600-700, etc, separately - create a spectral library from DIA data separately for each mass range, then merge these libraries (e.g. by supplying multiple --lib commands to DIA-NN) and reanalyse the whole dataset with the merged library."

Is there an example script to run this? I'm struggling to load the spectral libraries for reanalysis and suspect I'm missing something very basic.

3 files will be processed
WARNING: multiple spectral libraries are specified; this mode is experimental, and DIA-NN does not check if different libraries are in the same format, have consistent modification names or reference RT scales; all libraries must be in .tsv format
[0:00] Loading spectral library diann_out/trial_binned_parallel/reporta_part1-lib.tsv
cannot read the file
ERROR: src/diann.cpp: 19135: Cannot load spectral library

vdemichev commented 11 months ago

The error here means that diann_out/trial_binned_parallel/reporta_part1-lib.tsv is either not readable (read permissions, etc.) or not in the right format. But why do you need this workaround? For almost all applications 64-128 GB of RAM is perfectly enough, so splitting the search by mass range should not be necessary.
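
If you do still want to go down this route, a small pre-flight check can narrow down which of the two causes applies before DIA-NN is launched. The following is only a rough sketch, not an official script: the helper names `check_library` and `build_merged_command`, the executable name `diann`, and every file name other than the `reporta_part1-lib.tsv` path from the log are assumptions made for illustration. It relies only on the `--f`, `--lib` and `--out` options, with `--lib` repeated once per library as described in the quoted advice.

```python
import os
from pathlib import Path


def check_library(path):
    """Return a list of problems found with one spectral library file."""
    p = Path(path)
    if not p.is_file():
        # Relative paths are resolved against the current working directory,
        # so show the absolute path that was actually looked up.
        return [f"{path}: file does not exist (looked for {p.resolve()})"]
    if not os.access(p, os.R_OK):
        return [f"{path}: no read permission"]
    problems = []
    if p.suffix.lower() != ".tsv":
        problems.append(f"{path}: extension is not .tsv")
    with p.open("r", errors="replace") as handle:
        header = handle.readline()
    if "\t" not in header:
        problems.append(f"{path}: first line is not tab-separated")
    return problems


def build_merged_command(raw_files, libraries, out_report, exe="diann"):
    """Build an argument list with one --f per raw file and one --lib per library.

    `exe` is an assumed executable name; adjust it to your installation.
    """
    cmd = [exe]
    for raw in raw_files:
        cmd += ["--f", str(raw)]
    for lib in libraries:
        cmd += ["--lib", str(lib)]
    cmd += ["--out", str(out_report)]
    return cmd


if __name__ == "__main__":
    # Only the first path comes from the log above; the rest are hypothetical.
    libraries = [
        "diann_out/trial_binned_parallel/reporta_part1-lib.tsv",
        "diann_out/trial_binned_parallel/reporta_part2-lib.tsv",
    ]
    raw_files = ["run1.raw", "run2.raw", "run3.raw"]  # hypothetical names

    problems = [msg for lib in libraries for msg in check_library(lib)]
    if problems:
        print("\n".join(problems))
    else:
        print(" ".join(build_merged_command(raw_files, libraries, "report.tsv")))
```

Because the library path in the log is relative, it is worth confirming that the job is started from the directory that contains `diann_out/`; on a cluster the working directory of a submitted job is often not the one you submitted from. If the check passes and DIA-NN still reports "cannot read the file", the file content itself is the more likely culprit, given the warning that all libraries must be in .tsv format in this mode.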