vdemichev / DiaNN

DIA-NN - a universal automated software suite for DIA proteomics data analysis.
Other
283 stars 53 forks source link

Why generate an in silico-predicted library in the first place? #1238

Open FDelalande opened 3 weeks ago

FDelalande commented 3 weeks ago

Hello,

In the version 1.9.2, i have this error message :

"WARNING: it is strongly recommended to first generate an in silico-predicted library in a separate pipeline step and then use it to process the raw data, now without activating FASTA digest"

Why do this in 2 steps when with older versions there was only one step: DIA-NN did the spectral library before processing the analyses?

Merci

François

vdemichev commented 3 weeks ago

Hi Francois,

Several reasons:

  1. Library can be reused - saves time.
  2. Library as a result of prediction will depend on the CPU instruction set - so generating just once aids reproducibility.
  3. A bit less RAM usage.
  4. And this is the main reason really: we do not want to support two modes for the same task, i.e. 'on-the-fly generation and raw data analysis' is not being validated properly.

Best, Vadim