Open gsaxena888 opened 1 month ago
aligned_rt
value will be different (since aligned RT = actual RT in the case of 1 file, and aligned RT ~ mean RT for that peptide across files in the case of > 1 file), and that will impact some of the features that feed into LDA. Reading a FASTA file should be nearly instant, but generation of modified peptides & the fragment index will be a bottleneck for large search spaces (lots of peptides, lots of variable mods, etc). There isn't really a huge benefit to splitting the MGF, except that each MGF file will be parsed in parallel. Spectra are always searched in parallel, regardless of how many files they were parsed from.wide_window = True
) mode works in Sage. The isolation window is multiplied by the charge states specified (by default, 2, 3, 4) and the spectrum is searched multiple times with different tolerancesreport_psms = 5
) with an open search. There is also a PR you could test out: https://github.com/lazear/sage/pull/120
I've been playing with Sage recently, and love it! A few random questions all over the board:
Thanks in advance!