vdemichev / DiaNN

DIA-NN - a universal automated software suite for DIA proteomics data analysis.
Other
279 stars 54 forks source link

Use only proteotypic peptides for quantification? #565

Closed jessegmeyerlab closed 5 days ago

jessegmeyerlab commented 1 year ago

I am wondering if DIA-NN uses only peptides that are unique to their assigned group for quantification of that group? For example, we are working with data from muscle where myosin heavy isoforms have high sequence overlap. We want to make sure that peptides which are shared across all the myosins are not used for quantification of any isoform.

vdemichev commented 1 year ago

For Genes.MaxLFQ.Unique - yes. Otherwise, all assigned to the group precursors are used. A custom selection of peptides for quantification is possible if using the 'iq' or 'diann' R package for MaxLFQ, on the basis of precursor quantities reported by DIA-NN.

vdemichev commented 1 year ago

If different isoforms you are interested in share the gene name in the uniprot database, then I guess the way to go is indeed (i) set protein inference in DIA-NN to 'isoform' and (ii) filter the peptides by proteotypicity, based on the final report. Alternatively, can infer proteotypicity of the peptides e.g. from the DDA search results used to generate the library (I think FragPipe allows this).

jossmi commented 1 year ago

I have a follow-up question: In the methods section of the DIA-NN paper, it says: "A ‘top 3’ method is eventually used for protein quantification: intensities of protein groups are calculated as sums of the intensities of the three most abundant precursors identified with a q-value lower than a given threshold in a particular acquisition."

When you said "Otherwise, all assigned to the group precursors are used.", do you mean "all [of the top three] assigned...precursors", or do you mean that all precursors for the protein group, including those outside of the top three precursors (e.g., ranks 4 through n), are used?

vdemichev commented 1 year ago

That's the old DIA-NN version. Now it's MaxLFQ, which uses all precursors.

jossmi commented 1 year ago

Ok, thanks for clarifying.