Open AkilaWijerathnaYapa opened 8 months ago
Protein names & genes are read correctly from UniProt fastas, from all other fastas - not guaranteed. But solution is simple (like 20 min of work maybe): load the FASTA in R using some R package and use Protein.Group entries to generate correct Protein.Names and Genes. When having this issue, please also set DIA-NN's implicit protein grouping to Isoforms.
Thank you Vadim. Could you please let me know what do you mean by "set DIA-NN's implicit protein grouping to Isoforms"? Where to find this setting in DIA-NN GUI? Do I have to re-run the DIA-NN analysis? Please share if there's any tutorial is available.
'Protein inference' setting
You can rerun the analysis with 'Use .existing quant files' enabled
Thank you Vadim.
I am performing DIA-NN from FASTA digest library for Arabidopsis thaliana. I got FASTA file from plants.ensembl
However after smooth DIA-NN run, in final report.tsv file all corresponding Proteins.Ids, both Protein.Names and Genes column data shows as pep.
What might be the issue? Is this because of FASTA file data? or Can I trust the report.tsv file data?