galaxyproject / training-material

A collection of Galaxy-related training material
https://training.galaxyproject.org
MIT License

Workflow for Proteogenomics 1 #2354

Closed assuntad23 closed 3 years ago

assuntad23 commented 3 years ago

In the course of testing for Smörgåsbord, I came across a few errors with the workflow for https://training.galaxyproject.org/training-material/topics/proteomics/tutorials/proteogenomics-dbcreation/tutorial.html

assuntad23 commented 3 years ago

It's also worth noting that the tutorial assumes the user will know to use the regex tool "Replace text in a specific column" for the step "Change the chromosome names in the Ensembl GTF to match a UCSC genome reference".

I feel like that should be more explicit, and/or perhaps that regular expression step should be part of the Workflow.
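
For readers unfamiliar with this kind of step, here is a minimal sketch of what such a column-1 regex replacement does, written in plain Python rather than as Galaxy tool parameters. The patterns below are assumptions for illustration (numeric and X/Y names gain a `chr` prefix, `MT` becomes `chrM`); the exact expressions used in the tutorial may differ, and the function names are hypothetical.

```python
import re

# Assumed pattern: purely numeric names or X/Y. Not taken from the tutorial.
_NUMERIC_OR_SEX = re.compile(r"^(\d+|[XY])$")


def ensembl_to_ucsc(name: str) -> str:
    """Map an Ensembl-style sequence name to a UCSC-style one (illustrative)."""
    if name == "MT":
        # Assumed convention: Ensembl "MT" corresponds to UCSC "chrM".
        return "chrM"
    # Names that do not match (scaffolds, already-prefixed names) are unchanged.
    return _NUMERIC_OR_SEX.sub(r"chr\1", name)


def rename_gtf_line(line: str) -> str:
    """Rewrite only column 1 of a tab-separated GTF line."""
    if line.startswith("#") or not line.strip():
        # Header/comment lines and blank lines pass through untouched.
        return line
    fields = line.split("\t")
    fields[0] = ensembl_to_ucsc(fields[0])
    return "\t".join(fields)
```

Applied line by line to a GTF file, this leaves every column except the first untouched, which is the same effect the column-restricted regex tool is meant to have.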

subinamehta commented 3 years ago

Thank you @assuntad23 for testing our training. I tested the workflow but couldn't replicate the FreeBayes and gffcompare error in Galaxy (https://usegalaxy.eu/u/galaxyp/h/test). However, I have added a note to the GTN about the "Tabular-to-Fasta" step (to specify column 1 as the title). I have also added more information about the regex statement for "Replace text in a specific column".

natefoo commented 3 years ago

@subinamehta I think this is because the GTF used as input for StringTie in the tutorial is the original uploaded version, not the one whose chromosome names we've changed. I am working on a few clarifications in #2368 and mention it there.
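
A quick way to spot this kind of mismatch is to compare the sequence names in a GTF against the names the reference uses. The sketch below is illustrative only: the function names are hypothetical, and the reference names are assumed to be supplied as a plain collection of strings rather than read from any particular Galaxy dataset.

```python
def gtf_seqnames(lines):
    """Collect the distinct column-1 sequence names from GTF lines."""
    names = set()
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue  # skip header/comment and blank lines
        names.add(line.split("\t", 1)[0])
    return names


def unmatched_seqnames(gtf_lines, reference_names):
    """Return GTF sequence names that are absent from the reference, sorted."""
    return sorted(gtf_seqnames(gtf_lines) - set(reference_names))
```

If the original (unrenamed) Ensembl GTF is checked against UCSC-style reference names, names such as `1` or `X` would show up in the result, whereas a renamed GTF would be expected to return few or no entries.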