NBISweden / pipelines-nextflow

A set of workflows written in Nextflow for Genome Annotation.
GNU General Public License v3.0

[New pipeline] AnnotationToENA #30

Open Juke34 opened 4 years ago

Juke34 commented 4 years ago

See #17 for the general picture.

Here is a description of the AnnotationToENA pipeline we need:

Input files (2): the GFF file along with the FASTA file.

Tools needed: AGAT and EMBLmyGFF3, both available from Bioconda, and webin-cli-.jar from https://github.com/enasequence/webin-cli (they provide a Docker image; we could create a Bioconda recipe).

Output file (1): EMBL flat file.

Required parameters (all for EMBLmyGFF3):

Step1: agat_sp_flag_short_intron.pl --gff annotation.gff -o annotation_short_intron_flagged.gff

Step2: agat_sp_fix_features_locations_duplicated.pl --gff annotation_short_intron_flagged.gff -o annotation_short_intron_flagged_duplicated_location_fixed.gff

Step3:
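As a rough sketch only, Step1 and Step2 could be chained in a small shell wrapper before being ported to Nextflow processes. The function names `run` and `prepare_annotation` and the `DRY_RUN` variable are hypothetical and not part of AGAT; the two commands and the file-name suffixes are the ones quoted above. Step3 is not specified in this issue, so it is not filled in.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Hypothetical wrapper for Step1 and Step2 of this issue.
# Output names are derived from the input name by appending the same
# suffixes used in the commands quoted in the issue.
prepare_annotation() {
    local gff="$1"
    local base="${gff%.gff}"
    local flagged="${base}_short_intron_flagged.gff"
    local fixed="${base}_short_intron_flagged_duplicated_location_fixed.gff"

    # Step1: flag short introns.
    run agat_sp_flag_short_intron.pl --gff "$gff" -o "$flagged"
    # Step2: fix features with duplicated locations.
    run agat_sp_fix_features_locations_duplicated.pl --gff "$flagged" -o "$fixed"
    # Step3: left unspecified in the issue, intentionally not implemented here.
}
```

Calling `DRY_RUN=1 prepare_annotation annotation.gff` prints the two commands without running them, which is handy for checking the file-name chaining when AGAT is not installed.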

Step4: validate the result with Webin-CLI, the command-line submission program, which supports validation through its -validate option. See https://github.com/enasequence/webin-cli
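For illustration, a validation call might be assembled as below. This is a sketch under assumptions: the helper `webin_validate_cmd`, the `WEBIN_JAR` and `WEBIN_USER` variables, the `Webin-XXXXX` placeholder, the manifest file name, and the choice of the `genome` context are all hypothetical here. The helper only prints the command so that it can be reviewed before anything is sent to ENA.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: print a Webin-CLI validation command for a manifest.
# WEBIN_JAR must point at the downloaded JAR; its exact file name is not
# assumed here. The password is printed as a shell variable reference so
# that no secret ends up in logs.
webin_validate_cmd() {
    local manifest="$1"
    local context="${2:-genome}"   # assumption: genome context for an annotated assembly
    printf 'java -jar %s -context %s -manifest %s -userName %s -password %s -validate\n' \
        "${WEBIN_JAR:?set WEBIN_JAR to the Webin-CLI jar path}" \
        "$context" \
        "$manifest" \
        "${WEBIN_USER:-Webin-XXXXX}" \
        '"$WEBIN_PASSWORD"'
}
```

For example, `WEBIN_JAR=/path/to/jar webin_validate_cmd manifest.txt` prints a single `java -jar ... -validate` line. The manifest would reference the EMBL flat file produced by the earlier steps.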