enasequence / sequencetools

Webin sequence validation API.
Apache License 2.0
10 stars 3 forks source link

Task/reducedflatfile #70

Closed prabh-t closed 3 years ago

prabh-t commented 3 years ago

PR related to user story ENA-4187. Related PR: ena-assembly-processing PR#8 Contains some minor refactoring, new entry writer for reduced flat file and test:

  1. Specify what is stored in reduced flat files that contain reduced header information (ID, feature table without source features, sequence also for AGP/CON, CO lines for AGP/CON).
  2. Create reduced flat files for contigs (Sequencetools). This will be the main archived file product. This will be used in assembly and transcriptome pipelines.
  3. Create reduced flat files for scaffolds (AGP/CON) (Sequencetools) . This will be the main archived file product. This will be used in assembly pipeline.