Closed bbimber closed 3 years ago
One aside: my thinking is that the CSV format would be a friendly, human-editable way to prepare and augment the library and its metadata. Assuming we can reference GenBank in this CSV, we would not need to store the literal sequence, so the files should stay relatively small and be suitable to check into repos. The user (or an automated test) would take one of those inputs, run nimble to hydrate it and produce the compiled JSON, and then align.
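To make the idea concrete, here is a rough Python sketch of what "hydrating" such a CSV could look like. Everything in it is an assumption for illustration: the column names (`name`, `genbank`, `lineage`), the `hydrate_library` helper, and the output layout are hypothetical and are not nimble's actual CSV schema or compiled JSON format. The sequence lookup is passed in as a callable so that a real GenBank fetch or a test stub can be swapped in.

```python
import csv
import io
import json


def hydrate_library(csv_text, fetch_sequence):
    """Turn a small metadata CSV into a JSON-ready dict.

    csv_text: CSV content with (hypothetical) columns name, genbank, plus
              any number of extra metadata columns.
    fetch_sequence: callable mapping an accession string to a sequence string.
                    In practice this would query GenBank; here it is left to
                    the caller so the sketch stays self-contained.
    """
    records = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Normalise whitespace; tolerate short rows where a value is None.
        row = {k: (v or "").strip() for k, v in row.items() if k is not None}
        accession = row["genbank"]
        records.append({
            "name": row["name"],
            "genbank": accession,
            # The CSV stores only the accession; the sequence is resolved here.
            "sequence": fetch_sequence(accession),
            # Every remaining column is carried along as free-form metadata.
            "metadata": {k: v for k, v in row.items()
                         if k not in ("name", "genbank")},
        })
    return {"records": records}


# Example with placeholder accessions and a stubbed lookup (not real GenBank data).
example_csv = (
    "name,genbank,lineage\n"
    "ref-A,ACC0001,lineage-1\n"
    "ref-B,ACC0002,lineage-2\n"
)
stub_sequences = {"ACC0001": "ACGT", "ACC0002": "GGCC"}
library = hydrate_library(example_csv, stub_sequences.__getitem__)
print(json.dumps(library, indent=2))
```

The point of the sketch is only that the checked-in file carries accessions and metadata, while the sequences are filled in at hydration time; an automated test could use a stubbed lookup like the one above instead of hitting the network.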
The current workflow is:
There is no good reason step 2 should exist. I'd propose:
In the first step, the input patterns could be: