Eliminate the need to purge metadata from the pipeline.

Sequencing runs from MiSeq and HiSeq were sometimes determined to be unsuccessful after the FASTQ data have been subjected to the DEL analysis pipeline. If the counts are too low, then the sequencing must be re-done by the original sequencing lab or by an alternative lab. As a consequence, the DEL analysis pipeline contains residual metadata from the first sequencing run. Scientists had requested to purge old metadata from the pipeline for fear that it will interfere with future analysis.

It is not sustainable to continue accommodating DEL analysis app's users requests to delete sample metadata and run metadata for whatever reason. Real multi-user web apps don't accommodate that kind of request from users.

The immediate solution is to instruct users not to reuse run_ids and samp_ids if a sequencing run fails and needs to be resequenced.

The long-term solution is to re-design the app to use a relational database that is implemented with auto-incremented indices as the primary keys of the run and sample metadata, which will allow users to re-use run_ids and samp_ids when re-doing sequencing runs.

broadinstitute / chem-bio-dos-del

Eliminate the need to purge metadata from the pipeline. #9