nf-core / seqinspector

QC pipeline to inspect your sequences
https://nf-co.re/seqinspector
MIT License
3 stars 13 forks source link

Input structure #1

Closed alneberg closed 3 months ago

alneberg commented 3 months ago

Description of feature

Discussion issue of the input structure.

There are at least 5 levels of metadata for a single input fastq:

  1. File ID
  2. Sample ID
  3. Lane ID
  4. Flowcell ID
  5. Project ID

The flowcell ID is connected to a run folder where metrics for the flowcell can be found (Illumina use case mainly).

alneberg commented 3 months ago

PXL_20240318_105906903

Aratz commented 3 months ago

That's the test data I was talking about. It needs to be demultiplexed first though

https://github.com/nf-core/test-datasets/tree/demultiplex/testdata/NovaSeq6000