Open KochTobi opened 4 months ago
Most of it looks good. i am not sure 100% about the 'muliplexed runs' thing though, what is meant by this. From what i know is that usually from a let's say Miseq BCL file (multiplexed), demultiplexed fastq files (1-many files per sample barcode) can be produced. Thus demultiplexing is the step from BCL to fastq.
Example would more be of this:
although this is also simplified. But the key is: it is lane003 in this example across samples. Let assume all files of lane003 might be corrupted and should be investigated in this case. then a file name search with a regex for "L003" would help.
To me it looks like the download API can make use of a query parameter e.g. filterType
and filter
.
I suggest:
filterType
:
filter
:
Is your feature request related to a problem? Please describe.
Projects with many samples can produce a large amount of data in one measurement. To analyse effects only observed in a subset of the generated data, a download of the whole measurement leads to a download containing all files. Many of the files are not of interest for the problem analysis. Problems could be data corruption or data quality issues. Issues could occur during muliplexed runs where only a subset is of interest.
Describe the solution you'd like As files of interest are known beforehand, a filter for a part of the filename would help to only download files of interest.
Describe alternatives you've considered @qbicStefanC any ideas?
Additional context