Open drtamermansour opened 4 years ago
This experiment will be associated with several types of files:
Now, when we look at the metadata of a file, we need to consider the type of the file. In a perfect world, each raw data file should contain data from one platform, or even one platform unit if the platform has multiple units (e.g. an Illumina machine has multiple lanes). Intermediate and final output files should annotate every data aggregate. For example, in a BAM file, we should have annotation for every read group (or the platform unit if available). With VCF files or count tables, we should have annotation for every column.
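To make the BAM case concrete, here is a minimal sketch of checking that every read group carries annotation. It assumes the header is available as SAM-style text, where each `@RG` line holds tab-separated `TAG:value` pairs such as `ID`, `SM` (sample), `PL` (platform) and `PU` (platform unit). The header content, the function names, and the list of tags treated as required are all hypothetical choices for illustration, not a rule of the format.

```python
# Hypothetical SAM header text; sample, library and flowcell names are made up.
EXAMPLE_HEADER = "\n".join([
    "@HD\tVN:1.6\tSO:coordinate",
    "@RG\tID:rg1\tSM:tumor_01\tPL:ILLUMINA\tPU:FLOWCELL1.1\tLB:lib1",
    "@RG\tID:rg2\tSM:tumor_01\tPL:ILLUMINA\tLB:lib1",
])

# Assumed annotation policy for this sketch, not something the SAM format enforces.
REQUIRED_TAGS = ("SM", "PL", "PU")


def parse_read_groups(header_text):
    """Return {read_group_id: {tag: value}} for every @RG header line."""
    read_groups = {}
    for line in header_text.splitlines():
        if not line.startswith("@RG"):
            continue
        # Skip the "@RG" record type, then split each field on the first colon only,
        # because values may themselves contain colons.
        fields = dict(field.split(":", 1) for field in line.split("\t")[1:])
        read_groups[fields["ID"]] = fields
    return read_groups


def missing_annotations(read_groups, required=REQUIRED_TAGS):
    """Return {read_group_id: [missing tags]} for incompletely annotated read groups."""
    report = {}
    for rg_id, tags in read_groups.items():
        missing = [tag for tag in required if tag not in tags]
        if missing:
            report[rg_id] = missing
    return report


if __name__ == "__main__":
    groups = parse_read_groups(EXAMPLE_HEADER)
    print(missing_annotations(groups))
```

In this made-up header the second read group has no `PU` tag, so it is the one that gets reported. The same idea would apply to VCF files or count tables by iterating over column names instead of read groups.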
Before we start, let us talk about one experimental design as an example to highlight what metadata we might need to annotate the files of this experiment. We have a sequencing experiment to test RNA expression and DNA mutations in skin cancer.