Closed Vlad-Dembrovskyi closed 2 years ago
Currently this is the method:
I would like to:
Example of manifest.json file (we will use the reads.csv to subset the "file_name" in the manifest(there could be several file types in the manifest):
Example of manifest.csv file (we will use the reads.csv to subset the second column/"file_name" in the manifest):
Alternatively, we could give the specimen id GTEX-XXXX-XXXX-XX-XXXXX and it subsets the manifest for the .bam file entries.
This is possibly preferred, but it is important to note:
Addition: we can save the filenames requested but not found in original manifest file into a not_found_GTEX_samples.txt file. We Should also print them as warnings to stdout.
Currently we need to manually edit the manifest file before using it for pipeline to only include the samples of interest. We need a way to only provide samples of interest and manifest to pipeline so that pipeline edits the manifest itself. Example code following shortly.