Add additional information to reports

Fedja commented 4 years ago

GenenTech have been scraping and joining additional information to top report and variant reports, which has been complicated by changing file structure from our part. We should deliver these automatically. Attached are example files from Sarah Pendergrass.

These should be in R6 reportings!

Main things:

phenotype info: longname, category, n cases, n controls (create single file from these when phenotypes are releases)
previous release beta and p-value
ukbb replication (they had used the same old ukbb results we show in browser). Ultimately we want to use the new saige results and phenotype mapping but can add this now. I communicated that we'll tackle others first and then see if we have time to do proper implementation before next release

EnrichedVariants_Example.txt GroupReports_Example.txt

Lipastomies commented 4 years ago

Alright, let's think about this.

The phenotype info is straightforward as soon as we have the single file where these things are available.

The previous release data is quite straightforward, it's basically similar to the other annotations, the file is just different. Should not be too hard to implement, most likely the previous release files just need to be added to the wdl input array for this to be easily done. It probably makes sense to separate the WDL work from the script changes into their own PRs.

The UKBB replication could be done either as a separate comparison datasource, or we could add it as an annotation. The main challenge here is probably how we couple the UKBB data to the script - Do we have some sort of API or some other file from which we pull these results.

Lipastomies commented 4 years ago

The longname, category, n cases/controls are found in a file named finngen_R5_pheno_n.tsv in green library. I'll add that as an annotation file. I'll use that as the phenotype info file.

FINNGEN / autoreporting

Add additional information to reports #118