Closed rxu17 closed 8 months ago
Also feels like we need to also enforce an expected schema for the full maf file during the end of process_mutation given that we have this new variable genomic_location_explanation
.
:tada: All dependencies have been resolved !
The SonarCloud Quality Gate passed, but some issues were introduced.
9 New issues
0 Security Hotspots
No data about Coverage
0.0% Duplication on New Code
It was decided to just use a list of release variables for the full maf file to subset out the non-release variables.
Purpose: This is a draft PR. This PR is only allowing in certain release variables (excluding out the
genomic_location_explanation
andAnnotation_Status
variables for example) for the full maf csv.Changes:
create_missing_columns
inprocess_functions.py
Decided with this implementation on excluding variables because we have older full maf files that may have been processed 1-N releases ago, and doesn't contain new columns and features we added to maf processing since then. Because we append old full maf files and more recently processed maf files, we want to make sure they conform to a specific expected schema so we set the expected columns they should have and if they don't have the column, it's created and filled with missing values.Testing:
Depends on #545