Closed alisman closed 2 months ago
@inodb @sheridancbio the strategy here will require that a new table be populated whenever data is updated. what is the right way to script this so it always happens? likewise, what is protocol for doing schema updates?
@alisman after this you could run updateDenormalizedClinicalDataViews()
: https://github.com/cBioPortal/cbioportal-core/blob/main/src/main/java/org/mskcc/cbio/portal/scripts/ImportClinicalData.java#L170
The main bottlenecks for clinical data tab are
On genie dataset (190k samples), a search takes ~30s. These optimizations bring that down to 6s.
This PR addresses these bottlenecks by