Open alisman opened 1 week ago
Missing sample is TCGA-AO-A1KO-01
{
"uniqueSampleKey": "VENHQS1BTy1BMUtPLTAxOmJyY2FfdGNnYV9nZGM",
"uniquePatientKey": "VENHQS1BTy1BMUtPOmJyY2FfdGNnYV9nZGM",
"sampleId": "TCGA-AO-A1KO-01",
"patientId": "TCGA-AO-A1KO",
"studyId": "brca_tcga_gdc"
}
The sample TCGA-AO-A1KO-01 doesn't have a mutation profile for this study, so it is excluded by the study view filter.
To verify, run the SQL query below.
SELECT
sp.sample_id as sampleInternalId,
sd.sample_stable_id as sampleStableId,
sd.sample_unique_id as sampleUniqueId,
gp.stable_id as geneticProfile
FROM cgds_public_v5.sample_profile sp
JOIN cgds_public_v5.sample_derived sd on sp.sample_id=sd.internal_id
JOIN cgds_public_v5.genetic_profile gp on sp.genetic_profile_id=gp.genetic_profile_id
WHERE sd.sample_stable_id='TCGA-AO-A1KO-01' AND sd.cancer_study_identifier='brca_tcga_gdc'
The ClickHouse SQL implementation applies AND logic for the given genomic profiles.
The legacy SQL might be applying OR logic. This needs further investigation to confirm.
Actually, the legacy implementation also applies AND logic, but it gets the profile information from the gene panel.
And according to the gene panel, the sample TCGA-AO-A1KO-01 has the mutations genomic profile.
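To make the discrepancy concrete, here is a minimal sketch of how the same AND filter can return different samples depending on where profile membership is read from. The sample names (S1, S2), the profile names, and the `filter_samples` helper are all hypothetical placeholders, not data or code from either implementation.

```python
from typing import Dict, Iterable, List, Set


def filter_samples(
    profiles_by_sample: Dict[str, Set[str]],
    required: Iterable[str],
    mode: str = "AND",
) -> List[str]:
    """Return the sample ids that pass a genomic-profile filter.

    AND keeps samples that have every required profile;
    OR keeps samples that have at least one of them.
    """
    required_set = set(required)
    if mode == "AND":
        return sorted(s for s, p in profiles_by_sample.items() if required_set <= p)
    if mode == "OR":
        return sorted(s for s, p in profiles_by_sample.items() if required_set & p)
    raise ValueError(f"unknown mode: {mode!r}")


# Hypothetical membership as a sample_profile-style table might report it:
# S2 has no mutation profile.
from_sample_profile = {"S1": {"mutations", "cna"}, "S2": {"cna"}}

# Hypothetical membership as gene-panel-style data might report it:
# S2 is listed for the mutation profile as well.
from_gene_panel = {"S1": {"mutations", "cna"}, "S2": {"mutations", "cna"}}

required = ["mutations", "cna"]

and_from_sample_profile = filter_samples(from_sample_profile, required, "AND")
and_from_gene_panel = filter_samples(from_gene_panel, required, "AND")
or_from_sample_profile = filter_samples(from_sample_profile, required, "OR")

print(and_from_sample_profile)  # S2 is dropped
print(and_from_gene_panel)      # S2 is kept
print(or_from_sample_profile)   # S2 is kept, but for a different reason
```

Note that OR logic over the first source and AND logic over the second source return the same samples here, which may be why the difference initially looked like an AND/OR mismatch.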
@onursumer I don't really understand how the gene panel can be used because, unless I'm totally mistaken, there is no relation from gene panel to sample. The gene panel only says which genes are profiled by a given genetic_profile? So I guess the question is: how is the above genePanelData derived?
Here we get the sample id from the gene panel datum via datum.getSampleId(). Not sure how we integrate the sample id into the gene panel data, but here is the related class member.