Closed andrewelamb closed 4 years ago
This should be repaired. Please pull staging.
@jonryser
The pcawg relationships did make it in, but there's are a lot of relationships that are now missing:
paste0(
"SELECT ttt.tag_id, ttt.related_tag_id, ta.name as parent, tb.name as child ",
"FROM tags_to_tags ttt ",
"INNER JOIN tags ta on ttt.related_tag_id = ta.id ",
"INNER JOIN tags tb on ttt.tag_id = tb.id"
) %>% perform_query()
# A tibble: 20 x 4
tag_id related_tag_id parent child
<int> <int> <chr> <chr>
1 20 110 PCAWG_Study BLCA-US
2 27 110 PCAWG_Study BRCA-US
3 28 64 Immune_Subtype C1
4 29 64 Immune_Subtype C2
5 30 64 Immune_Subtype C3
6 31 64 Immune_Subtype C4
7 33 64 Immune_Subtype C6
8 36 110 PCAWG_Study CLLE-ES
9 40 110 PCAWG_Study ESAD-UK
10 64 135 TCGA Immune_Subtype
11 64 109 PCAWG Immune_Subtype
12 86 110 PCAWG_Study LIRI-JP
13 99 110 PCAWG_Study MALY-DE
14 102 110 PCAWG_Study OV-AU
15 108 110 PCAWG_Study PACA-AU
16 110 109 PCAWG PCAWG_Study
17 126 110 PCAWG_Study RECA-EU
18 136 135 TCGA TCGA_Study
19 137 135 TCGA TCGA_Subtype
20 146 110 PCAWG_Study THCA-US
Immune_Subtype C5 is missing as are all of the TCGA studies and TCGA subtypes
This should result in all the different pcawg study types:
Also some dataset to tags relationships are missing:
Immune subtype should be related to both TCGA and PCAWG, PCAWG study should be related to PCAWG