jump-cellpainting / JUMP-Target

Lists and 384-well plate maps of compounds and genetic perturbations designed to assess connectivity in profiling assays
MIT License

Matching samples from JUMP-Target-1-Compound and JUMP-Target-2-Compound #18

Open shntnu opened 3 years ago

shntnu commented 3 years ago

InChIKey is sufficient for linking the two compound sets: the broad_sample identifiers differ between them (only 178 are shared), but all 307 InChIKeys match.

suppressPackageStartupMessages(library(tidyverse))
suppressPackageStartupMessages(library(magrittr))

t1 <- read_tsv("https://raw.githubusercontent.com/jump-cellpainting/JUMP-Target/master/JUMP-Target-1_compound_metadata.tsv")

t2 <- read_tsv("https://raw.githubusercontent.com/jump-cellpainting/JUMP-Target/master/JUMP-Target-2_compound_metadata.tsv")

t1 %>% distinct(InChIKey) %>% count()
#> # A tibble: 1 x 1
#>       n
#>   <int>
#> 1   307

t2 %>% distinct(InChIKey) %>% count()
#> # A tibble: 1 x 1
#>       n
#>   <int>
#> 1   307

# broad_sample IDs present in both sets
inner_join(t1 %>% distinct(broad_sample), t2 %>% distinct(broad_sample)) %>% count()
#> Joining, by = "broad_sample"
#> # A tibble: 1 x 1
#>       n
#>   <int>
#> 1   178

# InChIKeys present in both sets
inner_join(t1 %>% distinct(InChIKey), t2 %>% distinct(InChIKey)) %>% count()
#> Joining, by = "InChIKey"
#> # A tibble: 1 x 1
#>       n
#>   <int>
#> 1   307

Created on 2021-05-03 by the reprex package (v0.3.0)
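
Since InChIKey matches across both sets, one way to build an explicit broad_sample-to-broad_sample lookup is to join on InChIKey and keep both ID columns. The following is a minimal sketch, not part of the original reprex: `t1_toy` and `t2_toy` are small made-up tables standing in for `t1` and `t2`, and the `_t1`/`_t2` suffixes and the `mapping`/`changed` names are illustrative choices.

```r
suppressPackageStartupMessages(library(dplyr))

# Toy stand-ins for the two metadata tables (hypothetical values, not real data)
t1_toy <- tibble(
  broad_sample = c("BRD-A", "BRD-B", "BRD-C"),
  InChIKey     = c("KEY1", "KEY2", "KEY3")
)
t2_toy <- tibble(
  broad_sample = c("BRD-A", "BRD-X", "BRD-Y"),
  InChIKey     = c("KEY1", "KEY2", "KEY3")
)

# Join on InChIKey, keeping the broad_sample from each set side by side
mapping <- inner_join(
  t1_toy %>% distinct(InChIKey, broad_sample),
  t2_toy %>% distinct(InChIKey, broad_sample),
  by = "InChIKey",
  suffix = c("_t1", "_t2")
)

# Compounds whose broad_sample ID differs between the two sets
changed <- mapping %>% filter(broad_sample_t1 != broad_sample_t2)

print(mapping)
print(changed)
```

With the real tables, substituting `t1` and `t2` for the toy tibbles should give the same kind of lookup. Note that if an InChIKey maps to more than one broad_sample within a set, the join will return one row per combination.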