Should we have COMPLEX: in the gene symbol columns as well as the Uniprot columns?
For using squidpy and chinpy, it expects this, so I wrote this code to update the list:
intercell_vienna = pd.read_csv("https://raw.githubusercontent.com/JanLeipzig/ViennaCCCdb/main/ViennaCCCdb.csv", sep="\t")
# add complexes
intercell_vienna["source_genesymbol"] = ["COMPLEX:" + x if "_" in x and "COMPLEX:" not in x else x for x in intercell_vienna["source_genesymbol"]]
intercell_vienna["target_genesymbol"] = ["COMPLEX:" + x if "_" in x and "COMPLEX:" not in x else x for x in intercell_vienna["target_genesymbol"]]
But if other tools will mess up with it (need to double check others), maybe best to leave out? I think I vaguely remember the cell2cell vectors needing without this string, and maybe LIANA as well.
Should we have COMPLEX: in the gene symbol columns as well as the Uniprot columns?
For using squidpy and chinpy, it expects this, so I wrote this code to update the list:
But if other tools will mess up with it (need to double check others), maybe best to leave out? I think I vaguely remember the cell2cell vectors needing without this string, and maybe LIANA as well.