Closed chris-kreitzer closed 3 years ago
``06/17: updates on annotation transformation
The main goal is to provide unique gene_names in the Seurat object, so that we can somehow make sense of downstream analysis.
We have encountered several obstacles throughout this conversion; however, I now present some suggestions.
gene_short_name
from NVE models whenever a direct match between NVE and NV2
is possible. NV2
ID (mere number) if there is no match with NVE
NV2.'s
for one NVE
(there are obviously duplicated entries in the data.frame), so I just append a
, b
, etc. (length of duplication to respective gene_short_name
so that all entries are getting unique. NVE(...) matches NV2.108 & NV2.109; Since we adopt
ELK1-like annotation
from theNVE
, this entry (ELK1-like) is obviously duplicated.
I added nowELK1-like_a
for NV2.108 andELK1-like_b
for NV2.109; meaning that both products are essentially the same but got a unique identifier.
06/15: scRNA-analysis
I started with copying all the 4 libraries from the cluster to the local machine
(following Alison's script; with some modifications)
Excluding mitochondrial genes
Work with NV2_ID's rather than gene_short_names, as
rownames(lib1)
are the IDs and not the gene_short_names (as in Alison's annotation table)