Expression data - Githubissues

Hi, I am sorry for the delay becuase I did not receive any reminder.... The expression data can be downloaded from the AWS links in the 'Data sources' part, like 'https://s3.amazonaws.com/mousescexpression/rank_total_gene_rpkm.h5'.

The expression data has the shape of N*M where N is cell number and M is gene number.

The shape of NEPDF data is X 32 32, where X depends on the length of gene pair list you would like to train and test. For example, the number of all possible gene pairs are M M, while you only focus on X pairs of them and you will generate one 32 32 histogram matrix for each gene pair of the whole X pairs, where 32 means that the expression range of the gene is uniformly divided into 32 bins. The whole idea for CNNC can be found in the paper 'Deep learning for inferring gene relationships from single-cell expression data'. The compression is through the 32 * 32 histogram generation for each gene pair. Best

xiaoyeye / CNNC

Expression data #4