Open mgavery opened 2 years ago
maybe a more fair way to make a cut off would be something like: must be expressed in at least 5% of the cells for a given developmental stage (this would acct for diffs in the number of cells sequenced for each type of library)
There are 10,494 cells in the early dev samples
There are 3 types of libraries we made
The cleavage pool and blastula samples are part of the same "experiment" so they are combined below in a file called "CPbla"
I downloaded a table that has the "number of cells expressed" for each gene for each of these 3 types of libraries. We can use this to see what proportion of all genes are expressed at each stage and to look for overlaps/differences in genes expressed in each cluster.
As a note - these libraries all have different sequencing "saturation" meaning it might not mean much if a gene is expressed lowly in only a few cells in one stage and not the other. We probably need some kind of cutoff for that (e.g. only counted if genes expressed in >50 or >100 cells) cleavage_geneBYcellsexpressed.txt gastrula_geneBYcellsexpressed.txt blastula_geneBYcellsexpressed.txt CPbla_geneBYcellsexpressed.txt .