Closed tarak77 closed 5 years ago
Hey Tarak,
First of all, I notice that the vast majority of your contacts' haplotypes are 0
. This is not suitable for imputation because you're only getting contacts from one haplotype (0
) of a diploid.
I see. Is there a way around this problem?
The data I have is from an egg cell. Could I skip the impute
steps?
I think it'll be very difficult to obtain a 3D structure for an egg cell, because a large fraction of its genomes are completely identical between the two copies.
Aren't egg cells haploid? Just wondering if either this repo or hickit could help model those?
At least in human, eggs are diploid. They only become haploid upon fertilization, by which time they're called "female pronuclei".
Btw if your experiments did not remove the polar body (the first polar body, in this case) from the egg, they'll be tetraploid.
As a result, this repo will only be able to model the part where the two copies are not identical (as a result of crossovers), but not the part where the two copies are identical.
The way I am thinking is that before fertilisation, the unfertilised egg and sperm are haploid(with single copy of each chromosome be it mouse or human). In the zygote, you will have male and female pronuclei as two spheres, each with 23 chromosomes..
Am I still missing something?
This figure is not accurate. Please refer to appropriate textbooks on meiosis or reproductive biology.
Oh okay. Thanks!
Hey @tanlongzhi , This might be similar to issue https://github.com/tanlongzhi/dip-c/issues/26#issue-407292326 but i was trying to model the data from an unfertilized egg cell. To do so, I use hickit for the initial preprocessing steps and then for 3D reconstruction and imputation I use this repo.
To use this repo I convert the obtained
impute.pairs
andcontact.pairs
files from hickit to.con
files byusing hickit_pairs_to_con.sh
andhickit_impute_pairs_to_con.sh
scripts.During the first round of 3D reconstruction, I am seeing that the
.3dg
file has both(mat)
and(pat)
rows. I find this strange because the.ncc
file only has one haplotype notation(pat)
. Also note here that even though the data is from an egg cell, my0
is actuallymat
and1
ispat
. Because I messed up while defining the SNP file, so thats why i have(pat)
in my.ncc
file.So thats why maybe when implementing the
dip-c impute3 -3
code, i get an errorAgain, your help will be great!
Below i have attched the files i mentioned above.
U_1CSE-35.tar.gz