Closed amvarani closed 2 years ago
Hello Alessandro,
I think the problem here is that the folder you are supplying as an argument to from_list_gene_data_and_hdf5_dir
also contains the GeneData, TEData, and overlap h5 files. It needs to be a folder of just the standard output alone. I.e chr1.h5
etc.
I will modify the docstring and error message of the from_list_gene_data_and_hdf5_dir
function to be more specific for future users but I won't be able to get to it until later next week or so.
Thanks a lot! I realized that and fixed it! The script is now working!
I have another question: In your manuscript, you stated something like this regarding figure 2:
"However these observed density differences in upstream and downstream values are not significantly different based on a chi-square test (χ2 = 0.0056; p-value ≥0.9). "
When I run the chi_squared_density_comparison.py, the results provided are a tsv file contaning two columns (see below)
How can I calculate the p-value and x2 values ?
For instance:
chr4_Up chr4_Down 0.0 0.0 0.0 0.0 0.0 0.62637365 0.62637365 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.994006 0.13886113 0.2977023 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.22277722 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
Hello Alessandro,
I apologize, I wish I could be more help here. That chi_squared_density_comparison.py
code is messy, as it was done right before the holiday season last year at the request of a reviewer and I never got around to instituting a better method. I envisioned that user's would implement their own statistical analysis of these values in R or Python, this is something you will need to do on your own here.
That being said, I would suggest you check out the scipy module for their stats packages if you plan to conduct your analysis in Python. They have good documentation on their functions.
Hello Alessandro,
May I close this issue?
Hi there I'm trying to run the chi_squared_density_comparison.py script, but got the error below.