biobricks-ai / eutoxrisk-temposeq

Data from Biostudies EUToxRisk studies

pathways Parquet: some of the p-values are 0.00 #4

Open zmughal opened 2 months ago

zmughal commented 2 months ago

Need to check this in case there was an error in conversion.

asmaa-a-abdelwahab commented 2 months ago

I think this issue is in the dataset Parquet, not the pathway Parquet. In gene expression analysis with TempO-Seq or similar platforms, entries without a logFC (log fold change), p-value, and adjusted p-value typically indicate that there was insufficient data to perform a statistical comparison for those genes. If the issue is confirmed to be in the dataset Parquet, we can add a line to filter out the empty entries from the dataset before saving.
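To confirm where the problem sits, something like the following could be run against each Parquet file. This is only a rough sketch using pandas: the column names `logFC`, `pvalue`, and `padj` and both helper functions are assumptions for illustration, not the actual schema or code in this repository, and the toy frame stands in for whatever `pd.read_parquet` would return for the real file.

```python
import pandas as pd

# Assumed column names; the real Parquet schema may use different ones.
STAT_COLUMNS = ["logFC", "pvalue", "padj"]


def summarize_stat_columns(df, stat_columns=STAT_COLUMNS):
    """Count missing values and exact zeros in each statistics column."""
    stats = df[stat_columns]
    return pd.DataFrame({
        "n_missing": stats.isna().sum(),
        "n_zero": (stats == 0).sum(),
    })


def drop_empty_entries(df, stat_columns=STAT_COLUMNS):
    """Drop rows in which every statistics column is missing."""
    return df.dropna(subset=stat_columns, how="all")


# Toy data standing in for the result of pd.read_parquet on the real file.
example = pd.DataFrame({
    "gene": ["A", "B", "C", "D"],
    "logFC": [1.2, None, -0.4, None],
    "pvalue": [0.0, None, 0.03, None],
    "padj": [0.0, None, 0.2, None],
})

summary = summarize_stat_columns(example)
filtered = drop_empty_entries(example)
print(summary)
print(filtered)
```

Separating the count of missing values from the count of exact zeros should show whether the 0.00 entries are genuine zeros in the source data or empty entries that were filled in during conversion. Filtering with `how="all"` only removes rows where all three statistics are absent, so partially populated rows are kept for inspection.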