calico / basenji

Sequential regulatory activity predictions with deep convolutional neural networks.
Apache License 2.0
410 stars 126 forks source link

Mouse Cell Atlas reads are aligned to mm9 whereas basenji predicts from mm10 #124

Closed GrossTor closed 2 years ago

GrossTor commented 2 years ago

The Cross-species regulatory sequence activity prediction project uses bigwig tracks from the mouse cell atlas. Those were generated from ATACseq reads that were aligned to the mm9 genome assembly. However, basenji tries to predict those tracks from the sequence of the mm10 genome assembly (other mouse tracks e.g. from Fantom5 are aligned to mm10 as well).

Could you explain how this discrepancy was resolved? I see multiple solutions:

What would you recommend?

davek44 commented 2 years ago

That's correct. Crossmap seems good. I converted to bedgraph, then used UCSC liftover.

GrossTor commented 2 years ago

Makes sense. Thank you!