calico / scBasset

Sequence-based Modeling of single-cell ATAC-seq using Convolutional Neural Networks.
Apache License 2.0
89 stars 12 forks source link

Fasta file #8

Closed fym0503 closed 1 year ago

fym0503 commented 1 year ago

Hi, You mention that Genome fasta file can be downloaded from UCSC. Could you please specify which files to download because there are a great number of files? Thanks

hy395 commented 1 year ago

Hi,

Sure. What you need is only the .fa file. For example, for hg38, you only need the hg38.fa.gz file from here: https://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/, and then just gunzip it.