Closed ryandikdan closed 3 years ago
Silly me! I was using the gtf instead of the txt file that the fetch_ucsc.py script generated! With that said I had to use the GENCODE known genes list instead of refseq since the refseq annotations wouldn't download through the script. I'll close this issue, but for anyone having the same issue just use the kg or GENCODE annotations! (I think they're more comprehensive anyways, at least from my cursory searching.)
Hello, I was unable to fetch the appropriate files, but I did just look in the python script and downloaded the ref gtf manually. I guess this is the same as using the Table browser. More importantly, using a gtf from the table browser, or the one from that fetch_ucsc script, I get this error when running the parser:
It seems that the gtf isn't formatted properly or something since it's expecting a number and is getting a RefSeq number. Perhaps it isn't splitting properly cuz I don't have the right python packages? I did generate a fresh conda environment in python 2.7 and install via conda so the dependencies should be fine.