biocommons / uta

Universal Transcript Archive: comprehensive genome-transcript alignments; multiple transcript sources, versions, and alignment methods; available as a docker image
Apache License 2.0

feat(IPVC-2231): add script to parse genomic gff #253

Closed sptaylor closed 6 months ago

sptaylor commented 6 months ago

This adds a new script for parsing genomic GFF files. I read from the ticket that we wanted a new script, but if we'd rather replace the existing one, I can do that instead.
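For readers unfamiliar with the format, here is a minimal sketch of what parsing a GFF3 feature line involves. It is not the script added in this PR: the function name `parse_gff_line`, the returned dict layout, and the sample line (including its accession and coordinates) are hypothetical and only illustrate the nine tab-separated columns and the `key=value` attribute column.

```python
from urllib.parse import unquote


def parse_gff_line(line):
    """Parse one GFF3 feature line into a dict; return None for comments and blank lines.

    Hypothetical helper for illustration only, not the parser from this PR.
    """
    line = line.rstrip("\n")
    if not line or line.startswith("#"):
        return None

    fields = line.split("\t")
    if len(fields) != 9:
        raise ValueError(f"expected 9 tab-separated columns, got {len(fields)}")
    seqid, source, ftype, start, end, score, strand, phase, attrs = fields

    # Column 9 holds semicolon-separated key=value pairs with URL-style escapes.
    attributes = {}
    for pair in attrs.split(";"):
        if "=" in pair:
            key, value = pair.split("=", 1)
            attributes[key.strip()] = unquote(value.strip())

    return {
        "seqid": seqid,
        "source": source,
        "type": ftype,
        "start": int(start),  # GFF coordinates are 1-based and inclusive
        "end": int(end),
        "strand": strand,
        "attributes": attributes,
    }


# Made-up example line; the accession and coordinates are placeholders.
example = "chrExample\tExampleSource\texon\t100\t250\t.\t+\t.\tID=exon-1;Parent=rna-1;transcript_id=NM_000000.0"
record = parse_gff_line(example)
```

A real script would also need to group exon records by transcript and handle multi-valued attributes, which this sketch leaves out.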

I added shell scripts that run these steps, just to demonstrate that they work. I expect we'll refactor this so that we pull the database only once. The script runs on the full slice, but fails because the upstream step isn't plugged in yet (I believe that is the gbff processing step from here):


```
DETAIL:  Key (tx_ac)=(NM_001089591.2) is not present in table "transcript".
```
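The message above is the detail line of a foreign-key violation: a loaded row refers to a `tx_ac` that has no row in `transcript`. One way to surface this before loading is to compare the accessions parsed from the GFF against those already known. The sketch below assumes both lists are available in memory; the function name `find_missing_transcripts` is hypothetical, and fetching the known accessions from the database is left out.

```python
def find_missing_transcripts(parsed_tx_acs, known_tx_acs):
    """Return parsed accessions absent from the known set, in first-seen order.

    Hypothetical helper for illustration; it is not part of this PR.
    """
    known = set(known_tx_acs)
    seen = set()
    missing = []
    for tx_ac in parsed_tx_acs:
        # Report each missing accession once, even if it appears in many exon rows.
        if tx_ac not in known and tx_ac not in seen:
            missing.append(tx_ac)
        seen.add(tx_ac)
    return missing


# NM_001089591.2 is the accession from the error above; NM_000000.0 is a placeholder.
missing = find_missing_transcripts(
    ["NM_001089591.2", "NM_000000.0", "NM_001089591.2"],
    ["NM_000000.0"],
)
```

Once the upstream step populates `transcript`, a check like this should return an empty list for the same input.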