Open bryce-turner opened 2 years ago
This should really be based on a program that can read a GTF, maybe look if we can adopt the container we were working to sort out, it has lots of functions to parse gff and gtf files that might simplify this, otherwise you are correct we need to discover the position
https://github.com/tgen/jetstream_resources/blob/6ffe2681fef962a8ba0881842aaf1aafd9b4469b/shared_resource_creation_scripts/create_gene_model.sh#L247-L258
We grab the value in column 10, e.g. $10, but this value is not always the gene_id value. For example we have the following for canfam3.1 ensemble 98:
Instead we might be able to grab the column for gene_id and add 1. For example: