iobio / gene.iobio

Gene.iobio vue
MIT License
55 stars 11 forks source link

Genes with no transcripts #964

Closed AlistairNWard closed 10 months ago

AlistairNWard commented 10 months ago

People are still getting confused about what is happening when genes have no transcripts. Go to the demo data for gene TRA as an example. Gene.iobio will spin forever on TRA (if you click on it while it's loading RAI1, it will show 4 variants in the count and spin forever - 4 is the number of variants in RAI1). Here are some things I think we should do:

tonydisera commented 10 months ago

The gene TRA presents an interesting scenario. There is a gene in the RefSeq gff, but no transcripts. So without a transcript, we can't do any of the annotate variants analysis because transcript is required. It would take a significant amount of effort to rework annotate variants to work without a transcript.

I will fix the bug so that gene doesn't attempt to annotate the variants since there is no transcript. It will fail gracefully with a message that indicates there are no transcripts for this gene.

There is an existing issue to rework the gene model so that genes that are not in RefSeq or Gencode (or genes that don't have transcripts) and better handled in gene.iobio and other iobio apps. #839

AlistairNWard commented 10 months ago

Ok. We should include in the message that no variants are being shown because of this. The implication to the user is that there are no variants in the gene, but that isn't necessarily the case - we're just not showing any if there are no transcripts. Longer term, just displaying the variants would be good.

AlistairNWard commented 10 months ago

As to the first point, we need to make it as clear as possible that a lack of transcripts is not a bug, but a lack of information that makes gene.iobio unable to annotate the variants. I keep getting lots of messages from people about this message and I have to keep explaining to people what is going on.

tonydisera commented 10 months ago

Test with genes: TRA, WARS, IGH

Also, see how behavior and error message is slightly different for genes on alt contigs: HYDIN (RefSeq GRCh38)

AlistairNWard commented 10 months ago

@tonydisera How much effort would it be to just skip the annotation step? For example, IGH does have variants that appear in ClinVar so seeing the variants that fall into this region would still be useful without the annotations. I think you said this is a reasonable amount of work, so it might need to be broken out into its own issue for later, but just wondering.

The alternative solutions are to remove these genes from the list (I'm not a big fan of this as people will still complain - just that they can't pick their gene of interest), or suggest other genes. For example there are a million genes in the same region as IGH that do have transcripts.